Faculty: Bruno Abrahao, Course Spotlight - Network Analytics
The world we live in is built upon a myriad of networks: Human society is defined by our interpersonal relationships. Organizations are structured around interconnecting roles and lines of authority between workers, colleagues, and bosses. Global information is conveyed across a world-wide web of linked content. As we have witnessed recently, epidemics spread over a social network of contacts, in the same way in which we buy products as we are influenced by our peers. In networks, local interactions among members of small communities can often propagate and further affect the outcomes of an entire system.
Assistant Professor of Information Systems and Business Analytics Bruno Abrahao studies the power of networks and their effect on business and society. A computer scientist by training, Abrahao spent years using the tools of data science for networks in his own research and in collaborations with companies, such as Airbnb and Microsoft, to find solutions to problems rooted in network abstractions. For Airbnb, he mapped out the network of trust among users – determining which factors influenced hosts’ decisions on who to accept as tenants, and guest decisions on who to select as their potential hosts. For Microsoft Research AI, he studied corporate email networks to understand fundamental questions about human relationships and to find ways to make organizations more efficient.
In his graduate-level Network Analytics course, he brings his work and academic experience to the classroom, teaching the students of the Data Analytics & Business Computing masters program every fall semester how to tackle business problems using the tools of computer science, economics, and social science to analyze networks. Professor Abrahão also teaches an undergraduate version of this course, called "Data Science for Social and Information Networks".
Last Fall, twenty-seven students joined Abrahao in learning the interdisciplinary nature of the subject, as well as the tools of network analysis - network modeling and algorithms, Python's NetworkX library, machine learning, and natural language processing - and then in applying them to pressing problems in our society.
For their final assignment, students brought all their new skills and knowledge to group projects that addressed big questions in business or in society in general, modelling them mathematically to make them precise, and running algorithms that analyze networks to illuminate them.
"What I like the most about teaching this course is the process of creating the means for a group of students from diverse backgrounds, and with different technical perspectives and worldviews, to get together and to employ problem solving strategies that transcend disciplinary boundaries," Abrahao says. The outcomes of their projects are just mind-blowing!"
For instance, Phionna Teo applied network analytics to the social phenomenon of cyberbullying. “Social media platforms are still a bit slow in using automated systems to identify abusive content. They still rely on user-generated judgements and reporting,” said Teo. “We wanted to identify the types of people who are more susceptible to cyberbullying, so that detection can be automated more efficiently.”
Teo’s final project video.
Teo’s team started with a set of 30 thousand tweets from 2013, and trained an AI classifier to identify “bullying” relationships. This was done by learning from comments pulled from another publicly available Wikipedia dataset, annotated with the goal of identifying behaviors that were labeled “bullying.” After applying the trained classifier to the data set of 30 thousand tweets, the team ended up with three thousand “bullying incidents” directed at other users.
Following the pre-processing of the tweets, they correlated the network structure around a user, i.e., account information and follower/following relationships, with the labels they previously produced to investigate what kinds of local social networks around a user make them susceptible to cyberbullying. They also used natural language processing to analyze word usage, and found correlations between the most commonly used words in bullying comments and the next most common words that follow. The result of such analyses could potentially be used by social media platforms to automatically detect misbehavior and to identify individuals susceptible to bullying. Their method has the potential to immediately improve how platforms like Twitter handle online harassment.
Yao Guanhua and Xu Zhixing wanted to see how the topics of songs and their lyrics have changed over time, and how much creativity and innovation are introduced in the music industry each year. They used web scraping to download lyrics from 140,000 English songs on Billboards from 1970 to 2000. They linked the songs, thereby forming a network, where two songs are connected if they exhibit similarity between their topics, which were previously extracted using natural language processing. Then they plotted the evolving structure of the networks divided by year and along 15 different topics to show changing trends over time. As the networks they built reveal where the industry lacks diversity, their work could offer ways for music producers to identify where creativity is needed.
Yao and Xu’s project tracked changing music trends over four decades.
Other student projects included a creative analysis of e-commerce recommendation systems, and building the Bespoke Travel Recommendation app. More project videos from students can be found here.
Abrahao and the class also made the most of being in Shanghai - visiting innovative local companies and meeting exciting researchers and entrepreneurs.
Chen Yang, a professor of computer science at Fudan University, showed students how he and his research team used machine learning applied to social networks to detect fraudulent accounts on Dianping and Momo -- China’s Yelp and OkCupid.
Students visited the new Amazon Joint Innovation Center in Shanghai’s Jing’an District, and the Shanghai Data Exchange Center -- a central hub that integrates and analyzes big data from Yangtze River Delta Region and the entire country for the purposes of business, transportation, security, and urban planning.
“We were so excited to learn how real companies are using new technology,” said Yao Guanhua. “At Amazon, we learned how they used cloud computing and big data to build intelligent systems in China. And at the Data Center, they showed us how they stored, maintained, and used massive amounts of data for marketing, entrepreneurship, and innovation.”