Congratulations to Yan Chen, whose first PhD paper is published (open access) today in Frontiers in Big Data, titled From theory to practice: Insights and hurdles in collecting social media data for social science research. She started her PhD in 2018, the year of the Cambridge Analytica scandal, and this paper is a perspective piece that documents her challenges of data access for her PhD, which took her through eight different options before finding one that worked (enough). This was in stark contrast to Instagram data collection for her Master’s several years before, for which she used the academic tool Netlytic. The closing of APIs starting in 2018 did not make people safer, it just concentrated the data in a smaller set of (commercial) hands. This paper advocates for a stronger role for government and other regulators in ensuring access to social media for public good research.
Tag: big data (Page 1 of 3)
Delighted to have a new open source scoping review paper in New Media and Society led by PhD student Yan Chen, Using social media images as data in social science research. This paper is the result of one of her comprehensive exams and allows us to see some of the strengths and biases of this emerging practice. There is a distinctly English-language and small-data approach to these datasets, compared to ‘culturomics’ approaches emerging in conservation contexts, for instance, that use lots of machine learning approaches. There is also a chaos of approaches to ethics and copyright in the handling and use, characteristic of a method that has not yet matured and is subject to a constantly shifting context in terms of platforms and usage norms.
As decision-makers tackle the challenge of adapting Bay of Fundy dykelands to climate change, they need to understand who uses and values dykelands and salt marshes, and for what. This new paper in Ocean and Coastal Management, Comparing cultural ecosystem service delivery in dykelands and marshes using Instagram: A case of the Cornwallis (Jijuktu’kwejk) River, Nova Scotia, Canada, used four months of geocoded Instagram data to understand the cultural ecosystem service (CES) tradeoffs that might result from removing/realigning dykes and restoring salt marshes where dykelands can’t be sustained. Dykelands provide a much wider set of CES for a wider demographic than do marshes for this set of social media users. However, a big surprise is that while salt marshes were present in many photos they were not named as such; users spoke only about the dykes and dykelands behind those marshes. As such, the marsh CES in the dataset came from visitors to an impounded freshwater wetland trail which is a local attraction walkable from the downtown centre of Kentville. Many of the messages triangulate well with the 2016 online Q survey I ran with Nova Scotians about the same topic and the paper provides another nice case study as to the utility of social media data for social impact assessment. One of the really great things about this paper is that it is a real ‘lab’ output. The work was initiated as a follow-up to that 2016 study and to inform the new ResNet work when I knew Camille was going to be joining as an intern from AgroCampus Ouest. PhD student Yan collected a few months of Instagram posts for Camille to analyze with her help, postdoc Tuihedur helped with statistics, and then Yan picked it up again to write up after Camille went back to France. I’m proud of this paper and this collaborative team.
The second paper from Yan Chen’s MES thesis is now out in Society and Natural Resources, Leveraging social media to understand younger people’s perceptions and use of hydroelectric energy landscapes. It is a research note demonstrating the utility of manual coding and conceptual mapping of a year of Instagram images around two hydroelectricity sites to predict how changes might affect young residents. Unlike her first thesis paper in Landscape and Urban Planning, which carried out spatial mapping of value ‘hotspots’–a method widespread in today’s growing literature on cultural ecosystem services–this paper makes statistical links between features, activities and values conveyed through Instagram. The diagrams provide insight to the lifestyle and emotions associated with different landscape features, some changeable with hydro development or removal, and informs our new work on conservation culturomics for social impact assessment. Yan continues to drive this work as an IDPhD student. Congratulations, Yan.
Yan Chen is wrapping up a few days in Singapore for the NSF-funded Research Coordination Network (RCN) in Science, Engineering and Education for Sustainability (SEES) on Putting Sustainability into Convergence: Connecting Data, People, and Systems. This international workshop has been diverse in attendees and disciplines. Yan reflected, “The most discussed question is how people from different disciplines can collaborate. There are many scholars like me, as social scientists who are using sophisticated data analysis models; while others are engineers working on social issues. We both, at a certain degree, struggle in ‘cultural shocks’ between different disciplines.” It’s been a great opportunity for her to workshop with similarly cross-cutting folks. She described her session as discussing, “data sources, sizes, validity, sharing, proxies, and so on. …. [agreeing] that data or method cannot develop only on the technologies, but has to answer certain questions. For social scientists, finding a good mechanism of data sharing or archiving may be very useful. Also, how to cope with the rapidly developing technologies will be another challenge for us.” Thanks to SSHRC for supporting Yan’s trip, via Mike Smit’s Insight Grant, on which I’m a CI, Assessing the social impacts of hydroelectricity-driven landscape changing using text, images and archives: a Big Data approach.