We’ve reached the final annotation in our series on Social Media Data Collection, Processing, and Use in Research, Marketing, and Political Communication. Toward the end of the project my research drifted from traditional academic sources to investigative journalism. We now veer further off track into blog posts and GitHub repos, along with some videos, a course syllabus on “Data Science for Social Systems,” and tools, documentation, and related sources that don’t fit neatly into any particular box. This isn’t so much an annotation as a grab bag of annotated links. I apologize in advance.
My bibliography for IS452 has officially become a network of small pieces loosely joined. But there’s fodder here for additional research, and a starting point not just for programming tasks but also for exploring the social and political context of social media analytics as currently practiced.
I hope you find this interesting and possibly useful.
Facebook. “The Graph API.” https://developers.facebook.com/docs/graph-api
Where would we be without the Facebook Graph API? Probably still in the Paris Climate Accord. Anyway, this was the source of the data mined by the folks at Cambridge Analytica through the Facebook app created by Aleksandr Kogan, which harvested data from as many as 87 million Facebook profiles to build a dataset for sentiment analysis and psychographic messaging strategies. Facebook has since locked it down to some extent, but it’s still pretty useful for social network analysis and opinion mining.
Klipfolio. “Using Facebook’s Graph API Explorer to Retrieve Insights Data.” Klipfolio.Com, 11 Apr. 2014, https://www.klipfolio.com/blog/facebook-graph-api-explorer.
This short tutorial is written for non-programmers and those unfamiliar with APIs. It provides step-by-step instructions for accessing and using the Graph API Explorer, setting up an access token, and retrieving insights from Facebook pages. Klipfolio is a commercial vendor that provides proprietary dashboard solutions for data analytics, and this blog post is couched in terms of feeding Facebook data into their product. It may still be useful to those who need a very basic introduction to the Facebook API and API Explorer.
Ranjan, Ravi. “How to Use Facebook Graph API and Extract Data Using Python?” Towards Data Science, 2016, https://towardsdatascience.com/how-to-use-facebook-graph-api-and-extract-data-using-python-1839e19d6999.
A data scientist explains how to extract data from the Facebook Graph API using Python. Ranjan walks through the process of getting an access token, which is required for making API calls. He references Graph API version 2.7, whereas the current version is 3.0, but the programming patterns are the same. (The Graph API Reference https://developers.facebook.com/docs/graph-api/reference/v2.7/ provides documentation for the current and past versions.) The guidance will be useful for anyone with some Python experience who is just beginning to explore what data can be mined from Facebook.
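The request pattern described above, a node, a list of fields, and an access token, can be sketched in a few lines of Python. This is a minimal illustration rather than Ranjan's own code: the helper name `build_graph_url` is hypothetical, `YOUR_ACCESS_TOKEN` is a placeholder, and the sketch only assembles the URL without calling Facebook.

```python
from urllib.parse import urlencode

# Base host for Graph API requests.
GRAPH_HOST = "https://graph.facebook.com"


def build_graph_url(node, fields, access_token, version="v2.7"):
    """Return a Graph API request URL for one node (hypothetical helper).

    node         -- an object ID or the special node "me"
    fields       -- list of field names to request
    access_token -- token obtained from the Graph API Explorer (placeholder here)
    """
    query = urlencode({
        "fields": ",".join(fields),      # fields are sent as one comma-separated value
        "access_token": access_token,
    })
    return f"{GRAPH_HOST}/{version}/{node}?{query}"


# Placeholder token: substitute a real one before sending the request.
url = build_graph_url("me", ["id", "name"], "YOUR_ACCESS_TOKEN")
print(url)
```

Sending the resulting URL with any HTTP client should return JSON, provided the token is valid and carries the permissions the requested fields require.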
Ferrara, Emilio. “Data Science for Social Systems.” http://www.emilio.ferrara.name/i400-590-mining-the-social-web/. Accessed 10 May 2018.
This site is a comprehensive syllabus by Prof. Emilio Ferrara, Research Assistant Professor in the Department of Computer Science at the University of Southern California, covering “how to unleash the full power and potential of the Social Web for research and business application purposes!” Topics include machine learning, Natural Language Processing, sentiment analysis, topic modeling, network visualization, and recommender systems, among many other areas of social media processing and analysis. The course has a strong Python orientation.
Shaik, Afiz. “Facebook Data Analysis Using Python: Explore GraphAPI Part 2.” 2018. YouTube, https://www.youtube.com/watch?v=o1qeNwoLh68.
Shaik walks through the process of using Jupyter Notebook and Python 3 to mine and process Facebook Graph API data. After setting up the development environment using Anaconda, he explains the use of Facebook access tokens and API queries. He then demonstrates how to work with the Graph API Explorer to pull specific data in JSON format. This brief tutorial may be useful for those who prefer learning from video sources.
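Once the JSON comes back, the processing step is mostly dictionary and list handling. Here is a rough sketch, not taken from the video: the sample payload is invented for illustration, though it follows the general `data` and `paging` layout the Graph API uses for lists of objects, and `extract_messages` is a hypothetical helper name.

```python
import json

# Invented sample payload, shaped like a Graph API list response.
sample_response = """
{
  "data": [
    {"id": "1", "message": "First example post"},
    {"id": "2", "message": "Second example post"},
    {"id": "3"}
  ],
  "paging": {"next": "https://example.invalid/next-page"}
}
"""


def extract_messages(payload):
    """Return (id, message) pairs; posts without a message get an empty string."""
    return [(item["id"], item.get("message", "")) for item in payload.get("data", [])]


payload = json.loads(sample_response)
messages = extract_messages(payload)

# The "next" link, when present, points at the following page of results.
next_page = payload.get("paging", {}).get("next")

for post_id, text in messages:
    print(post_id, text)
```

Using `.get()` with a default keeps the loop from failing on items that lack a field, which is common because the API omits fields that are empty.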
Spring. “Accessing Facebook Data.” https://spring.io/guides/gs/accessing-facebook/. Accessed 20 Apr. 2018.
Spring.io presents a “getting started” guide to the process of creating a web application to access Facebook data using Java. The guide walks through the requirements and the steps needed to develop working code. This resource will be more useful for those with at least intermediate programming experience, especially in Java.
bigdataenthusiast. “Mining Facebook Data Using R & Facebook API!” Data Enthusiast, Mar. 19, 2016. https://bigdataenthusiast.wordpress.com/2016/03/19/mining-facebook-data-using-r-facebook-api/.
This blog post by an enthusiastic programmer shows how to extract Facebook API data using the R programming language and the Rfacebook package. The author provides a detailed, step-by-step guide using screenshots and code examples. Even with recent changes to the Facebook Graph API, the author’s basic approach should still be valid.
Conkwright, William. “How to Get Public Data from Facebook with PHP.” Will Conkwright, June 14, 2017. https://www.willconkwright.com/how-to-get-public-data-from-facebook-with-php/.
Conkwright provides a guide to accessing Facebook API data using PHP, with an example of getting a “talking about” count for locations around Raleigh, North Carolina. First he shows how to use the Facebook API explorer to generate queries. He explains the specifics of the query string with screenshots, and breaks down the query URL to show the parameters. He then shows how to retrieve Facebook data using a custom PHP function, and provides a link to a gist of the PHP code snippet on GitHub.
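The same kind of URL breakdown can be done programmatically. The sketch below uses Python rather than Conkwright's PHP, and the URL is a made-up example rather than the query from his post: `EXAMPLE_PAGE_ID` and `YOUR_ACCESS_TOKEN` are placeholders, and the API version is assumed.

```python
from urllib.parse import urlsplit, parse_qs

# Hypothetical query URL asking for a page's name and "talking about" count.
example_url = (
    "https://graph.facebook.com/v2.7/EXAMPLE_PAGE_ID"
    "?fields=name,talking_about_count&access_token=YOUR_ACCESS_TOKEN"
)

parts = urlsplit(example_url)

# The path holds the API version and the node being queried.
version, node = parts.path.strip("/").split("/", 1)

# parse_qs returns each parameter as a list of values.
params = parse_qs(parts.query)
fields = params["fields"][0].split(",")

print("host:   ", parts.netloc)
print("version:", version)
print("node:   ", node)
print("fields: ", fields)
```

Pulling the URL apart this way makes it easier to see which piece to change when a query returns an error or the wrong fields.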
Computational Linguistics Research Group. “Pattern: Web Mining Module for Python, with Tools for Scraping, Natural Language Processing, Machine Learning, Network Analysis and Visualization.” Last commit 2017. https://github.com/clips/pattern.
This resource is a repo on the GitHub account of the Computational Linguistics Research Group at the University of Antwerp. Pattern is a Python module with tools for data mining, Natural Language Processing, machine learning, and network analysis. It supports a variety of methods for extracting syntactic, semantic, and sentiment information, including n-gram search, clustering, and SVM. Pattern appears well documented and includes bundled examples. The main branch supports Python 2.7, but a Python 3 version is available in the development branch. The documentation https://www.clips.uantwerpen.be/pages/pattern includes code examples and several case studies.
GW Libraries. “Social Feed Manager.” Social Feed Manager, https://gwu-libraries.github.io/sfm-ui/. Accessed 10 May 2018.
This site from George Washington University Libraries offers code, documentation, and how-to articles related to the Social Feed Manager, an open source project that harvests social media data from a variety of sources. The project also maintains extensive documentation on readthedocs: http://sfm.readthedocs.io/en/latest/.
Routley, Nick. “The Multi-Billion Dollar Industry That Makes Its Living From Your Data.” Visual Capitalist, Apr. 14, 2018. http://www.visualcapitalist.com/personal-data-ecosystem/.
Finally, here’s a fun little guide for consumers on how big tech companies and data aggregators mine and monetize our personal information. The article serves as a reminder that Facebook is only one company in a large industry that consumes data and excretes all the products of contemporary marketing and financial management. The article covers the nature of personal digital profiles compiled by data brokers like Acxiom and Experian, and suggests ways consumers can limit the exposure of their data.
News Sources
As if it wasn’t bad enough that I’m annotating YouTube videos and GitHub repos, here are some recent works of journalism that provide a social and political framework for what the technology now affords. I’ll let the titles speak for themselves. Don’t get too depressed!
- Dewey, Caitlin. “98 Personal Data Points That Facebook Uses to Target Ads to You.” Washington Post, 19 Aug. 2016. https://www.washingtonpost.com/news/the-intersect/wp/2016/08/19/98-personal-data-points-that-facebook-uses-to-target-ads-to-you/.
- Lapowsky, Issie. “This Is How Facebook Actually Won Trump the Presidency.” WIRED, 15 Nov. 2016, https://www.wired.com/2016/11/facebook-won-trump-election-not-just-fake-news/.
- Scola, Nancy. “How Facebook, Google and Twitter ‘embeds’ Helped Trump in 2016.” POLITICO, 26 Oct. 2017, http://politi.co/2hcrCRZ.
- Halpern, Sue. “Cambridge Analytica, Facebook, and the Revelations of Open Secrets.” The New Yorker, Mar. 2018. https://www.newyorker.com/news/news-desk/cambridge-analytica-facebook-and-the-revelations-of-open-secrets.
- Lewis, Paul. “‘Utterly Horrifying’: Ex-Facebook Insider Says Covert Data Harvesting Was Routine.” The Guardian, 20 Mar. 2018. http://www.theguardian.com/news/2018/mar/20/facebook-data-cambridge-analytica-sandy-parakilas.
- Lee, Michelle Ye Hee, et al. “Cambridge Analytica Harnessed Facebook Data in Work for Super PAC Led by John Bolton, According to Former Employees.” Washington Post, 23 Mar. 2018. https://www.washingtonpost.com/politics/cambridge-analytica-harnessed-facebook-data-in-work-for-super-pac-led-by-john-bolton-according-to-former-employees/2018/03/23/d756967a-2ea3-11e8-8ad6-fbc50284fce8_story.html.
- Gassée, Jean-Louis. “Mark Zuckerberg Thinks We’re Idiots.” Monday Note, 25 Mar. 2018, https://mondaynote.com/mark-zuckerberg-thinks-were-idiots-638c64dfab12.
- Malik, Om. “The #1 Reason Facebook Won’t Ever Change.” Om on Tech, 20 Feb. 2018, https://om.co/2018/02/20/the-1-reason-facebook-wont-ever-change/.
- Editors. “Cambridge Analytica Controversy Must Spur Researchers to Update Data Ethics.” Nature, vol. 555, no. 7698, Mar. 2018, p. 559. doi:10.1038/d41586-018-03856-4.
- Mahdawi, Arwa. “Facebook: Is It Time We All Deleted Our Accounts?” The Guardian, 20 Mar. 2018. http://www.theguardian.com/technology/2018/mar/20/facebook-is-it-time-we-all-deleted-our-accounts.
- Bogost, Ian. “My Cow Game Extracted Your Facebook Data.” The Atlantic, Mar. 2018. https://www.theatlantic.com/technology/archive/2018/03/my-cow-game-extracted-your-facebook-data/556214/.
- @dylanmckaynz: “Downloaded My Facebook Data as a ZIP File Somehow It Has My Entire Call History with My Partner’s Mum a Historical Record of Every Single Co […].” Mar. 2018, https://threadreaderapp.com/thread/976368845635035138.html.
- Thompson, Ben. “The Facebook Brand.” Stratechery by Ben Thompson, 19 Mar. 2018, https://stratechery.com/2018/the-facebook-brand/.
- Chen, Adrian. “Cambridge Analytica and Our Lives Inside the Surveillance Machine.” The New Yorker. 21 Mar. 2018, https://www.newyorker.com/tech/elements/cambridge-analytica-and-our-lives-inside-the-surveillance-machine.
- Palmer, Shelly. “What Facebook Data Did They Get and What Did They Do?” Shelly Palmer, 25 Mar. 2018, https://www.shellypalmer.com/2018/03/facebook-data-get/.
- Faux, Zeke. “How Facebook Helps Shady Advertisers Pollute the Internet.” Bloomberg, 27 Mar. 2018. https://www.bloomberg.com/news/features/2018-03-27/ad-scammers-need-suckers-and-facebook-helps-find-them.
- Marvin, Ginny. “Facebook’s Removing Third-Party Targeting Data: What Marketers Need to Know.” Marketing Land, 30 Mar. 2018, https://marketingland.com/facebooks-removal-of-third-party-targeting-data-what-we-know-237260.
- @saradannerdukic: “Okay, This Is Going to Be Incredibly Dense with Resources, and Long (Just a Warning That This Ain’t Something to Dive into While You’re Wait […].” Apr. 2018, https://threadreaderapp.com/thread/985363902413398017.html.
- Coren, Michael J. “Facebook’s Crisis Demands a Reevaluation of Computer Science Itself.” Quartz, 31 Mar. 2018, https://qz.com/1240120/facebooks-crisis-demands-a-reevaluation-of-computer-science-itself/.
- Cadwalladr, Carole. “AggregateIQ: The Obscure Canadian Tech Firm and the Brexit Data Riddle.” The Guardian, 31 Mar. 2018. http://www.theguardian.com/uk-news/2018/mar/31/aggregateiq-canadian-tech-brexit-data-riddle-cambridge-analytica.
- Cadwalladr, Carole. “Revealed: Graphic Video Used by Cambridge Analytica to Influence Nigerian Election.” The Guardian, 4 Apr. 2018. http://www.theguardian.com/uk-news/2018/apr/04/cambridge-analytica-used-violent-video-to-try-to-influence-nigerian-election.
- Maguire, Robert. “EXCLUSIVE: Robert Mercer Backed a Secretive Group That Worked with Facebook, Google to Target Anti-Muslim Ads at Swing Voters.” OpenSecrets Blog, 5 Apr. 2018, https://www.opensecrets.org/news/2018/04/exclusive-robert-mercer-backed-a-secretive-group-that-worked-with-facebook-google-to-target-anti-muslim-ads-at-swing-voters/.
- Wylie, Christopher. “Christopher Wylie: Why I Broke the Facebook Data Story – and What Should Happen Now.” The Guardian, 7 Apr. 2018. http://www.theguardian.com/uk-news/2018/apr/07/christopher-wylie-why-i-broke-the-facebook-data-story-and-what-should-happen-now.
- UpGuard. “The Aggregate IQ Files, Part One: How a Political Engineering Firm Exposed Their Code Base.” 30 Apr. 2018, https://www.upguard.com/breaches/aggregate-iq-part-one.
- UpGuard. “The AggregateIQ Files, Part Two: The Brexit Connection.” 30 Apr. 2018, https://www.upguard.com/breaches/aggregate-iq-part-two-brexit.
- Siegelman, Wendy. “Cambridge Analytica Is Dead – but Its Obscure Network Is Alive and Well.” The Guardian. 5 May 2018, https://www.theguardian.com/uk-news/2018/may/05/cambridge-analytica-scl-group-new-companies-names.
- Knight. “Hewlett, Knight, Koch Foundations, with Other Funders, Will Support Independent Research on Facebook’s Role in Elections and Democracy.” Knight Foundation, 9 Apr. 2018, https://www.knightfoundation.org/press/releases/independent-research-on-facebook-role-in-elections-and-democracy.