Scraping data from Facebook groups and pages

Posted at 23 Dec 2015
Tags: facebook, scraping, nlp, python, php

I’ve written a small set of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages. It is available on github. It can be used to collect posts and comments (including their hierarchical structure and some metadata) from public groups and pages automatically. For closed groups, manually saving the HTML output and parsing it with a provided Python script is necessary.

After collecting the data, statistical analyses can be performed on it. For now, identifying and counting nouns as described in a previous blog post is implemented.

If you spotted a mistake or want to comment on this post, please contact me: post -at- mkonrad -dot- net.
← “Finding out annual music favorites from Clementine music player
View all posts
Extracting Nouns in German texts with Python using Pattern library and libleipzig” →