Sex disaggregation of social media posts

Pulse Lab New York

Project Description

The plight of women and men differ in many ways and one step towards understanding those differences is sex-disaggregation of available data. Global Pulse collaborated with Data2X and the University of Leiden to develop and prototype a tool to infer the sex of users. The tool automates the process of looking up public information from Twitter profiles, in particular the username and profile picture. Using open source software, the tool analyses usernames from a built-in database of predefined names (from sources such as official statistics) that contain gender information. Username alone may sometimes not be enough to discern sex, in which case the tool analyses profile photos, using face recognition software.

Global Pulse used the sex-disaggregation tool to improve an existing real-time online dashboard showing the volume of tweets around priority topics related to sustainable development. The tool was tested on more than 50 million Twitter accounts and you can view the results of applying the tool at http://post2015.unglobalpulse.net/.

By embarking on the development of this prototype tool, and testing it on Global Pulse’s dashboard of global development tweets, the tool shows several early examples of the insights that can be gleaned about the differences between how men and women discuss global development on social media.

For more, see the brief:

Did you find this project interesting? Share it with your networks!

Share on facebook
Share on twitter
Share on linkedin
Share on email

Below you can find our latest examples of our collaborative research, prototypes and experiments, where we analyse digital data to advance global development, support humanitarian action, and promote peace. For more, go to the research projects page.

Scroll to Top