Online media is in a state of flux. Twitter, Facebook, blogs, so-called fake news – these are all recent developments that have radically altered the landscape of news and information online. We call this the “networked public sphere”, and the Media Cloud project was created to track and understand it. Come help us build data-centric tools for academic internet researchers and human rights activists that let them investigate coverage and conversations online about topics they care about.
The Media Cloud project is seeking a contract software engineer to help us build tools that facilitate research about the role of online media in civic discourse. We are an open source project producing research about the networked public sphere, and helping others do their own research about online media. We make available to the public our existing archive of more than 550 million stories, adding more than 40,000 new stories daily.
The contract software engineer will work on our server architecture, which collects, processes, and makes these stories available via an API. They will work under senior engineers to plan, design, build, maintain, and run all levels of the project’s platform. This includes back-end tools that collect and archive the data, researcher tools that enable analysis of that data, and occasional contributions to front-end tools that expose the data and analysis to the public. Buzzwords – big data, quantitative text analysis, machine learning, etc.
Media Cloud is a joint project between the Center for Civic Media at MIT and the Berkman Klein Center for Internet & Society at Harvard University. The position will be a 6-month contract position based at the Center for Civic Media (at the MIT Media Lab), but the engineer will work closely with members of the team from both centers. The project is funded by human rights foundations. We produce both the open platform and research that helps our funders make decisions about how best to influence online civic conversations about democracy, activism, and health. This is a grant-funded contract position that we hope to extend, or turn into a staff position.
We are a diverse project of researchers and technologists who love to wrestle with hard questions about online media by using a combination of social, computer, and data sciences. The ideal candidate will work well with all members of the team, from senior faculty to junior developers, and will thrive in an academic atmosphere that privileges constant questioning and validation at all levels of the platform and of our research products. Experience building text-based big data systems, or working as a data scientist, is helpful, as is experience working on projects investigating online media.
- B.A. degree, preferably in computer science or a data science-related field;
- at least two years of experience working as a software engineer;
- demonstrated ability to design, build, test, and deploy robust code;
- demonstrated ability to iterate quickly through prototypes;
- demonstrated ability to use data to validate architectural decisions;
- interest in working on issues related to democracy, gender, race, health, and globalization;
- passion for solving difficult engineering and data problems;
- experience writing, maintaining, and optimizing SQL queries against large databases;
- experience implementing and maintaining a production ETL pipeline;
- experience scaling platforms to handle large data sets;
- experience writing web crawlers;
- experience working with PostgreSQL and Solr / Lucene in Ubuntu environments;
- knowledge of and interest in the social sciences;
- work with senior engineers to establish the technical vision for the project;
- contribute to, and follow, a technical roadmap to meet research needs and complete grant deliverables;
- collaborate with other developers, designers, and system administrators in implementing the technical roadmap;
- communicate project status internally and externally to our community of users;
- maintain, upgrade, and build systems within a large, existing codebase to collect, archive, and analyze content from online media;
- write code to scale systems to handle ever-expanding data requirements.
Much of our substantive work focuses on issues of gender, race, and globalization. We strongly encourage women, people of color, and people of any sexual identity to apply.
The job is based in Cambridge, MA, but much of our team is distributed around the world. We are open to alternative working arrangements that include part-time residence in Cambridge.
Apply by sending a cover letter, resume, and link to your GitHub profile to email@example.com