When I started using software to analyze data as a reporter in the late 1980s, “data journalism” ended once my stories were published in the newspaper.
Now the publication of the story is just the beginning. The same data can also be turned into compelling visualizations and into news applications that people can use long after the story is published. So data journalism — which was mostly a one-person job when I started doing it — is now a team sport.
So if you’re running a news organization that wants to start doing data journalism, how should you proceed? At the recent American Society of News Editors conference in Washington, I argued that news leaders need to think about three dimensions of data journalism — and three different types of journalism skills. Here’s the graphic I showed them:
The point of the diagram is that to do data journalism well, a news organization needs to employ people with three discrete sets of skills. The numbered areas on the graphic represent people or jobs that blend more than one of these skills, and it’s certainly great when a news organization can find someone who can do more than one thing well. But typically, the skills involved in data journalism will be found in at least three different people.
The three dimensions of data journalism
Here’s what I think are the three dimensions of data journalism:
Computer-assisted reporter: These journalists typically start as news reporters and develop expertise in obtaining, analyzing and finding newsworthy angles in data. (When this discipline emerged in the late 1980s, journalists called it “computer-assisted reporting,” a term that is still commonly used.) These journalists have a “data state of mind” for reporting and know how to query (or interview) data to discover the most interesting and relevant findings. They typically use spreadsheets, database and mapping software, but many are now also acquiring programming skills so they can simplify some of their work — for instance, writing scrapers that pull down data from websites that don’t offer API’s or data downloads.
News applications developer: These are the newest data journalists to arrive in newsrooms, either moving from careers in computer programming or learning programming skills as part of their journalistic work. Their job is to figure out how best to make data available to users online. They have hands-on skills in back-end programming, database configuration and administration, as well as HTML, CSS and JavaScript so they can build the interface to their applications.
Data visualization specialist: Historically news organizations had graphic artists whose job was to create static graphics that illustrated data-driven (and other) stories. As graphic artists did when they created static graphics, data visualization specialists understand how to present data in interesting and accessible ways, and they understand the principles of good information design. But since graphics on the Web can be clickable and interactive, these journalists increasingly do front-end programming and make use of open-source Javascript libraries such as D3.
All of these disciplines have historic roots in news organizations. The computer-assisted reporter worked on the city desk, the news applications developer was probably a programmer for the company’s business operations, and the data visualization specialist was a graphic artist. As their jobs have evolved, though, they all increasingly need programming skills.
News organizations seeking to fill these roles are finding it difficult. Quite simply, there is more demand for people with these skills — both inside and outside of journalism — than there are people qualified to fill them. At the ASNE panel, “Data Journalism: The Myths and the Magic,” I argued that news organizations need to take a long-term view and “grow your own talent.” This means identifying current staff members with potential to develop these skills and recruiting promising college students who can build their skills as interns or part-time employees.
Among current staff members, potential computer-assisted reporters can be identified by their use of spreadsheet or database software in their work, their comfort with math and data, and the way they use numbers effectively in their stories. Potential news applications developers might be working in a news organization’s IT department, but can be identified by their interest in journalism, their participation in “open government” initiatives and their use of contemporary programing languages such as Ruby and Python. And potential data visualization specialists can be identified by their interest in working on data-intensive graphics, their comfort with math and data and possibly some prior experience with mapping applications or JavaScript.
The kind of people who do data journalism are, of course, the core audience for the Knight Lab blog. Do you see yourself in this graphic? Are you a computer-assisted reporter, a news applications developer or a data visualization specialist? Or maybe you fit into one of the numbered areas where these skill sets overlap? Let us know in the comments below — or on Twitter mentioning @knightlab and the hashtag “#datadimensions.”
Rich Gordon is a professor and director of digital innovation at the Medill School of Journalism. At Medill, he launched the school’s graduate program in new media journalism. He has spent most of his career exploring the areas where journalism and technology intersect. Prof. Gordon was an early adopter of desktop analytical tools (spreadsheets and databases) to analyze data for journalistic purposes. At The Miami Herald, he was among the first generation of journalists to lead online publishing efforts at newspapers. At Medill, he has developed innovative courses through which students have explored digital content and communities and developed new forms of storytelling that take advantage of the unique capabilities of interactive media. In addition to teaching and writing about digital journalism, he is director of new communities for the Northwestern Media Management Center, where he is responsible for a research initiative focusing on the impact of online communities, including social networks, on journalism and publishing.
The Knight Lab is a team of technologists, journalists, designers and educators working to advance news media innovation through exploration and experimentation. Straddling the sciences and the humanities the Lab develops projects, prototypes and innovative bits of code that help make information meaningful, and promote quality journalism, storytelling and content on the internet. The Knight Lab is a joint initiative of Northwestern University’s Robert R. McCormick School of Engineering and Applied Science and the Medill School of Journalism. The Lab was launched and is sustained by a grant from the John S. and James L. Knight Foundation, with additional support from the Robert R. McCormick Foundation and the National Science Foundation.
Very interesting publication! I fully agree that data journalism should be teamwork. My observation is that many journalists are loners however. As doctors were in the past (I am also a MD)
As mentioned, the computer-assisted reporter should be quite skilled in statistics also, in order to convey properly the statistical quality of the visualized data, an information that must be stressed. Nowadays these skills are collected under the term ‘data-mining’ and, for example, they are useful for analysis of social networks streaming.