<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:admin="http://webns.net/mvcb/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:content="http://purl.org/rss/1.0/modules/content/">

    <channel>
    
    <title>DDJ &#45; Featured Projects</title>
    <link>http://datadrivenjournalism.net/featured_projects</link>
    <description>DDJ &#45; Featured Projects</description>
    <dc:language>en</dc:language>
    <dc:creator>support@ejc.net</dc:creator>
    <dc:rights>Copyright 2012</dc:rights>
    <dc:date>2012-07-04T14:54:37+00:00</dc:date>
    <admin:generatorAgent rdf:resource="http://expressionengine.com/" />
    

    <item>
      <title>Behind the Scenes of BBC&#8217;s &#8220;Most Wanted Global Migrants&#8221; Guide</title>
      <link>http://datadrivenjournalism.net/featured_projects/Behind_the_Scenes_of_BBCs_Most_Wanted_Global_Migrants_Guide</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Behind_the_Scenes_of_BBCs_Most_Wanted_Global_Migrants_Guide#When:04:39:30Z</guid>
      <description><![CDATA[<p>
	<em>By <a href="https://twitter.com/robertobelo">Roberto Belo Rovella</a> and Camilla Costa, BBC journalists.&nbsp;</em></p>
<p>
	BBC Global News recently published a special report on Global Migrants, featuring a clickable guide in which the users can explore the 20 most wanted professions across a selection of 30 countries, including those of the OECD, BRICS and Singapore. The project, published in English and <a href="http://goo.gl/WW6eT">other 11 languages</a>, is the result of an investigation by BBC Brasil&rsquo;s journalist Camilla Costa.</p>
<p>
	This idea was pitched in the context of a training scheme, to give journalists the time and space to develop their data and visual journalism skills, working hand in hand with other editorial, design and development staff from the Online Specials Team. Camilla led the research and production effort, with Designer Nour Saab, Client Side Developer Martyn Rees and Editor Roberto Belo-Rovella.</p>
<p>
	The investigation about highly skilled migration focuses on the professions most sought after and how the countries surveyed are changing their legislation to facilitate and manage the admission of those professionals.</p>
<p>
	The visual guide invites the users to select the profession they are more curious about, to find out which countries are seeking those professionals and then more information about the country in question in the context of migration.</p>
<p>
	The database compiled confirms the ongoing demand for health professionals, especially nurses, and also the increase in the need for engineering and IT professionals all over the world. But it also shows some surprising facts, such as a demand for chefs, psychologists and even technicians such as radiographers.</p>
<h3>
	How We Went About It</h3>
<p>
	We used data about the migration policies of the OECD countries from the International Migration Outlook 2012 and other OECD reports and lists of professions in demand obtained from Migration Departments and Labour Ministries of the 30 countries explored in this graphic.</p>
<p>
	Market overviews and surveys were also used for some countries, as well as the data compiled in the European Job Mobility Portal (EURES).</p>
<p>
	The big challenge was to compile all this information in a consistent way, as the definitions of &quot;highly skilled professions&quot; and their categorizations varied country by country. In many cases we had to request specific information and clarifications directly to government offices in the surveyed countries. The footnotes and methodology story offer detailed information on how it&#39;s all worked out, and the full dataset compiled about wanted professions per country.</p>
<p>
	The research and data compilation took about six weeks. The formatting, design and build took the other six weeks, including the creation of a master version in English and the localisation of the content for 11 other languages.</p>
<p>
	For compiling and formatting the information we used spreadsheets, mainly Google Documents. Middle East Adobe CS kit was used for mocking up ideas and finalising designs. To be more specific, Illustrator was used to chart the migration flows based on the data provided by the journalists, lay out the other bits of content around it and mark it up for the final hand-over to developers.</p>
<p>
	Photoshop was used for exporting graphics and preparing assets for the developers. The graphs were supplied in SVG format. In terms of build, HTML and CSS was used for the layout and Javascript to load and display the 50 graphs and also for the interactivity.</p>
<h3>
	Looking Back</h3>
<p>
	The Most Wanted Global Migrants clickable guide and its related content were welcome by the audience, with almost one million page views on the first two days after being published on the BBC News website Business index in English. On Twitter, there were more than 1,400 messages recommending it, and the Migration Unit within the United Nations Alliance of Civilizations also promoted the guide.</p>
<p>
	It is always a challenge to strike the right balance between being fully comprehensive and presenting the facts in a way that is simple, intuitive for the non-academic or specialised user. We know that online users have very little time to spend, and if they are not immediately attracted to what you have to say online, off they go.</p>
<p>
	Therefore, once we were clear about what data was actually available to use, and we decided on the story we wanted to tell, they key was to stay focused on that story, and to put the users at its centre.</p>
<p>
	Would we do it in a different way if we had a second chance? Possibly yes, maybe simpler, maybe more playful, maybe thinking more about mobile devices and responsive design, maybe doing more on the social sphere beyond being able to share information about professions in a particular country on Facebook and Twitter. A big bag of lessons for the next project, no doubt; those lessons that you can only harvest after giving it a proper go.</p>
]]></description> 
      <dc:date>2013-05-10T04:39:30+00:00</dc:date>
    </item>

    <item>
      <title>“Is It Really Worth It?” – The Map of Italian Journalists Facing Threats</title>
      <link>http://datadrivenjournalism.net/featured_projects/is_it_really_worth_it_the_map_of_italian_journalists_facing_threats</link>
      <guid>http://datadrivenjournalism.net/featured_projects/is_it_really_worth_it_the_map_of_italian_journalists_facing_threats#When:13:44:47Z</guid>
      <description><![CDATA[<p>
	<em>By <a href="https://twitter.com/jackottaviani" target="_blank">Jacopo Ottaviani</a>, contributor at <a href="http://www.ilfattoquotidiano.it" target="_blank">Il Fatto Quotidiano</a> and the&nbsp;<a href="http://www.guardian.co.uk/data" target="_blank">Guardian Datablog</a>.</em></p>
<p style="text-align: center;">
	<img alt="logo.jpg" src="http://datadrivenjournalism.net/uploads/logo.jpg" /></p>
<p>
	More than 300 journalists faced threats in Italy in 2012. This is what emerges from the data&nbsp;journalism work by three Italian journalists and developers, Isacco Chiaf, Andrea Fama and Jacopo&nbsp;Ottaviani, who published a cross-platform project that gives a snapshot of the phenomenon.</p>
<p>
	An interactive map presents the cases recorded in 2012 across Italy, giving a top-down view of the&nbsp;phenomenon. Threats are clustered on the basis of the type: intimidation (104), legal actions (63), assaults&nbsp;(16), malicious mischief (12 cases).</p>
<p style="text-align: center;">
	<img alt="im1.jpg" src="http://datadrivenjournalism.net/uploads/im1.jpg" /><br />
	<em><span style="font-size: 12px;">A batchgeo map traces all the cases of threats that involved 324 journalists in 2012 alone.</span></em></p>
<p>
	A more detailed perspective comes from three individual cases of threat that have been represented&nbsp;through timelines including audio, video and other multimedia contents. Through these timelines it is&nbsp;possible to explore unreleased interviews, original documents underlying the investigations, features from&nbsp;the news and the media, and to share them across the social web.</p>
<p>
	Two more maps show the density of the phenomenon at the regional level, focusing on the number of&nbsp;cases of threat and of journalists involved. The regions recording the most threats are southern Sicily and&nbsp;Calabria.</p>
<p style="text-align: center;">
	<img alt="map.jpg" src="http://datadrivenjournalism.net/uploads/map.jpg" /><br />
	<em>The density maps show the number of threats recorded in the regions of Italy out of the number of journalists working in them.&nbsp;</em><em>The southern regions such as Calabria and Sicily show a major presence of the phenomenon.</em></p>
<p>
	The project is based on the data by the observatory <a href="http://www.ossigenoinformazione.it/?page_id=1347" target="_blank">Ossigeno per l&rsquo;Informazione</a> (&quot;oxygen for information&quot;)&nbsp;which has been monitoring the phenomenon of Italian journalists under threat for years now. In its annual&nbsp;reports Ossigeno sheds light on the situation, recording single stories of threats and aggregating the&nbsp;whole dataset.</p>
<h3>
	How the project was born</h3>
<p>
	The project was born during the DiGiT conference on New Media in Florence, Italy, in July 2013, from an&nbsp;idea of one of the journalists behind the project, Andrea Fama. Andrea talked about the idea of building a new narrative representation of Ossigeno&rsquo;s&nbsp;data on threatened journalists, by building a set of maps and timelines. In six months we got in&nbsp;touch with the observatory and built up the whole project, with the support of <a href="http://www.ahref.eu/en" target="_blank">ahref foundation</a>, who&nbsp;financed our work with a grant.</p>
<p>
	The team consisted of three people coming from three different backgrounds: a data journalist/computer&nbsp;scientist, an artist/designer and a journalist/editor. The three of us share the passion of storytelling and civic journalism. A rational division of work drove the collaboration. However, everyone could contribute to the other colleagues&#39; tasks.&nbsp;</p>
<h3>
	Advice for journalists</h3>
<p>
	Our suggestion for people who would like to develop similar projects is to specialise in a particular field without&nbsp;forgetting to learn a bit about other fields as well. It helps for programmers to learn how to write news and for&nbsp;journalists to learn how to code. By blurring the boundaries between different skills and backgrounds innovation in the media will&nbsp;be possible. Computer scientists, journalists, statisticians, designers and illustrators&nbsp;should learn to work together and influence each other. Data journalism builds on this blend&nbsp;of skills and experiences.</p>
<p>
	GiornalistiMinacciati.it is available both in <a href="http://www.giornalistiminacciati.it" target="_blank">Italian</a> and <a href="http://www.giornalistiminacciati.it/en/" target="_blank">English</a>. Below is the infographic that summarizes the results.</p>
<p>
	&nbsp;</p>
<p style="text-align: center;">
	<img alt="infographics.jpeg" src="http://datadrivenjournalism.net/uploads/infographics.jpeg" /></p>
]]></description> 
      <dc:date>2013-04-16T13:44:47+00:00</dc:date>
    </item>

    <item>
      <title>Migration Coverage Study: National Interests Frame Media Debate</title>
      <link>http://datadrivenjournalism.net/featured_projects/Migration_Coverage_Study_National_Interests_Frame_Media_Debate</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Migration_Coverage_Study_National_Interests_Frame_Media_Debate#When:15:52:02Z</guid>
      <description><![CDATA[<p>
	&ldquo;Migration predominantly tends to be viewed through the lens of the particular interest of&nbsp;the host country. Because of this, the media does not sufficiently treat migration as an&nbsp;issue that relates to an international, cross-border framework.&rdquo; This is the conclusion of the <a href="http://mediapusher.eu/unaoc/unaoc.ejc.study.summary.2013.pdf" target="_blank">Comparative Pilot Study on Media Coverage of Migration</a>, a joint project of the European Journalism Centre (EJC) and the United Nations Alliance of Civilizations (UNAOC), conducted in collaboration with five academic and media research institutes in Europe and North America.</p>
<p style="text-align: center;">
	<img alt="Migration_pic_1.png" src="http://datadrivenjournalism.net/uploads/Migration_pic_1.png" style="height: 293px; width: 600px;" /></p>
<p style="text-align: center;">
	<em>Figure 1: Type of migration per country in percentages</em></p>
<p>
	The pilot project, which analysed and compared the content and tone of press articles about migration published in Canada, France, Germany, The Netherlands, and the U.S. around election periods, indicated that the focus of media coverage is heavily influenced by the societal problems specific to each country. The media in France, for instance, tended to concentrate on the strain migrants allegedly put on the social system, while in the U.S., the press largely reflected on the voting power of migrant groups.</p>
<p style="text-align: center;">
	<img alt="Migration_2.png" src="http://datadrivenjournalism.net/uploads/Migration_2.png" style="height: 386px; width: 600px;" /></p>
<p style="text-align: center;">
	<em>Figure 2: Migration type and topic co-occurrence</em></p>
<p>
	An analysis of the co-occurrence of migration types and migration-related topics in the press also indicated the media tend to correlate irregular migrants with law and policy topics, and minorities with issues of citizenship and political participation. These two clusters, in turn, are strongly connected with themes of culture and religion, indicating a trans-national preoccupation with questions of security, identity, and new, ethnic epicentres of political power.</p>
<p>
	Finally, a study of the articles&rsquo; tone revealed an attempt on the media&rsquo;s part to maintain a neutral perspective towards migrants, an attempt that was at times offset by a tendency to cite one-sided studies or offensive political declarations, without putting them in perspective or counter-balancing them with other sources.</p>
<h3>
	About the pilot study</h3>
<p>
	For this study, the EJC partnered with the University of King&#39;s College (Canada), Institut National de l&#39;Audiovisuel (France), Deutsche Welle Akademie (Germany), Christelijke Hogeschool Ede (The Netherlands), and the University of Missouri (United States). The research team took snapshots of press reports on the subject of migration, analysed and coded them according to the related migration topic discussed and the overall tone of the report. All in all, 650 written press articles were collected in the five countries within a four-week time frame surrounding national and regional election periods.</p>
<p>
	The findings of the study were presented at the 2013 UNAOC Global Forum in Vienna, on 28 February 2013. You can watch a video recording of the panel discussion <a href="http://webtv.un.org/meetings-events/conferencessummits/un-alliance-of-civilizations-5th-global-forum-27-28-february-2013-vienna/">here</a>.</p>
]]></description> 
      <dc:date>2013-03-28T15:52:02+00:00</dc:date>
    </item>

    <item>
      <title>Moldova Launches Website Dedicated to Open Budget Data and Public Spending Stories</title>
      <link>http://datadrivenjournalism.net/featured_projects/Moldova_Launches_Website_Dedicated_to_Public_Spending_Stories</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Moldova_Launches_Website_Dedicated_to_Public_Spending_Stories#When:08:25:04Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by <a href="https://twitter.com/victoriavladd">Victoria Vlad</a> from <a href="http://www.budgetstories.md/">BudgetStories.md</a> on the <a href="http://openspending.org/">Spending Blog</a>&nbsp;under a Creative Commons Attribution licence.</em></p>
<p style="text-align: center;">
	<img alt="cit-ne-costa-parlamentul1.jpg" src="http://datadrivenjournalism.net/uploads/cit-ne-costa-parlamentul1.jpg" style="height: 311px; width: 500px;" /></p>
<p>
	Yesterday&nbsp;<a href="http://www.expert-grup.org/">Expert-Grup</a>, an independent think tank based in Chișinău, Moldova, launched <a href="http://www.budgetstories.md/">BudgetStories.md</a>. BudgetStories.md is an open budget website, which includes infographics, budget data visualizations and analysis of the use of public money in Moldova across sectors such as: the public administration, agriculture, education and health. In recent years Moldovan government has become more transparent regarding budget data as well as other types of data. The Ministry of Finance used the World Bank&rsquo;s <a href="http://web.worldbank.org/WBSITE/EXTERNAL/TOPICS/EXTPUBLICSECTORANDGOVERNANCE/0,,contentMDK:23150652~pagePK:148956~piPK:216618~theSitePK:286305,00.html">BOOST tool</a>, to release detailed and disaggregated data on public expenditures.</p>
<p>
	Since mid 2012, we&rsquo;ve however worked to make sense and meaning in the huge sets of data. In November 2012, while finalizing the project concept, members of the organising team took part in the <a href="http://openspending.org/blog/2012/11/26/Sarajevo-Workshop-Writeup.html">Balkan Budget Workshop</a> organised by the OpenSpending team. We decided to use OpenSpending as a tool for visualizing Moldova&#39;s <a href="http://www.budgetstories.md/bugetul-2013/">2013 budget</a>.</p>
<p>
	Until now civil society (NGOs, journalists and universities) has shown little knowledge or interest in the existence of open data. Our target groups are mainly journalists and policy makers, who will now with this site for the first time have access to &ldquo;cleaned data sets&rdquo;. We are therefore hoping that they will republish and reuse the analysis and visualizations, which could trigger increased public attention to inefficiencies identified in government spending. Also, we&rsquo;d like to expand the network of stakeholders who use the budget data and disseminate information about how the Government of Moldova spends public money.</p>
<p>
	It is our hope that this project will create a better understanding among citizens and active members of the society about the way the public finance system operates and the way it influences their everyday lives. If we could reach such increased understanding this could ultimately lead to greater contributions from society to the budget process and more efficient spending of public money.</p>
<p>
	In the future we plan to add interactive modules such as a real time budget calendar and a tax calculator. You can find out more information about BudgetStories.md on <a href="https://www.facebook.com/pages/Budget-Stories/572468439448024?sid=0.5174039560370147">Facebook</a> and <a href="https://twitter.com/BudgetStories">Twitter</a>. BudgetStories.md is supported by Soros Foundation Moldova and Open Society Foundations.</p>
]]></description> 
      <dc:date>2013-03-01T08:25:04+00:00</dc:date>
    </item>

    <item>
      <title>Real&#45;Time Data Journalism &#45; Hurricane Sandy</title>
      <link>http://datadrivenjournalism.net/featured_projects/real_time_data_journalism_-_Hurricane_Sandy</link>
      <guid>http://datadrivenjournalism.net/featured_projects/real_time_data_journalism_-_Hurricane_Sandy#When:09:32:20Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by <a href="http://@jkeefe">John Keefe</a> on&nbsp;<a href="http://johnkeefe.net/">JohnKeefe.Net</a>&nbsp;under a&nbsp;</em><em>Creative Commons license</em><em>.</em></p>
<p>
	&nbsp;</p>
<p>
	While preparing for the real-time challenge of Election night, the <a href="http://datanews.wnyc.org/">WNYC Data News Team</a> -- and the entire city -- turned its attention to an oncoming storm.</p>
<p>
	For our Hurricane Sandy coverage, we quickly built and maintained several data projects to help convey information people needed. All used open, public data and several were updated regularly -- either automatically or by hand.</p>
<p>
	<iframe frameborder="0" height="700" scrolling="no" src="http://project.wnyc.org/news-maps/hurricane-zones/hurricane-zones.html" width="100%"></iframe></p>
<p>
	Our projects included:</p>
<ul>
	<li>
		The evacuation map above, built using public shapefiles from New York City&#39;s Department of Emergency Management.</li>
	<li>
		A <a href="http://project.wnyc.org/storm-surge/">storm-surge map</a> for the entire New Jersey and New York coastlines, stitched together from a variety of U.S. Army Corps of Engineers shapefiles.</li>
	<li>
		A <a href="http://project.wnyc.org/hurricane-tracker-sandy/index.html">Hurricane Tracker</a> to watch the storm&#39;s forecast track and its radar echo, fed by four real-time feeds from the National Weather Service.</li>
	<li>
		A <a href="http://project.wnyc.org/transit-tracker/embed.html">Transit Tracker</a> with the latest information about several public transportaiton systems, driven by a Google spreadsheet updated by a half-dozen producers and reporters from transportation agency tweets, websites and public announcements.</li>
	<li>
		A live <a href="http://project.wnyc.org/flood-monitoring/embed.html">flood-gauge map</a> showing where the water was rising, driven by a real-time feed from the National Weather Service.</li>
	<li>
		A <a href="http://project.wnyc.org/news-maps/traffic-map/">traffic map</a> for the back-to-work crush sans subways, fed live by the Google Maps traffic layer.</li>
	<li>
		A <a href="http://project.wnyc.org/mtatiles/embed.html">subway-restoration map</a>, updated several times a day with new maps issued by the city transit agency.</li>
</ul>
<p>
	For details on the data, follow the source links on each project.</p>
<p>
	With more time, we would have worked more on the aethetics. But time wasn&#39;t something we had much of, so we did our best to be accurate and clear given the resources available.</p>
<p>
	Read more about the WNYC Data News team&#39;s <a href="http://datanews.tumblr.com/post/35008200131/predicting-questions-building-answers">thinking behind our Sandy coverage</a>.</p>
]]></description> 
      <dc:date>2012-12-03T09:32:20+00:00</dc:date>
    </item>

    <item>
      <title>Il Fondo Al Mar: Data Visualisation and Suspicious Sinkings</title>
      <link>http://datadrivenjournalism.net/featured_projects/il_fondo_al_mar_data_visualisation_and_suspicious_sinkings</link>
      <guid>http://datadrivenjournalism.net/featured_projects/il_fondo_al_mar_data_visualisation_and_suspicious_sinkings#When:23:24:59Z</guid>
      <description><![CDATA[<p>
	<a href="http://www.infondoalmar.info/index.php?lang=en">In fondo al mar</a> (Under the sea) is a data mapping/information visualisation project depicting toxic waste dumping in the Mediterranean Sea. Since its conception in 2009, the project has received attention from Italian and Euro-Mediterranean media, as well as the international scientific and academic press. Italian authorities also took notice, particularly the environmental police and the Ministry of the Interior, but, according to the site&rsquo;s creators, have yet to act on the information.</p>
<p>
	In late 2009, journalist <a href="http://www.linkedin.com/pub/paolo-gerbaudo/1a/83a/1b6">Paolo Gerbaudo</a> was working on an investigative piece when he noticed clear patterns in sinking incidents involving ships suspected of carrying hazardous waste, including radioactive material. Not satisfied with just writing about these &lsquo;accidents&rsquo;, Gerbaudo contacted MIT researcher&nbsp;<a href="http://www.linkedin.com/profile/view?id=8327553&amp;authType=NAME_SEARCH&amp;authToken=8Z3i&amp;locale=en_US&amp;srchid=74492eaf-4a06-4afb-b0e9-2debab9eb23e-0&amp;srchindex=2&amp;srchtotal=75&amp;goback=%2Efps_PBCK_*1_David_Boardman_*1_*1_*1_*1_*2_*1_Y_*1_*1_*1_false_1_R_*1_*51_*1_*51_true_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2_*2&amp;pvs=ps&amp;trk=pp_profile_name_link">David Boardman</a>, an expert in data mapping, and, working together from different continents, got the project off the ground a month and a half later.</p>
<p>
	&ldquo;We decided to use information visualisation because the linear format of both written journalism and video journalism were, in our opinion, not effective in conveying the scale of the phenomenon.&rdquo;</p>
<p style="text-align: center; ">
	<img alt="" src="http://farm9.staticflickr.com/8194/8121456039_9350aaba11.jpg" style="height: 259px; width: 500px; " /></p>
<p style="text-align: center; ">
	<em>Il fondo al mar - map</em></p>
<p>
	Over the last 30 years dozens of commercial vessels have sunk in mysterious circumstances across the Mediterranean, from Spain to Syria, and particularly around Southern Italy. Many suspect the involvement of organised crime syndicates, shady entrepreneurs and even governments in the incidents, generating huge profits in the process. To map these incidents Gerbaudo and Boardman mined the archives of the <a href="http://www.lr.org/default.aspx">Lloyds Register of Shipping in London</a> and collected information from press clippings, specialist websites and reports compiled by environmental organisations.</p>
<p>
	&ldquo;Looking at that archive, and speaking with people working at the register it seemed very clear that there was something deeply wrong in the frequency with which ships sank around the coasts of Italy in the 80s and 90s.&rdquo;</p>
<p style="text-align: center; ">
	<img alt="" src="http://calabria.indymedia.org/attachments/nov2009/nave_veleni.jpg" style="width: 400px; height: 266px; " /></p>
<p style="text-align: center; ">
	<em>The Rosso,&nbsp;an Italian container ship that ran suspiciously aground in 1990.&nbsp;Image courtesy of&nbsp;<a href="http://www.legambiente.it/">Legambiente</a>.</em></p>
<p>
	Once launched, <em>Il fondo al mar</em> became a participatory project with a strong emphasis on crowdsourcing. Gerbado and Boardman say they have received &ldquo;tens of emails informing us about other incidents or giving us more information about the ones we had already included in the map,&rdquo; even corrections to errors in the data. This highlights not only the public interest in the project, but also the level of expert attention it receives. Il fondo al mar also garnered interested among environmental and humanities groups, appearing at international conferences such as MIT&rsquo;s Digital + Humanities Conference, Ars Electronica and the International Journalism Festival.</p>
]]></description> 
      <dc:date>2012-10-25T23:24:59+00:00</dc:date>
    </item>

    <item>
      <title>NYPD Stop and Frisk Data for You</title>
      <link>http://datadrivenjournalism.net/featured_projects/nypd_stop_and_frisk_data_for_you</link>
      <guid>http://datadrivenjournalism.net/featured_projects/nypd_stop_and_frisk_data_for_you#When:15:57:14Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by J</em><em>ohn Keefe (</em><em>New York Public Radio)</em><em>&nbsp;</em><em>on&nbsp;</em><em><a href="http://johnkeefe.net/">johnkeefe.net</a>&nbsp;</em><em>on 19 July 2012 </em><em>under a <a href="http://creativecommons.org/licenses/by-nc/3.0/">Creative Commons Attribution-NonCommercial</a> license.</em></p>
<p>
	&nbsp;</p>
<p>
	Two weeks ago, we <a href="http://www.wnyc.org/articles/wnyc-news/2012/jul/16/wnyc-map-police-find-guns-where-they-stop-and-frisk-less/">published</a>&nbsp;a map showing total NYPD (New York City Police Department) stop and frisks by block together with locations where guns were discovered during such stops.&nbsp;In the tradition of <a href="http://www.poynter.org/how-tos/digital-strategies/150243/6-reasons-journalists-should-show-your-work-while-learning-creating/">showing our work</a>, here&#39;s some information about how we built it -- and data you can download and explore yourself.</p>
<p>
	&nbsp;</p>
<p>
	<iframe frameborder="0" height="650" scrolling="no" src="http://project.wnyc.org/stop-frisk-guns" width="100%"></iframe></p>
<h3>
	The Data</h3>
<p>
	The major bumps I hit working with the NYPD&#39;s <a href="http://www.nyc.gov/html/nypd/html/analysis_and_planning/stop_question_and_frisk_report.shtml">Stop, Question and Frisk data sets</a> were 1) they&#39;re in a format I don&#39;t know, and 2) the geographic locations aren&#39;t in latitudes and longitudes.</p>
<p>
	For bump #1, I used the free <a href="http://www.r-project.org/">statistical program &quot;R&quot;</a> to convert the NYPD&#39;s &quot;.por&quot; files into something I could use. R is also great at handling big data sets, and easily managed the 685,724 stops in the 2011 file.</p>
<p>
	For bump #2, I noticed that each stop had data fields called &quot;XCOORD&quot; and &quot;YCOORD.&quot; A couple of tests confirmed that those values described the stop&#39;s position on the New York-Long Island <a href="http://en.wikipedia.org/wiki/State_Plane_Coordinate_System">State Plane Coordinate System </a>-- something I&#39;ve seen in a lot of city data. So I used the free geographic software <a href="http://qgis.org/">QGIS </a>to load in the data and convert (technically, <a href="http://qgis.spatialthoughts.com/2012/04/tutorial-working-with-projections-in.html">reproject</a>) those coordinates into latitudes and longitudes.</p>
<p>
	And now you can have the data I used to make the map. Just click to download:</p>
<ul>
	<li>
		<a href="http://stopfrisk2011_databundle_sans_allstops.zip">stopfrisk2011_databundle_sans_allstops.zip</a></li>
</ul>
<p>
	(4.3MB download, unzips to 12MB)<br />
	Contains a shapefile of all NYC blocks with the total stop-and-frisks calculated for each block, a shapefile with the points for all stops where guns were found, raw data on each of the 768 stops where guns were found and notes about each data set. <a href="http://project.wnyc.org/stop-frisk-guns/datastore/NOTES.txt">Here&#39;s more detail</a> on the contents.</p>
<ul>
	<li>
		<a href="http://stopfrisk2011_databundle_with_allstops.zip">stopfrisk2011_databundle_with_allstops.zip</a></li>
</ul>
<p>
	(51MB download, unzips to 500MB)<br />
	This file has of the above <em><strong>and </strong></em>a .csv with the raw data for all 685,724 stops in 2011. While it&#39;s in a more common format than what the NYPD provides, it&#39;s too big to open in Excel and maxes out the limits for Google Fusion Tables. So you&#39;ll need a stats program like <a href="http://www.r-project.org/">R </a>or some database know-how to handle it.</p>
<p>
	<strong>The Map</strong></p>
<p>
	I built the map using <a href="http://mapbox.com/tilemill/">TileMill</a> from <a href="http://mapbox.com/">Mapbox</a>, which I&#39;ve been playing with for some months now.</p>
<p>
	While it&#39;s tricker than <a href="http://johnkeefe.net/not-just-big-data-fusion-tables-for-little-ma">generating quick maps</a> from Fusion Tables, if you&#39;re patient and spend some time with it, you can make some pretty <a href="http://mapbox.com/reinventgreen/">gorgeous maps</a>.</p>
<p>
	Besides providing wonderful control over styles and colors, TileMill solves an important problem: New York City has roughly 38,500 census blocks -- and loading the data to draw them all onto a Google map will anger any browser. With TileMill, you bake the data into individual image tiles, which get served up to the user as they zoom and pan.</p>
<p>
	To cover the area of NYC and provide 8 levels of zoom, I pre-cooked 59,095 tiles. But once they&#39;re uploaded to the MapBox server, which took about 15 minutes, they load almost instantly.</p>
<p>
	<em>As always, I welcome comments and questions below or at john (at) johnkeefe.net.</em></p>
]]></description> 
      <dc:date>2012-08-02T15:57:14+00:00</dc:date>
    </item>

    <item>
      <title>Case Study: The Eurozone Meltdown</title>
      <link>http://datadrivenjournalism.net/featured_projects/Case_Study_The_Eurozone_Meltdown</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Case_Study_The_Eurozone_Meltdown#When:15:04:30Z</guid>
      <description><![CDATA[<p>
	<em>This post by&nbsp;</em><em><a href="http://www.sarahslobin.com/">Sarah Slobin</a> (Wall Street Journal)</em><em>, is an excerpt from the <a href="http://www.datajournalismhandbook.org/">Data Journalism Handbook</a> (chapter 3: Case Studies), freely available online under a <a href="http://creativecommons.org/licenses/by-sa/3.0/">Creative Commons Attribution-ShareAlike</a> license</em>.</p>
<p>
	&nbsp;</p>
<p>
	So we&rsquo;re covering the <a href="http://online.wsj.com/article/SB10001424052970204879004577111212152742988.html?mod=wsj_share_twitter#articleTabs=interactive">Eurozone meltdown</a>. Every bit of it.&nbsp;The drama as governments clash and life savings are lost; the reaction from world&nbsp;leaders, austerity measures, and protests against austerity measures. Every day in the&nbsp;Wall Street Journal, there are charts on jobs loss, declining GDP, plunging world markets.&nbsp;It is incremental. It is numbing.&nbsp;</p>
<p>
	The Page One editors call a meeting to discuss ideas for year-end coverage and as we&nbsp;leave the meeting, I find myself wondering: what must it be like to be living through this?</p>
<p>
	Is this like 2008 when I was laid off and dark news was incessant? We talked about jobs&nbsp;and work and money every night at dinner, nearly forgetting how it might upset my&nbsp;daughter. And weekends, they were the worst. I tried to deny the fear that seemed to&nbsp;have a permanent grip at the back of my neck and the anxiety tightening my rib cage.&nbsp;Is this what was it like right now to be a family in Greece? In Spain?</p>
<p>
	I turned back and followed Mike Allen, the Page One editor, into his office and pitched&nbsp;the idea of telling the crisis through families in the Eurozone by looking first at the data,&nbsp;finding demographic profiles to understand what made up a family and then surfacing&nbsp;that along with pictures and interviews&sbquo; audio of the generations. We&rsquo;d use beautiful&nbsp;portraiture, the voices&mdash;and the data.&nbsp;</p>
<p>
	Back at my desk, I wrote a pr&eacute;cis and drew a logo.</p>
<p style="text-align: center; ">
	<img alt="Image_1.png" src="http://datadrivenjournalism.net/uploads/Image_1.png" /><br />
	<em>The Eurozone Meltdown: precis (Wall Street Journal)</em></p>
<p>
	For the next three weeks I chased numbers: metrics on marriage, mortality, family size,&nbsp;and health spending. I read up on living arrangements and divorce rates, looked at&nbsp;surveys on well-being and savings rates. I browsed national statistics divisions, called&nbsp;the UN population bureau, the IMF, Eurostat, and the OECD until I found an economist&nbsp;who had spent his career tracking families. He led me to a scholar on family composition.&nbsp;She pointed me to white papers on my topic.</p>
<p>
	With my editor, Sam Enriquez, we narrowed down the countries. We gathered a team&nbsp;to discuss the visual approach and which reporters could deliver words, audio and story. Matt Craig, the Page One photo editor, set to work finding the shooters. Matt&nbsp;Murray, the Deputy Managing Editor for world coverage, sent a memo to the bureau&nbsp;chiefs requesting help from the reporters. (This was crucial: sign-off from the top).</p>
<p>
	But first the data. Mornings I&rsquo;d export data into spreadsheets and make charts to see&nbsp;trends: savings shrinking, pensions disappearing, mothers returning to work, health&nbsp;spending, along with government debt and unemployment. Afternoons I&rsquo;d look at&nbsp;those data in clusters, putting the countries against each other to find stories.&nbsp;</p>
<p>
	I did this for a week before I got lost in the weeds and started to doubt myself. Maybe&nbsp;this was the wrong approach. Maybe it wasn&rsquo;t about countries, but it was about fathers&nbsp;and mothers, and children and grandparents. The data grew.&nbsp;And shrank. Sometimes I spent hours gathering information only to find out that it told&nbsp;me, well, nothing. That I had dug up the entirely wrong set of numbers. Sometimes the&nbsp;data were just too old.</p>
<p style="text-align: center; ">
	<img alt="Image_2.png" src="http://datadrivenjournalism.net/uploads/Image_2.png" /><br />
	<em>Judging the usefulness of a dataset can be a very time-consuming task (Sarah Slobin)</em></p>
<p>
	And then the data grew again as I realized I still had questions, and I didn&rsquo;t understand&nbsp;the families.&nbsp;I needed to see it, to shape it. So I made a quick series of graphics in Illustrator, and&nbsp;began to arrange and edit them.&nbsp;As the charts emerged, so did a cohesive picture of the families.</p>
<p style="text-align: center; ">
	<img alt="Image_3.png" src="http://datadrivenjournalism.net/uploads/Image_3.png" /><br />
	<em>Graphic visualization: making sense of trends and patterns hidden in the datasets (Sarah&nbsp;Slobin)</em></p>
<p style="text-align: center; ">
	<img alt="Image_4.png" src="http://datadrivenjournalism.net/uploads/Image_4.png" /><br />
	<em>Numbers are people: the value of data lies in the individual stories they represent (Wall&nbsp;Street Journal)</em></p>
<p>
	We launched. I called each reporter. I sent them the charts, the broad pitch and an&nbsp;open invitation to find stories that they felt were meaningful, that would bring the crisis&nbsp;closer to our readers. We needed a small family in Amsterdam, and larger ones in Spain&nbsp;and Italy. We wanted to hear from multiple generations to see how personal history&nbsp;shaped responses.&nbsp;</p>
<p>
	From here on in, I would be up early to check my email to be mindful of the time-zone&nbsp;gap. The reporters came back with lovely subjects, summaries, and surprises that I&nbsp;hadn&rsquo;t anticipated.&nbsp;</p>
<p>
	For photography, we knew we wanted portraits of the generations. Matt&rsquo;s vision was&nbsp;to have his photographers follow each family member through a day in their lives. He&nbsp;chose visual journalists who had covered the world, covered news and even covered &nbsp;war. Matt wanted each shoot to end at the dinner table. Sam suggested we include the&nbsp;menus.</p>
<p>
	From here it was a question of waiting to see what story the photos told. Waiting to&nbsp;see what the families said. We designed the look of the interactive. I stole a palette from&nbsp;a Tintin novel, we worked through the interaction. And when it was all together and&nbsp;we had storyboards, we added back in some (not much but some) of the original charts.&nbsp;Just enough to punctuate each story, just enough to harden the themes. The data became&nbsp;a pause in the story, a way to switch gears.</p>
<p style="text-align: center; ">
	<img alt="Image_5.png" src="http://datadrivenjournalism.net/uploads/Image_5.png" /><br />
	<em>Life in the Euro Zone (Wall Street Journal)</em></p>
<p>
	In the end, the data were the people; they were the photographs and the stories. They&nbsp;were what was framing each narrative and driving the tension between the countries.&nbsp;By the time we published, right before the New Year as we were all contemplating what&nbsp;was on the horizon, I knew all the family members by name. I still wonder how they&nbsp;are now. And if this doesn&rsquo;t seem like a data project, that&rsquo;s fine by me. Because those&nbsp;moments that are documented in Life in the Eurozone&sbquo; these stories of sitting down for&nbsp;a meal and talking about work and life with your family was something we were able&nbsp;to share with our readers. Understanding the data is what made it possible.</p>
]]></description> 
      <dc:date>2012-07-24T15:04:30+00:00</dc:date>
    </item>

    <item>
      <title>Data Journalism Awards Featured Winner: Terrorists for the FBI</title>
      <link>http://datadrivenjournalism.net/featured_projects/Data_Journalism_Awards_Featured_Winner_Terrorists_for_the_FBI</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Data_Journalism_Awards_Featured_Winner_Terrorists_for_the_FBI#When:17:03:44Z</guid>
      <description><![CDATA[<p>
	<em>The winners of the 2012 edition of the Data Journalism Awards were announced on 31 May during a <a href="http://datadrivenjournalism.net/news_and_analysis/Data_Journalism_Awards_ceremony_-_livestreamed">ceremony</a>&nbsp;held at the <a href="http://www.google.com/url?q=http%3A%2F%2Fwww.news-worldsummit.org%2F&amp;sa=D&amp;sntz=1&amp;usg=AFQjCNH3VA6oocnZViiUfYYbKEy5yuXYZg">News World Summit</a> in Paris. In this series of posts we will present each of the winning projects and&nbsp;</em><em>and honorable mentions</em><em>&nbsp;to understand their relevance to the field of data journalism and provide an overview of the tools and methods used by participants.</em></p>
<p>
	<em>The winning project to be showcased here, <a href="http://www.motherjones.com/special-reports/2011/08/fbi-terrorist-informants"><strong>Terrorists for the FBI</strong></a>, achieved first place under the category Data-Driven Investigations, National/International category.&nbsp;</em><em>To see the jury comments click <a href="http://datajournalismawards.org/">here</a>.</em></p>
<p style="text-align: center; ">
	<iframe allowfullscreen="" frameborder="0" height="281" mozallowfullscreen="" src="http://player.vimeo.com/video/43964956" webkitallowfullscreen="" width="500"></iframe></p>
<p style="text-align: center; ">
	<em>Short video presenting the story behind the Terrorists for the FBI investigation</em></p>
<h3>
	Conspiracy theories &ndash; for real</h3>
<p>
	Mother Jones reporter, Trevor Aaronson found that subsequent to 9/11, the FBI built up a network of 15.000 domestic informants - many of whom were tasked with surveilling and infiltrating Muslim neighborhoods and institutions. As part of this 18-month project, Mother Jones exposed the driving force behind these terrorist conspiracy plots. They discovered that nearly half of the 508 federal terrorism convictions involved the use of informants, many of whom were incentivized by money.</p>
<p>
	Another major finding was when Nick Baumann, another Mother Jones reporter, uncovered a secret program to have American citizens detained and interrogated by despotic foreign regimes. As a result, the US government confirmed for the first time that domestic law enforcement authorities have a program in place which requests the detention and interrogation of American citizens by overseas regimes.</p>
<h3>
	Uncovering the facts and presenting the story</h3>
<p style="text-align: center; ">
	<a href="http://www.motherjones.com/special-reports/2011/08/fbi-terrorist-informants"><img alt="Screen_shot_2012-07-19_at_18.46.48.png" src="http://datadrivenjournalism.net/uploads/Screen_shot_2012-07-19_at_18.46.48.png" style="width: 400px; height: 301px; " /></a></p>
<p style="text-align: center; ">
	<em>Terrorism cases state by state</em></p>
<p>
	In order to prove that domestic informants do in fact exist, a pattern needed to be established - so a system was set in place. Aaronson spent the next few months researching more than 500 cases in order to build a database that would illustrate exactly how frequently informants were used in terrorism investigations and what role they played. The story began with the case of Gulet Mohamed, a Virginia teenager who, whilst detained in Kuwait, managed to call a New York Times reporter. Mohamed alleged that he was being held and brutally interrogated at the FBI&rsquo;s behest. The story died down when Mohamed was released, however Baumann didn&rsquo;t let go. Instead he discovered a series of similar cases, which finally confirmed a pattern.<br />
	<br />
	The investigation heavily relied on federal court records and the database was built from scratch using available court records and information from primary sources. The reporter and a research assistant spent hundreds of hours combing through case files and assembling the database. The team used MySQL and Excel for the first build; as well as Drupal for the online database.&nbsp;<br />
	<br />
	The investigation also relied on interviews with current and former FBI agents and the use of internal FBI files. In addition, lawyers who defended accused terrorists provided information about their cases. In many cases, the court records were available online. But in other cases, Aaronson had to obtain the records from individual courthouses nationwide or one of the national archive warehouses.&nbsp;Another obstacle was the fact that the federal government does not promote its use of informants. Thus the only way to determine whether an informant was used in a case was to read through the records and find the information buried within.</p>
<p>
	The unearthed data was presented through charts, graphics, and videos. In an interactive online package, users could search and sort terrorism suspects and check how often they were convicted of lesser charges. Users could also review internal FBI documents, watch surveillance videos, and decode counterterrorism jargon.&nbsp;This project took approximately one year from concept to publication and the core team consisted of five people, plus another dozen team members from the editorial web team.</p>
<p style="text-align: center; ">
	<a href="http://www.motherjones.com/fbi-terrorist?tid=All&amp;tid_1=All&amp;tid_2=All&amp;tid_3=All&amp;tid_4=21246&amp;tid_5=All&amp;date_filter[value][year]="><img alt="Screen_shot_2012-07-19_at_18.39.38.png" src="http://datadrivenjournalism.net/uploads/Screen_shot_2012-07-19_at_18.39.38.png" style="height: 262px; width: 600px; " /></a><br />
	<em>Screenshot of&nbsp;searchable terror trial database</em></p>
<h3>
	The challenges</h3>
<p>
	In the competition entry the team behind the project explains: &quot;The FBI is a notoriously secretive organization. As a result it took months of meetings and introductions to put together enough FBI sources to tell the story with authority. The primary challenge was to find the necessary information buried in thousands of pages of court records. Secondly, the information was never in the same place in every court file and thus required weeks of document review. Now the database is <a href="http://www.motherjones.com/fbi-terrorist">available</a> to everyone in searchable form from the Mother Jones&rsquo; website.&quot;</p>
<h3>
	Advice for aspiring data journalists</h3>
<p>
	&quot;All the information needed is out there; it just needs to be put together. In fact there are many stories like this waiting to written&mdash;where a trend can be explained by collecting information no else has assembled. Very few journalists have the time to pursue them today, which makes them all the more critical.&quot;</p>
<p>
	&nbsp;</p>
<p>
	<em>The Data Journalism Awards is a <a href="http://www.globaleditorsnetwork.org/">Global Editors Network</a> initiative supported by Google and organized in collaboration with the <a href="http://www.ejc.net/">European Journalism Centre</a>. Please visit the Data Journalism Awards <a href="http://datajournalismawards.org/">website</a>&nbsp;for the full list of winners.</em></p>
]]></description> 
      <dc:date>2012-07-19T17:03:44+00:00</dc:date>
    </item>

    <item>
      <title>Message Machine: When Are 190 Emails Like Six Emails?</title>
      <link>http://datadrivenjournalism.net/featured_projects/Message_Machine_When_Are_190_Emails_Like_Six_Emails</link>
      <guid>http://datadrivenjournalism.net/featured_projects/Message_Machine_When_Are_190_Emails_Like_Six_Emails#When:16:19:57Z</guid>
      <description><![CDATA[<p>
	<em>Originally published by <a href="http://www.propublica.org/site/author/jeff_larson">Jeff Larson</a> on <a href="http://www.propublica.org/nerds/item/when-are-190-emails-like-six-emails">ProPublica</a> on 8 March 2012 under a <a href="http://creativecommons.org/licenses/by-nc-nd/3.0/us/">Creative Commons Attribution-NonCommercial-NoDerivs</a> license.&nbsp;</em></p>
<p style="text-align: center; ">
	<img alt="smoot-nerds-post_1.jpg" src="http://datadrivenjournalism.net/uploads/smoot-nerds-post_1.jpg" style="height: 267px; width: 400px; " /></p>
<p>
	In March we published a <a href="http://www.propublica.org/special/message-machine-you-probably-dont-know-janet">graphic</a>&nbsp;that looks at six variations of a single email sent out by the Obama re-election campaign on the 1st of March.</p>
<p>
	This all started when fellow news nerd <a href="http://danielsinker.com/">Dan Sinker</a>&nbsp;got an email from the campaign on the same night his wife did and noticed that although they were both apparently from the same person at the campaign &mdash; Julianna Smoot &mdash; the e-mails had subtle differences. So Dan set up a Google form and <a href="https://twitter.com/dansinker/status/175429582894272512">asked his Twitter followers</a> to send in their own examples of the &ldquo;Smoot Email.&rdquo;</p>
<p>
	At ProPublica, we&rsquo;d been wanting to dig more deeply into how &ldquo;big campaign data&rdquo; works so we struck a deal with Dan: He&rsquo;d share his database with us and we&rsquo;d help analyze and visualize it.</p>
<p>
	Of course, it was far from a valid sample, but we thought analyzing the data would yield interesting observations if not statistically significant conclusions.</p>
<h3>
	Preprocessing</h3>
<p>
	We found six &ldquo;clusters&rdquo; in the 190 emails by grouping the emails using a statistical formula called the <a href="http://en.wikipedia.org/wiki/Pearson_product-moment_correlation_coefficient">Pearson Correlation</a>. More on that in a second.</p>
<p>
	In order to combat changes in whitespace or accidental insertions or deletions in the emails from influencing comparisons, we used a process called &ldquo;stemming&rdquo; to reduce each word to a common prefix. For example <a href="https://github.com/aurelian/ruby-stemmer">the stemmer we used</a> changes the word &ldquo;writing&rdquo; into &ldquo;write&rdquo; and &ldquo;journalism&rdquo; into &ldquo;journal&rdquo;. Then we removed common words like &ldquo;and&rdquo;, &ldquo;or&rdquo;, and &ldquo;but&rdquo; from each email.</p>
<p>
	Finally, we translated each document into a list of word frequencies, which converted each email into an abstract representation called a &ldquo;bag of words&rdquo; &mdash; like this:</p>
<p>
	&nbsp; &quot;curious&quot; =&gt; 1.0,<br />
	&nbsp; &quot;elig&quot; =&gt; 1.0,<br />
	&nbsp; &quot;told&quot; =&gt; 1.0,<br />
	&nbsp; &quot;payment&quot; =&gt; 3.0</p>
<h3>
	Stemming, Bagging and Correlating</h3>
<p>
	By treating each email as a &ldquo;bag of words&rdquo;, we were able to use the Pearson Correlation to group them together with others in the set.</p>
<p>
	In statistics, the <a href="http://en.wikipedia.org/wiki/Longest_common_subsequence_problem">Pearson Correlation</a> describes the strength of the dependence between two variables. In other words, it is a measure of how two variables change with one another. It returns a value from -1 to 1, with 0 meaning no correlation and 1 meaning perfect positive correlation and -1 meaning perfect inverse correlation. The Pearson Correlation wikipedia page explains the math behind the formula.</p>
<p>
	To determine whether a document belonged to a group or not, we set the threshold at 0.85.</p>
<p>
	After running the stemming, bagging and correlating we found that our sample contained six distinct emails.</p>
<h3>
	Diffing and Visualizing</h3>
<p>
	As it turns out the problem of computing differences between documents is very well understood, although it&rsquo;s quite <a href="http://en.wikipedia.org/wiki/Longest_common_subsequence_problem">complex</a>. Rather than implementing our own version of the algorithm, we used <a href="http://ejohn.org/projects/javascript-diff-algorithm/">one written by John Resig</a>, the creator of jQuery.</p>
<p>
	Because the emails ranged from very similar to quite different, we wanted an interface that made the variability easy to see at a glance. In addition to a small graphical indicator in the tabs that lets a reader know how strongly other emails match the one inside the tab, a reader can hover over each tab and compare emails very quickly.</p>
<p>
	In order to keep track of everything client-side, we modeled the data as an array of arrays. We wrote a small table-based javascript framework to do so, which turned out surprisingly like our Ruby-based library <a href="https://github.com/propublica/table-fu">table-fu</a>. You can check out the source over on <a href="https://gist.github.com/1951671">github</a>.</p>
<p>
	That JavaScript code &mdash; we&rsquo;re calling it <a href="https://gist.github.com/1951671">table.js</a> &mdash; is really easy to use. You can call the regular functional programming methods &mdash; each, map, reduce &mdash; on each table. It also has simple statistics built in. For example:</p>
<p>
	<img alt="Screen_shot_2012-07-19_at_18.05.38.png" src="http://datadrivenjournalism.net/uploads/Screen_shot_2012-07-19_at_18.05.38.png" style="height: 124px; width: 600px; " /></p>
<p>
	Table.js also allowed us to build out the sidebar which shows a simple breakdown of the recipients of each email. Because HTML is really good at displaying rectangles, the graphs in the sidebar are built entirely out of carefully positioned divs.</p>
<p>
	<strong>Correction: </strong>This post originally stated that a Pearson Correlation score of -1 meant no correlation, in fact a score of 0 means no correlation, and a score of -1 means negative correlation. The post has been updated to reflect this fact.</p>
]]></description> 
      <dc:date>2012-07-19T16:19:57+00:00</dc:date>
    </item>

    
    </channel>
</rss>