Automating Summaries of Scientific Papers
Researchers from Yale University and the University of Washington have released a dataset of 1,000 scientific papers, sentences from other research that cite the reference paper, and summaries to...
View ArticleDetecting Objects in Difficult Conditions
Researchers from the Georgia Institute of Technology have released a dataset of nearly 3,000 algorithmically-augmented videos to improve the ability of systems to detect street signs in difficult...
View ArticleMeasuring the Diversity of U.S. School Districts
The Washington Post has released a series of data visualizations showing that U.S. schools have become increasingly diverse since 1995 but that many students in large cities and the South still live in...
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers September 7-13, 2019, and includes articles about automating the analysis of brain scans and using facial recognition at the 2020 Olympics. 1. Sharing...
View ArticleVisualizing the Tweets of the U.S. Congress
The Pudding has created a data visualization tool that allows users to visualize the people, places, and things that members of the U.S. Congress most frequently tweet about. Users can filter the data...
View ArticleRebooting AI: Building Artificial Intelligence We Can Trust
In their new book, Rebooting AI: Building Artificial Intelligence We Can Trust, professors Gary Marcus and Ernest Davis argue that society is still far away from developing superintelligent machines....
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers September 14-20, 2019, and includes articles about identifying cyberbullies online and a system that can predict the size of wildfires. 1. Identifying...
View ArticleVisualizing Speeches Made in the German Parliament
Zeit Online has created a series of data visualizations illustrating how debate and the topics discussed in the German parliament have changed since 1949. For example, speakers have increasingly used...
View ArticleMaking AI Agents Better Conversationalists
Amazon has released a dataset of nearly 11,000 conversations between Mechanical Turk workers to advance the development of AI agents that can have engaging conversations with humans. The dataset...
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers September 21-27, 2019, and includes articles about an AI system that can predict El Niño and a charity using AI to tackle homelessness. 1. Achieving...
View ArticleVisualizing the Effects of Bad Weather and the Trade War on Farmers
The Wall Street Journal has created a series of data visualizations showing how bad weather and the United States-China trade war have created significant challenges for U.S. farmers. The...
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers September 28-October 4, 2019, and includes articles about using AI to restore movement in individuals with spinal cord injuries and using AI to judge...
View ArticleDeveloping Systems to Identify Deepfakes
Google has released a dataset of deepfake videos to further the development of systems that can detect deepfakes. The dataset includes 363 real videos of 28 consenting actors and an additional 3,068...
View ArticleVisualizing Debate at the UN General Assembly
Al Jazeera has created a series of data visualizations showing how each nation has voted at the United Nations. Al Jazeera analyzed the 6,112 roll-call votes that took place between 1946 to 2018 at the...
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers October 5-11, 2019, and includes articles about a system that uses AI to see through walls and a new genomics technique that identifies the causes of...
View ArticleCreating a Search Engine for Finding Code
Github and Microsoft have released a dataset of search queries for code and annotated results to advance the development of search engines that can locate specific code. The dataset includes 99 search...
View ArticleVisualizing Each Row in a Data Set
Microsoft has open-sourced SandDance, a data visualization tool that allows users to create 2D and 3D visualizations. Rather than showing the sum of data points, SandDance represents each row in a...
View Article10 Bits: the Data News Hotlist
This week’s list of data news highlights covers October 12-18, 2019, and includes articles using smart speakers to monitor the health of infants and using AI to prevent harmful drug interactions. 1....
View ArticleBuilding Systems That Can Verify Facts
Researchers from the University of California, Santa Barbara, and Tencent have released a dataset of 118,000 statements and 17,000 related Wikipedia tables to spur the development of systems that can...
View ArticleTyping or Speaking Commands to Create Data Visualizations
Researchers from New York University have developed a data visualization tool that allows users to create data visualizations by typing or speaking simple instructions instead of coding. The tool,...
View Article