BERKELEY, CA -- (Marketwired) -- 07/03/14 -- The International Computer Science Institute (ICSI), a leading center for computer science research, today announced a collaboration with Yahoo Labs and Lawrence Livermore National Laboratory to process and analyze the recently released Yahoo Flickr Creative Commons 100 Million (YFCC100M) dataset, a publicly available corpus of user-generated content comprising more than 100 million images and videos.
ICSI has developed a number of research tools to extract meaning from the vast amounts of multimedia data freely available online, giving researchers the ability to draw powerful conclusions from the data. Such work includes:
ICSI is collaborating directly with Lawrence Livermore Lab to process the massive dataset using the lab's supercomputer, the Cray Catalyst.
"The media that people choose to upload with a Creative Commons License are full of information: they tell us about the people in them, where they are and what is happening, even if none of that is explicitly laid out," said Gerald Friedland, research director of Audio and Multimedia at ICSI. "ICSI's sophisticated computing tools help us make sense of that data at scale, and there is so much we can learn by fully leveraging the rich Creative Commons dataset that Flickr has amassed over the past decade."
The dataset can be requested through Yahoo's Webscope program here, and ICSI's research analytics tools will be hosted on an Amazon instance via ICSI's web site in August of this year.
The development of ICSI's research tools are supported by grants from the NSF, NGA, and IARPA's ALADDIN program.
The International Computer Science Institute (ICSI) is a leading center for research in computer science and one of the few independent, nonprofit research institutes in the United States. With its unique focus on international collaboration and its affiliation with the University of California at Berkeley, ICSI brings together the most influential U.S. scientists and experts from around the world in areas such as computer networking and security, speech and language processing, algorithms, bioinformatics, computer architecture, computer vision, multimedia analysis, and artificial intelligence. For more information, visit ICSI at www.ICSI.berkeley.edu, follow us at www.twitter.com/ICSIatBerkeley, or read our blog at http://www.ICSI.berkeley.edu/icsi/blog.