Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Revealing Data: Why We Need Humans to Curate Web Collections

Circulating Now – NIH – “In this Revealing Data series we explore data in historical medical collections, and how preserving this data helps to ensure that generations of researchers can reexamine it, reveal new stories, and make new discoveries. Future researchers will likely want to examine the data of the web archive collections, collected and preserved by libraries, archives, and others, using a wide range of approaches, to document unfolding events.  Today Circulating Now welcomes guest blogger Alexander Nwala (@acnwala), writing on his research using NLM web archive collections to compare different methods of selecting web content, and some of the difficulties encountered in generating seeds automatically.”

I am a Computer Science PhD student and member of the Web Science and Digital Libraries research group at Old Dominion University, Norfolk Virginia. For the past three years, I have been researching generating collections for stories and events under the supervision of Dr. Michael Nelson and Dr. Michele Weigle. There is a shortage of curators to build web archive collections in a world of rapidly unfolding events. A primary objective of my research is investigating how to automatically generate seeds (in the absence of domain knowledge) to create or augment web archive collections…”

Sorry, comments are closed for this post.