Translating unstructured, crowdsourced content into structured data



The use of crowdsourcing in biology is gaining popularity as a mechanism to tackle challenges of massive scale.  However, to maximize participation and lower the barriers to entry, contributions to crowdsourcing efforts are typically not well-structured, which makes computing on these data challenging and difficult.  The presentation will discuss strategies for translating this unstructured content into structured data.  Three vignettes (in varying degrees of completion) will be described, one each from our Gene Wiki [1], BioGPS [2], and serious gaming [3] initiatives.






Andrew Su is an Associate Professor at the Scripps Research Institute.  His group develops bioinformatics tools for biomedical research, focusing on harnessing the collective efforts of the biology community.  The lab also applies these and other tools to a variety of projects in integrative genomics.  Andrew previously led bioinformatics efforts at GNF from 2002 to 2011.  Prior to that, he received his Ph.D. from the Scripps Research Institute and did his undergraduate studies at Northwestern University.





Meeting information 


Date: Wednesday, February 20, 2013 

Time: 10:00 am, Pacific Standard Time (San Francisco, GMT-08:00) 



To start or join the online meeting 


Go to


Meeting Number: 925 756 393 

Meeting Password: ncbo 



Audio conference information 


To receive a call back, provide your phone number when you join the meeting, or call the number below and enter the access code. 

Call-in toll number (US/Canada): 1-650-429-3300 

Global call-in numbers:


Access code:925 756 393