One of the big challenges of working with heterogeneous data is curating it. Below are introductions to two tools for doing do:
- Gridworks, developed by David Huynh, Stefano Mazzocchi, and their colleagues at Metaweb, the company behind Freebase.
- Needlebase, developed by Justin Boyan and colleagues at ITA Software, the company powering travel search for Kayak, Orbitz, and others.
If you’re concerned with building and maintaining collections of semi-structured data, or building your own technology for this purpose, I suggest you check out these state-of-the-art tools.
If you enjoyed this post, make sure you subscribe to my RSS feed!
1 response so far ↓
1 LC // Jul 18, 2010 at 10:53 pm
How prescient! Now Google is going to own both!!
Leave a Comment