• ADVERTISEMENT

    DocumentCloud Releases More Code, Continues to Attract Developer Interest

    by Amanda Hickman
    December 10, 2009

    A public beta of DocumentCloud, one that journalists can kick the wheels on and upload documents to, won’t be ready for a few more months, but work is continuing apace in our corner of the cloud.

    We’ve released a handful of code that comprises some of the components of our big picture, and it is great to see how well received our work has been by the Ruby and JavaScript communities. Last week we hit a little milestone: more than 1,000 developers are watching DocumentCloud projects on Git Hub, which is pretty cool. The advantage for us is that many of these developers are actually trying out our software releases and helping us make them stronger.

    ADVERTISEMENT

    Gregg Pollack included a great review of CloudCrowd in a recent episode of his show, Scaling Rails. CloudCrowd will still be Greek to the truly non-technical readers out there, but if you have enough of a handle on software development to wish you understood“scaling” better, his review just might help.

    Our latest release, Docsplit, is a command-line utility and Ruby library for splitting documents into distinct components such as raw text (which you need for searches), page thumbnails, and document metadata (details like the document’s author or the number of pages it contains).

    Splitting documents apart is a pretty key functionality for DocumentCloud: everything else DocumentCloud does depends on the presence of one or another of these pieces. Docsplit got a lot of attention when we released it on Monday — and we’re all looking forward to seeing what other folks do with it.

    ADVERTISEMENT

    Tagged: code documentcloud javascript open source ruby

    2 responses to “DocumentCloud Releases More Code, Continues to Attract Developer Interest”

    1. amanda says:

      For the “looking forward to seeing” files …

      Seems the Guardian turned to Docsplit early this morning in the rush to launch their latest crowdsourcing project, which invites readers to help investigate MP’s expense records.

    2. Amanda says:

      Speaking of looking forward to seeing … nice to wake up to this note http://twitter.com/simonw/status/6530220753 from a software architect at the Guardian. Their MP Expense Log crowd sourcing project is pretty great:

      http://mps-expenses2.guardian.co.uk/

  • ADVERTISEMENT
  • ADVERTISEMENT
  • Who We Are

    MediaShift is the premier destination for insight and analysis at the intersection of media and technology. The MediaShift network includes MediaShift, EducationShift, MetricShift and Idea Lab, as well as workshops and weekend hackathons, email newsletters, a weekly podcast and a series of DigitalEd online trainings.

    About MediaShift »
    Contact us »
    Sponsor MediaShift »
    MediaShift Newsletters »

    Follow us on Social Media

    @MediaShiftorg
    @Mediatwit
    @MediaShiftPod
    Facebook.com/MediaShift