On Track to Data-Driven

Posted by Aaron Ling on December 25, 2013 in Big Data, Distributed Computing, Technical Management

Ancestry.com is becoming more and more aware of the value of the data our website generates every day. Many customers come to the website to discover, preserve and share their family history. They come from different parts of the world and are looking for information that helps them tell the story… Read more

Throttling Image Processing

Posted by Tyler Jensen on June 21, 2013 in Distributed Computing, Image Processing and Analysis

Ancestry.com, like any other site with millions of subscribers, experiences predictable load patterns throughout the day. To maximize site performance and customer satisfaction, we make every effort to schedule maintenance during off-peak intervals. Content processing, on the other hand, especially for our repository of hundreds of millions of images, is a constant, ongoing effort, and in… Read more

Distributed Parallel Computing at Ancestry.com

Posted by Tyler Jensen on April 24, 2013 in Distributed Computing, Image Processing and Analysis

About 450 years ago John Heywood wrote, “many hands make light work.” The same can be said of image and data processing. Distributed parallel computing (DPC) makes it possible for us to do the work described by Michael Murdock in his series on the image processing pipeline. If you haven’t already, take a moment to… Read more