Like many organizations, Ancestry.com is constantly accumulating more high volume, high velocity data of all kinds. We apply innovation at scale to handle 10 petabytes of highly dynamic family history data, and a flood of new data derived from our autosomal DNA test, AncestryDNA.
How do we do it? Join our dynamic duo Bill Yetman (manager perspective) and Jeremy Pollack (developer perspective) next week at QConSF for a lively presentation that will dive into the details of how the company scaled to handle a DNA pool size of 200,000 samples and generate 10 million 4th cousin matches to our users. Learn how Agile techniques, the “measure everything” principle, and a unique mix of scientists and engineers worked together to create a unique breakthrough architecture, and created a one-of-a-kind family history product along the way.
By working together and leveraging technologies like Hadoop and HBase, they’ve created an industrial, scalable DNA processing pipeline to bring fun and unique insights to customers regarding their ethnicity and distant cousin matches.
We hope to see you next week!
Scaling AncestryDNA using Hadoop and HBase Session Info:
Monday, November 11, 2:50 – 3:40PM PT
Grand Ballroom A
Presented by Bill Yetman and Jeremy Pollack (Ancestry.com)
About Bill Yetman
Bill Yetman has served as Senior Director of Engineering at Ancestry.com since January 2011. Bill has held multiple positions with Ancestry.com from August 2002, including Senior Director of Engineering, Director of Sites, Mobile and APIs, Director of Ad Operations and Ad Sales, Senior Software Manager of eCommerce and Senior Software Devloper. Prior to joining Ancestry.com, he held several developer and programmer roles with Coresoft Technologies, Inc., Novell/Word Perfect, Fujitsu Systems of America and NCR. Mr. Yetman holds a B.S. in Computer Science and a B.A. in Psychology from San Diego State University.
About Jeremy Pollack
Jeremy Pollack is a senior engineer at Ancestry.com, where his team supports a team of scientists and makes their discoveries scale. In the past, he’s written code that withstood the traffic from a Superbowl ad, created the content management system for one of the web’s most popular parenting sites, and looked after the technology needs of a well-known online magazine. When he’s not coding, he enjoys reading, playing the darbuka, and throwing awesome dinner parties.