Mercator BioLogic developed an innovative approach to the genetic assembly process but needed a high-performance storage solution to turn that approach into reality. Mercator BioLogic teamed up with SanDisk to deploy a 4-node Oracle RAC cluster that used 25TB of Fusion ioMemory application accelerators with SanDisk ION Accelerator software.
Mercator BioLogic understands that the process of digitizing (scanning) DNA is imperfect. Currently, an entire chromosome cannot be scanned in a single, unbroken line. This means that after biological samples have been scanned, the scanned pieces must be assembled into a full chromosome map before the data can be analyzed properly. The assembly process involves taking multiple chunks of data and accurately matching them together to create a single dataset that represents a set of chromosomes, that is, a genome. During this process, the sample DNA is compared with known genetic markers to determine whether the specific sample has a genetic tendency towards a known disease. A genetic code baseline is used to help measure a system’s performance when it reassembles the scanned biological sample.
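To make the idea of "matching chunks together" concrete, here is a toy sketch of a greedy overlap merge in Python. It is not Mercator BioLogic's method, which this case study does not describe; the function names, the minimum overlap length, and the sample fragments are all assumptions chosen for illustration. Real assemblers must also cope with scanning errors, repeated regions, and far larger inputs.

```python
def overlap(a: str, b: str, min_len: int = 3) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`.

    Returns 0 if no overlap of at least `min_len` characters exists.
    """
    max_len = min(len(a), len(b))
    for k in range(max_len, min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(fragments: list[str], min_len: int = 3) -> list[str]:
    """Repeatedly merge the pair of fragments with the largest overlap.

    Hypothetical helper for illustration only. Stops when no pair overlaps
    by at least `min_len` characters and returns the remaining sequences.
    """
    frags = list(dict.fromkeys(fragments))  # drop exact duplicates, keep order
    while True:
        best_k, best_i, best_j = 0, -1, -1
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b, min_len)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            return frags
        # Join the two fragments, keeping the shared region only once.
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for idx, f in enumerate(frags) if idx not in (best_i, best_j)]
        frags.append(merged)


# Made-up fragments that overlap end to end.
pieces = ["CGTTAGC", "ATGGCGTA", "CGTACGTT"]
assembled = greedy_assemble(pieces)
print(assembled)
```

Even this simplified version compares every fragment with every other fragment on each pass, which hints at why assembly at genome scale places such heavy demands on compute and storage.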
The current industry standard for this assembly process with the genetic code baseline is 37 to 45 days per biological sample. The shortest time to accomplish this assembly, with reduced accuracy, is 27 days. This means that currently a person with a rare cancer who wants a targeted treatment approach would be waiting for more than a month for test results. But the situation is even more daunting; there are thousands of active DNA sequencing requests to resolve, and the current assembly process is performed only on a limited number of specialized machines. A waiting issue like this could easily occur in other industries that do genome processing, such as law enforcement and agriculture.
The challenge that Mercator BioLogic faced was to drastically reduce the 37-day window [1] for assembling the scanned genetic code.
With its industry-leading expertise, Mercator BioLogic developed an innovative approach to the genetic assembly process, but it also needed a high-performance storage solution to turn that approach into reality. Mercator BioLogic teamed up with SanDisk to deploy a 4-node Oracle RAC cluster that used 25TB of Fusion ioMemory application accelerators with SanDisk ION Accelerator™ software. With this system, Mercator BioLogic was able to reduce the time required for genetic code assembly from 37 days to an astonishing 84 minutes for benchmark genomic data. This represents a performance improvement of more than 600x.
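The "more than 600x" figure can be checked with simple arithmetic. The short Python sketch below uses only the two numbers quoted above (37 days and 84 minutes); the variable names are illustrative.

```python
MINUTES_PER_DAY = 24 * 60

baseline_minutes = 37 * MINUTES_PER_DAY   # 37-day assembly time, in minutes
accelerated_minutes = 84                  # reported time on the new system

speedup = baseline_minutes / accelerated_minutes
print(f"{baseline_minutes} min / {accelerated_minutes} min = {speedup:.0f}x")
```

The ratio works out to roughly 634x, which is consistent with the claim of more than 600x.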
“This system has proven to give more than three times the performance of any other system we’ve seen,” remarked Roger Arvisais, a Founding Partner of Mercator BioLogic. “There is no other performance enhancement available that produces these results.”
The configuration for the Oracle RAC cluster solution is detailed below. The Oracle RAC utilized ASM to manage the LUNs presented by the SanDisk ION Accelerator software.
Four Oracle RAC nodes, each with:
One host system with:
Using their leading expertise in bio-informatics and a powerful data storage solution from SanDisk, Mercator BioLogic took a giant leap forward in genome mapping. With Fusion ioMemory storage, SanDisk ION Accelerator software, and an Oracle RAC cluster, Mercator BioLogic slashed genetic code assembly from 37 days to less than an hour and a half, with no loss in accuracy. This opens the door to exciting advances in industries such as:
Healthcare: Much faster diagnosis and treatment of genetic-based maladies.
Law enforcement: Accurately identifying a perpetrator of a crime with faster, accurate DNA matching.
Agriculture and farming: Safer genetic manipulation of vegetation, or the development of healthier animals by better selection of breeding stock through genetic scans.
For more information, see the Alignment, Assembly, and Analysis of Genomic Information white paper on the Mercator BioLogic website (http://www.mercatorbiologic.com).
[1] The Planck Institute in Bern, Switzerland accomplished this same task in only 27 days; however, they reduced oversampling from the standard 28 to 18, which can affect accuracy.
The performance results and cost savings discussed herein are based on internal testing and use of SanDisk products. Results and performance may vary according to configurations and systems, including drive capacity, system architecture and applications.