This comparative ecosystem analysis of key commercial data analysis and interpretation tool providers reveals a range of capabilities that uniquely address both secondary and tertiary sequence data analysis needs. At first glance, the value propositions and communication of capabilities may seem overlapping or similar. However, a deeper analysis clearly uncovers substantial differences in the product characteristics related to analysis, interpretation, data management, and more.
While this report does not intend to provide direct recommendations to end-users forspecific product selection, the deep-dive evaluation is meant to be an insightful review to help researchers and commercial entities choose the best partner for success.
The advancements in high-throughput sequencing demand powerful, yet user-friendly bioinformatics solutions to analyze and extract meaningful knowledge from the generated sequence data. In line with this demand, the sequence data analysis and interpretation space is rich with commercial bioinformatics and/or platform solutions that provide different analysis, interpretation, and reporting solutions, and is paralleled by the ever-growing open source tools space. Commercial solutions provide a multitude of bioinformatics offerings as stand-alone packages. They each differ in the capabilities offered, the user-friendliness they provide, the product(s) cost, and the target audience. This creates a challenging and sometimes bewildering environment for the end-users faced with selecting the right offering for their specific analysis needs. Accordingly, the commercial entities in turn need to clearly communicate their competitive advantage to customers to be among the winners in this field.
A set of 25 commercial companies was analyzed revealing top players who are strong in different areas of data analysis and interpretation: DNAnexus and Seven Bridges Genomics have emerged as open and flexible platforms for data analysis, while Bluebee, Genalice, and Edico Genome, as alternatives, enable fast DNA processing. On the data interpretation side, notable players are Cartagenia (Agilent), Congenica, Ingenuity (Qiagen), Omicia, Station X, Tute Genomics, and WuXi NextCODE (WuXi AppTec). Content is clearly king on the interpretation side, and tertiary companies with rich content are highly valued: it is therefore not surprising to see Ingenuity with its manually curated content receiving high marks, or Cartagenia and Omicia with the additional phenotype content being highlighted as value providers. End-user interviewees repeatedly pointed out the challenges associated with the complexity of data analysis that favors complete and integrated solutions that include secondary, tertiary and reporting capabilities. This is particularly relevant for organizations that have a range of users at different computer sophistication levels. Tute Genomics is a company that was mentioned favorably supporting the entire workflow from data analysis to interpretation, and findings reporting.
In summary, this detailed report comprises a deep ecosystem level analysis of a major set of key commercial software and platform providers (N=25) in the secondary and tertiary NGS analysis space, to understand product focus, strategy to address end-user needs, and company momentum (incl. funding situation, number of employees, and more):
- Advaita, Appistry, BaseSpace (Illumina), Bina Technologies (Roche), Bluebee, Cartagenia (Agilent), CLC Bio (Qiagen), Congenica, DNAnexus, DNASTAR, Edico Genome, Genalice, Genestack, Genomatix, Genospace, Golden Helix, Ingenuity Systems (Qiagen), Partek, Omicia, Seven Bridges Genomics, Softgenetics, SolveBio, Station X, Tute Genomics, and WuXi NextCODE (WuXi AppTec).