Why We Need Big Infrastructure For Tackling Rare Disease

Hannes Smarason big data rare diseases

Having bigger databases, and having a mechanism that flags genomic variants, is key to optimizing patient care for entire families

Uncovering the genetics of schizophrenia is vital but challenging. As I wrote in my last post, mutations in more than 100 spots in the genome have been linked to the condition. But which ones actually play a role in the disease, and which ones are just there for the ride—innocent bystanders that just happen to occur alongside the real culprits? That’s the crucial question for scientists seeking new treatments for this condition, among them leading researchers and clinicians at our close partner, Boston Children’s Hospital (BCH).

One thing we’ve learned recently is that even a small amount of knowledge about genetic underpinnings of disease can have a big potential benefit for patients. For example, the 16p13.11 region deletion I described in that last post ended up being very important for several patients later, particularly one father and his son, recently described by our colleagues at BCH. This case highlights the importance of expanding the scope and scale of such research, and of updating and alerting patients as more is discovered—not just in schizophrenia, but across rare disease.

In their previous work, the BCH team used chromosomal microarray analysis to determine that a young boy with symptoms of schizophrenia, including psychosis, was missing an entire chunk of DNA—one copy of the chromosomal region 16p13.11, which spans several genes.

Schizophrenia in children is rare, and some researchers believe it could be an extreme variation of the disease, and so might hold important clues for the treatment of this condition in both young and old. A search of our and BCH’s databases showed that several other young patients also showed variations in that region. Just as important, it was confirmed that a parent of one of those patients also carried that deletion, and it seemed likely that another parent (not available for testing, but with reported symptoms of schizophrenia) also likely carried the deletion.

Clearly 16p13.11 seemed to be emerging as a “hotspot” for variations linked to psychosis. But the scientists were only finding this because they could go back and search the databases, and they were working their way backwards from pediatric cases to learn information that might have been medically relevant to the parents as well. All this suggests that having bigger databases, and having a mechanism that flags such variants, is key to optimizing patient care for entire families.

One case uncovered by the BCH scientists, regarding a young man who we will call Jack, brought this into sharp focus. As a teenager, Jack had undergone detailed genetic screening at BCH because of symptoms that included learning disabilities and recurrent seizures. It was determined that he had a 16p13.11 deletion, but at the time of his screening, that mutation hadn’t yet been linked to psychosis. So it became just one more detail in Jack’s medical record.

Separately, a few years later, Jack’s father was diagnosed with ADHD and treated with a high dose of mixed amphetamine salts. Within a few weeks Jack’s father experienced a manic-psychotic episode. He was prescribed an anti-psychotic and eventually recovered. Unfortunately, his son was deeply affected by his father’s breakdown and became withdrawn and depressed. Eventually, Jack also developed psychotic symptoms, which were so serious he was hospitalized.

Jack’s symptoms, thankfully, responded to anti-psychotic medication, but his doctors wondered if there was any connection between the breakdowns suffered by the father and son.

A check of Jack’s medical record revealed the 16p13.11 deletion. And seeing that detail after the link had been made between 16p13.11 and psychosis, his doctors immediately speculated that it might be a cause of Jack’s symptoms. Further, they suspected that mutation could be the “linchpin” causing psychosis in the father and the son. Jack’s father was tested, and he also carries a 16p13.11 deletion.

So here’s the lesson: if Jack’s doctors had known about the link between 16p13.11 and psychosis as soon as it emerged, they might have also suggested testing Jack’s father. If they had, the BCH doctors “believe that the psychosis could have been averted in both father and son.”

In light of this case, the BCH researchers write that they see a keen need for broad, integrated, and sophisticated infrastructure to support genomics-driven precision medicine. They have several recommendations, including that physicians need to receive regularly updated risk information about specific mutations; genetic reports on parents who are “carriers” but seem unaffected should note that problems could arise later, and families that include carriers of variations that increase risk should be monitored and given counseling.

Such activities will be well supported by tools such as WuXi NextCODE’s Genomically Ordered Relational (GOR) database and global platform for diagnosing rare disease and building a global knowledgebase. This can act as one of the key spokes in the “wheel” of genomic diagnostic process. But we also need to build in others, such as means to automatically alert doctors to important knowledge updates, monitor patient records, and connect doctors to specialists who can help refine a diagnosis as new discoveries are made. We and our partners at BCH are committed to helping create these tools.

*  *  *  *  *  *  *  *  *  *  *  *

Headed to ASHG? If you are attending ASHG this month, join us to hear more about how rare disease studies can inform our understand of common diseases at two of our “Genomes for Breakfast” events: “Using Population Genomics to Understand Common and Rare Disease” (Oct. 18), and “Using NGS to Diagnose Rare Disease—Experiences from Three Continents” (Oct. 19).

email

Domain Expertise: Jumpstarting Artificial Intelligence in Biomedicine

Is artificial intelligence the “single most transformative technology in modern history?” That’s the view of Tom Chittenden, who leads WuXiNextCODE’s AI program. And Tom is not alone in his enthusiasm, as numerous analysts are predicting this technology will be one of the fastest growing fields in the world.

In recent talks at Boston’s BioIT World and the EmTech conference in Hong Kong, Tom described some of the strides we’ve been making with our DeepCODE AI tools. Their power is in part thanks to a novel, causal statistical-learning method and deep-learning classification strategy. But another advantage is that they were built on—and are extending the reach of—our global platform for genomic data. That means that Tom’s team has that rare combination of both of the key ingredients to AI making an impact in biomedicine: cutting-edge algorithms AND deep domain expertise and access to the biggest datasets.

Tom—who also holds appointments at Harvard, MIT, and Boston Children’s Hospital—and his growing team have the former in spades; our platform and expertise in genomics provide a key edge in the latter. Our platform has been built over more than 20 years and today underpins the majority of the world’s largest genomics efforts and includes all major global reference databases. It stores, manages, and integrates any type of genomic data and correlates it with phenotype, ‘omics’, biology, outcome, and virtually any other type of data that may be relevant to a particular medical challenge.

That means that we can routinely train and test our AI tools on some of the most comprehensive data sets in the world, such as that in The Cancer Genome Atlas (TCGA). “Today we can take ‘omics data and clinical information and map those to curated resources such as SNOMED CT and biomedical ontologies, and then use AI to identify patterns that lead us to novel findings,” Tom says.

This is a powerful approach to tease out which of hundreds of genetic variants are really involved in a particular disease, based on which ones are actually associated with aberrant expression pathways. You may find hundreds of genetic mutations in a single type of breast cancer tumor, for example, but it is determining which ones are drivers of the disease that matters.

Put simply, AI can lead us to both better diagnoses and easier discovery of more and better drug targets, by taking a range of genomic data and marrying it to clinical information and scientific knowledge. AI is not just going to better match patients to the right drugs, it is going to help further our understanding of the relationships between genes and complex molecular signaling networks, one of the most challenging arenas in our field and the most sought-after starting point for discovering validated pathways and targets.

Valuable insights in real-world medical challenges are already emerging from this AI effort uniquely developed on and applied to the genomic and medical data that counts.

WuXi NextCODE  recently presented preliminary data from analyses using our novel AI technology to diagnose subtypes of tumors. Our DeepCODE tools were validated on six patient-derived tumor xenografts from mouse models, and then tested against approximately 8,200 human tumors from a collection of 22 cancer types in The National Cancer Institute’s TCGA collection. That study included five ‘omics data types. We achieved 98% accuracy overall, and our analyses of human breast and lung cancer subtypes were accurate in 96% and 99% of cases, respectively. This points to an improvement over current methods for matching patients to treatments for their particular cancer, and we have refined that accuracy further still. This capability is also going to be central to the development of liquid biopsies.

http://hannessmarason.com/blog/2017/04/04/a-perfect-pairing-ai-and-precision-medicine/

In another oncology study, using the same multi-omics data, DeepCODE identified a signal predictive of survival across 21 cancers, pointing to novel and holistic pathways for developing broad oncotherapies.

A recent study published in Nature, meanwhile, describes a potential new role for a well-known growth factor. That report, led by Yale University scientist Michael Simons, looked at blood vessel growth regulation—a crucial process in some very common conditions, including cardiovascular disease and cancer. Our Shanghai team provided RNA sequencing for this study. Our Cambridge AI team drove some of the key insights pointing to novel disease mechanisms.

Simons’ team studied knockout mice, whose fibroblast growth factor (FGF) receptor genes were turned off. They proved, for the first time, that FGFs have a key role in blood vessel growth, uncovering some metabolic processes that were “a complete surprise,” according to scientists on the team. Further, they mapped out pathways that could help provide new drug leads.

http://hannessmarason.com/blog/2017/05/15/bringing-artificial-intelligence-cardiovascular-medicine-cancer-genomics-action/

Our AI team is just getting started. We’re looking forward to many more intriguing findings from this group as they leverage their expertise and massive amounts of the relevant data to improve medicine and healthcare.

Speeding Diagnosis of Rare Diseases

WuXi NextCODE Claritas

Claritas Genomics combines physician experience with next-generation sequencing and WuXi NextCODE’s analytics to accelerate rare disease diagnosis.

It’s one of the most heartbreaking and frustrating things for parents and pediatricians. When a child presents with a constellation of symptoms that doesn’t point to a known disease, what do you do?

Typically, these kids undergo a battery of tests, some of which will eventually be for single genes suspected to play a role in their health problems. But what if those tests come up negative? That leaves the families and doctors wringing their hands as they wonder what to do next.

That was the case with a patient at Boston Children’s Hospital (BCH). He was a boy who, at six months, wasn’t sitting up, smiling, or doing most of the things babies his age typically do. Instead, he seemed “rigid” to his mom, and then he developed a severe respiratory virus and was hospitalized. He also had repeated seizures and eventually needed a tracheotomy—a tube placed through an incision in his throat to help him breath.

Usually, such kids then begin going through what is known as a “diagnostic odyssey”—a long and arduous journey from doctor to doctor and lab to lab.

BCH doctors are trying a new approach. In 2013, the hospital spun out Claritas Genomics, a specialized genetics diagnostics business that combines the experience of the hospital’s physicians with the power of next-generation sequencing and WuXi NextCODE’s advanced analytics. Timothy Yu, a neurologist and researcher at BCH, helped found Claritas to provide a more holistic approach to rare disease.

WuXi NextCODE’s advanced analytics play a key role in improving the speed and efficiency of such diagnostics. Reading the genome isn’t the major challenge anymore—now the issue is finding the relevant mutations in those three billion base pairs.

The data from a single genome can comprise more than 100 gigabytes, which is enough to fill the hard drive on a good laptop computer. Even the exome, which comprises the parts of the genome that encode proteins, can be 15 gigabytes. To diagnose a rare disease, doctors need to find sequence variations and then scour the research to find out what those actually do. That used to take months to years, and many of the variants were simply classified as being of “unknown significance,” without any further information or the ability to check again as the field of knowledge grew.

WuXi NextCODE’s system has begun to make this a click-and-search task. Our knowledgebase can mine all publicly available global reference datasets simultaneously and in real time to show all there is to know about any given variant and its likely biological impact. By keeping the data in a WuXi NextCODE research database, such as the one BCH is growing every day, our system can also quickly rerun the analysis and provide new information as soon as it becomes known.

Claritas is continually expanding the range of its services. Most recently, the group received conditional approval from the New York Department of Health for three new “region of interest assays” as well as one for mitochondrial DNA. That brings the number of Claritas’s approved tests in that state up to six and means more patients in New York will benefit from this new technology.

Children at BCH with ambiguous diagnoses now regularly undergo a whole exome scan early in their clinical journey. The data is then triaged. It is examined first for the most obvious mutations and then more data is progressively analyzed as necessary. With the consent of parents and security measures for privacy, that data can also become part of research datasets at BCH and other major hospitals around the world, so that the growing data pool can benefit that child and others.

This combination of expertise and technology helped Claritas Genomics find an answer for that baby boy and his family mentioned earlier. Heather Olson, the boy’s treating neurologist, had the boy’s exome scanned through Claritas Genomics, and 130 genetic variations were identified that could have caused one or more of the symptoms. WuXi NextCODE’s system helped narrow that down to only six variants that could have possibly been passed on by the boy’s parents. Olson and Yu finally focused on one, a mutation of the BRAT1 gene, which served as a diagnosis. A paper published by Yu, Olson, and colleagues, which describes this mutation and children affected by it, should help other physicians make this diagnosis more quickly in the future.

Yu presented more on Claritas’s novel platform recently at Boston’s Bio-IT World meeting. He explained how the platform helps doctors to much more quickly and accurately diagnose kids with diseases not previously described.

“Thanks to the speed of the platform, we can get a whole clinical exome completed in as little as two weeks,” he said.

The growing database of genetic variants and their effects also means more patients will get an actual diagnosis, rather than walking away still wondering what could be going on.

The ability to diagnose more cases is a start to unravelling the causes of the estimated 7,000 different rare diseases estimated to exist. And it’s a necessary first step towards developing new therapies for those conditions, too.

Imagine the Potential: The World’s First Online Hub for Global Genomic Data Access

The NextCODE Exchange, a new browser-based hub, allows for real-time sharing of whole genome collections in a simple, consistent format.

The NextCODE Exchange, a new browser-based hub, allows for real-time sharing of whole genome collections in a simple, consistent format.

The field of genomic medicine is rapidly advancing as the research community becomes more comfortable manipulating genomic data with the goal of discovering insights about disease causes and risks. Yet each database is hosted within separate organizations, organized in unique ways and vastly too cumbersome to easily share with others who may be working on similar research.

This weekend a new tool launched to enable just that. The NextCODE Exchange (see release here), a new browser-based hub, allows for real-time sharing of whole genome collections in a simple, consistent format.

The availability of this Exchange is a critical advance in extending the utility of genomic data by allowing organizations around the world to access and harmonize large complementary datasets, potentially multiplying their study data sets to gain more reliable insights than ever before.

Already, numerous organizations are participating in the NextCODE Exchange to add and share their genomic data, including clinicians and researchers affiliated with Boston Children’s Hospital, University College Dublin, Queensland Institute of Medical Research (Australia), and Saitama Medical University (Japan).

As new institutions look to the Exchange to share genomic data, this hub holds significant potential to help advance progress in genomic-based medicine.

Learn more about the NextCODE Exchange here.

Genomics-Based Medicine Coming Into View

NextCODE Health

NextCODE Health has quickly gained recognition for its unique capabilities to address unmet needs in the genomics space through a massive genomics database that interprets DNA samples to identify relevant disease markers.

The practice and adoption of genomic medicine is accelerating as technologies improve, costs fall and new insights drive better patient care. While many companies are supporting this emerging field, a select few are providing the unique perspectives and capabilities to advance progress even faster.

NextCODE Health made headlines less than a year ago with the announcement of its launch and funding by major investors in healthcare and biotechnology. The company quickly gained recognition for its unique capabilities to address unmet needs in the genomics space through a massive genomics database that interprets DNA samples to identify relevant disease markers. (See the features in Xconomy, Bio-IT World and PLOS Blog.) The company was later mentioned in Nature Biotechnology News for its potential contributions to genome studies by leveraging key reference data from deCODE’s Icelandic work in Iceland.

Its rapid trajectory since launch and the utility of its genomic analysis technology was featured in BioCentury in May, featuring testimonials from clinicians using NextCODE capabilities to diagnose patients at Boston Children’s Hospital, the Baylor College of Medicine, and the Sanford School of Medicine. In June, it was featured in a major interview with Bio-IT World and the company continues to expand. Since then, NextCODE has announced several programs through which global pioneers in clinical genomics research are applying its interpretation and analysis technology to support research and diagnosis in rare diseases, including:

As more organizations employ genomics in major research initiatives, NextCODE’s interpretation technology will be an increasingly important asset in delivering meaningful insights from the wealth of genomic data being produced. Visit NextCode for the latest on how the future of genomics-based medicine continues to evolve.