The TDS17 Workshop is an important step towards developing a community around translational data science. Translational data science is a new term that is being used for an emerging field that applies data science principles, techniques, and technologies to challenging scientific problems that hold the promise of having an important impact on human or societal welfare. The term is also used when data science principles, techniques and technologies are applied to problems in different domains in general, including—but not restricted to—science and engineering research. The workshop will bring together a group focus on this field and collaborate to write a white paper on translational data science.
We recently achieved a milestone in building out data commons technology for the Blood Profiling Atlas in Cancer (BloodPAC) by adding the first set of users. BloodPAC is a consortium effort working to accelerate the development and validation of liquid biopsy assays to improve the outcomes of patients with cancer. We are contributing our data commons technology to build out a collaborative infrastructure that enables sharing of information between stakeholders in industry, academia, and regulatory agencies.
We will be at the AACR Annual Meeting again this year offering demonstrations of the Genomic Data Commons at the NCI Exhibit #1407. Stop by and see we've been working on!
Meet the Expert Session
Date: Monday, April 3rd
Time: 10:15AM – 10:45AM ET
Location: Exhibit Booth #1407
Title: Genomic Data Commons Live Demonstration
Presenter: Michael Fitzsimons, Genomic Data Commons
Christina Yung, PhD recently joined CDIS as the Director of Data Commons for the GDC. Christina has a PhD in Biomedical Engineering from Johns Hopkins University. She was most recently at the Ontario Institute for Cancer Research, leading the technical working groups for the ICGC PanCancer Analysis of Whole Genomes project. During her time there, she managed a team of software developers and bioinformaticians from 15 institutes to develop 5 analytical pipelines; runthe pipelines on 700TB of sequencing data in 14 compute environments, and coordinate data dissemination to over 300 researchers.