The Chan Zuckerberg Initiative was founded by Priscilla Chan and Mark Zuckerberg in 2015 to help solve some of society’s toughest challenges — from eradicating disease and improving education to addressing the needs of our local communities. Our mission is to build a more inclusive, just, and healthy future for everyone.
CZI supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.
Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow:
CZI’s work in science includes grantmaking programs, open-source software development, and close collaboration with the Chan Zuckerberg Biohub Network. The CZ Biohub Network includes the San Francisco, Chicago, and New York Biohubs as well as the Chan Zuckerberg Imaging Institute. CZI also collaborates with institutional partners like the Kempner Institute for the Study of Natural & Artificial Intelligence at Harvard University. Join us in accelerating science.
The Principal Data Scientist will lead a team that will define and create the datasets and covariates required to train a Virtual Cell model that understands how each cell type functions at a molecular level under healthy conditions, and is able to predict the impact on cells and cell populations of genetic or environmental perturbations. You and your team will partner closely with our Science Program team to partner or generate the required datasets, with Data Engineering and ML Engineering to decide data formats, schemas, and access patterns, and with AI Research to define the annotations, covariates, and quality measures required to use the data.
CZI manages and processes scientific datasets specifically designed to enable biological modeling. We handle over 100 million fully standardized unique cells worth of single cell transcriptomic data, over 15 thousand cryoET tomograms that are in imaging datasets as large as 20TB and counting. This year, we are expanding data operations by > 10x to support a higher volume of imaging, sequencing, literature, and mass spectrometry datasets. These data are available via public resources, CELLxGENE Discover and CryoET Portal. Our resources provide access to open source data that is structured and used by tens of thousands of scientists each month to quickly query and form hypotheses on understanding how genetic variants in cells impact disease risk, define drug toxicities, and eventually discover better therapies.
As the Principal Data Scientist, you will manage a team of data scientists who are embedded in cross-functional, AI-modeling focused teams and are responsible for leading dataset definition and delivery to support modeling work. Success is measured by the speed with which datasets are available, the ease with which they can be used, the suitability of the dataset to address the modeling task, and the clarity with which data quality is available to the research team. As a manager of a small team, this role will be a player/coach position. Your individual contributions will focus on defining a strategic, integrated dataset design sufficient to realize a Virtual Cell Model, and to ensure that the individual datasets your team creates for each modeling project each create measurable progress towards the integrated design.
The Chicago, Illinois base pay range for this role is 204,850 - 307,700. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.
Pay ranges outside Redwood City are adjusted based on cost of labor in each respective geographical market.
We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.
If you’re interested in a role but your previous experience doesn’t perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
Explore our work modes, benefits, and interview process at www.chanzuckerberg.com/careers.
#LI-Hybrid