Tagged: Genomics Toggle Comment Threads | Keyboard Shortcuts

  • richardmitnick 10:12 am on January 13, 2017 Permalink | Reply
    Tags: , Could affect future treatments for some types of infertility, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, Genomics, , Metabolic proteins relocate to jump-start an embryo’s genome, UCLA study finds   

    From UCLA: “Metabolic proteins relocate to jump-start an embryo’s genome, UCLA study finds” 

    UCLA bloc


    January 12, 2017
    Sarah C.P. Williams


    No image caption. No image credit.

    To turn on its genome — the full set of genes inherited from each parent — a mammalian embryo needs to relocate a group of proteins, researchers at the Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research at UCLA have discovered. The metabolic proteins, normally found in the energy-generating mitochondria of cells, move to the DNA-containing nuclei about two days after a mouse embryo is fertilized, according to the new study, led by senior author Utpal Banerjee.


    Early in development, a mammalian embryo — or zygote — has all the materials it needs to grow and divide from genes and proteins that were contained in the egg cell. But after a few cell divisions, the zygote needs to activate its own genome. Researchers have never fully understood how this shift is made. They knew that certain metabolic compounds, such a pyruvate, were required, but had also observed that the mitochondria — which normally process pyruvate into energy — were small and inactive during this stage of development.


    Banerjee, a professor of molecular, cell, and developmental biology and co-director of the UCLA Broad Stem Cell Research Center, and colleagues confirmed that pyruvate was required for zygotes to activate their genomes by growing mouse zygotes in a culture dish lacking pyruvate. Then, in both mouse and human embryos, researchers used a number of methods to determine the location of proteins that process pyruvate through a metabolic program called the TCA cycle. Just before the embryos activated their genomes, the two-cell stage in mice, the TCA cycle proteins moved from the mitochondria to the nuclei of cells, the researchers discovered. While mouse cells grown in dishes lacking pyruvate normally stopped growing at the two-cell stage, the researchers could rescue these cells by adding a metabolic compound that’s produced by the TCA cycle. Repeating some of the experiments in human embryos, they confirmed that the metabolic proteins move from the mitochondria to the nucleus just as the genome is activated — at the six- to eight-cell stage for humans.


    The importance of metabolic proteins to early embryonic development could affect future treatments for some types of infertility. In addition, the researchers hypothesize that some stem cells that have similar metabolic properties to early zygotes — including cancer stem cells — may relocate the TCA cycle proteins. Better understanding of the relocation could shed light on stem cell biology and alter cancer treatments.


    In addition to Banerjee, the first authors of the study are Raghavendra Nagaraj and Mark Sharpley; the co-authors are Daniel Braas, Fangtao Chi, Amander Clark, Rachel Kim and Yonggang Zhou, all of UCLA.


    The study was published in the journal Cell.


    The study was funded by an NIH Director’s Pioneer Award (DP1DK098059-04) and by the UCLA Broad Stem Cell Research Center.

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    UC LA Campus

    For nearly 100 years, UCLA has been a pioneer, persevering through impossibility, turning the futile into the attainable.

    We doubt the critics, reject the status quo and see opportunity in dissatisfaction. Our campus, faculty and students are driven by optimism. It is not naïve; it is essential. And it has fueled every accomplishment, allowing us to redefine what’s possible, time after time.

    This can-do perspective has brought us 12 Nobel Prizes, 12 Rhodes Scholarships, more NCAA titles than any university and more Olympic medals than most nations. Our faculty and alumni helped create the Internet and pioneered reverse osmosis. And more than 100 companies have been created based on technology developed at UCLA.

  • richardmitnick 1:59 pm on December 17, 2016 Permalink | Reply
    Tags: , Boy in the bubble, , Genomics, ,   

    From UC Berkeley: “From a single genetic mutation, secrets of ‘boy in the bubble’ disease revealed” 

    UC Berkeley

    UC Berkeley

    December 15, 2016
    Brett Israel

    UC Berkeley was part of an interdisciplinary, international research team that has identified the rare genetic mutation responsible for a unique case of “boy in the bubble” disease, known as severe combined immunodeficiency (SCID), a deadly immune system disorder. The researchers found that the cause was a mutated version of a gene called BCL11B, which also plays an unexpected role in the normal processes of immune system development.

    World of his own: David Vetter (Photo: Courtesy Baylor College of Medicine Archives) http://i2.mirror.co.uk/incoming/article3196066.ece/ALTERNATES/s810/The-Boy-in-the-Bubble.jpg. Just a single case chosen at random from many.

    The discovery of this genetic mutation is the latest of several breakthroughs from this team, which has been accomplished by analyzing exomes — the roughly 2 percent of DNA that contains the instructions for building proteins — to identify the cause of mysterious immunological diseases in newborns.

    “This is a gene that had never been associated with SCID before, which required more advanced genome analysis techniques to discover,” said Berkeley computational biologist Steven Brenner, co-author of the study. “Moreover, unlike variants in every other known SCID gene, this mutation is dominant, which means you only need one copy of this mutation to disrupt multiple aspects of development.”

    The study was published Dec. 1 in the New England Journal of Medicine. The research article was accompanied by a perspective by Michael Lenardo, chief of the Molecular Development of the Immune System Section at the National Institute of Allergy and Infectious Diseases, commissioned by the journal. Lenardo wrote that the study is “an exciting example of recent achievements in the application of contemporary molecular genomics to clinical medicine, especially with regard to congenital diseases…This study reflects remarkable advances in molecular diagnosis.”

    The infant patient featured in the new study was identified through a population-based neonatal screening approach for SCID, which was developed in 2005 by Jennifer Puck, the study’s senior author and a UCSF professor of immunology and pediatrics. The screening indicated a severely compromised immune system, leaving the patient open to a likely fatal series of infections. However, UCSF doctors performed a bone marrow transplant, the standard of care for SCID, which provided the infant with a fully functional immune system.

    In addition to SCID, however, the infant was born with a constellation of abnormal features including craniofacial deformities, loose skin, excess body hair and neurological abnormalities, which suggested that a single rare genetic defect could underlie the patient’s disease.

    In part to determine whether the infant’s parents were carriers of a genetic mutation that could be passed on to future children, the research team set out to scan the genomes of both infant and parents for mutations that could be responsible for the disease. Researchers at UC Berkeley and UCSF built on their productive collaboration with researchers at Tata Consultancy Services to use next-generation exome sequencing to identify a single mutation present in the infant but not the parents — referred to as a de novo mutation — in the BCL11B gene, which had previously been associated primarily with lymphatic cancer. So finding the BLC11B mutation to be causative for SCID was a surprise.

    “We’re entering a new era of genomic medicine,” Puck said. “Our technology has progressed to the point that we can learn a great deal about a disease, and even learn important new facts about normal biology, from just a single patient. In this case we were able to unearth the potentially unique underlying genetic cause of one patient’s disease and come away with brand new understanding of how the immune system develops.”

    In order to understand the biological effects of the patient’s mutation, the researchers collaborated with the team of David Wiest at Fox Chase Cancer Center, in Philadelphia, to introduce the patient’s mutated form of BCL11B into zebrafish, whose immune systems are similar to those of humans. They found that the mutated form of BCL11B produced abnormalities in the zebrafish that mimicked those observed in the patient, including not only a disabled immune system but also similar craniofacial abnormalities. Blocking the mutated gene and replacing it with the normal human gene in embryonic zebrafish reversed all these symptoms, strongly suggesting that abnormal BCL11B was the cause of the symptoms seen in both zebrafish and the human patient.

    The normal BCL11B protein binds to DNA at sites across the genome to activate a wide variety of developmental genes in a precisely orchestrated sequence. Experiments revealed that the BCL11B gene mutation identified in the new study disrupts this protein’s ability to bind to DNA, thereby resulting in the wide array of immunological, neurological and craniofacial disruptions seen in both the human patient and in zebrafish.

    “In this case, however, a mutation in BCL11B turned the protein it produces into a monkey wrench that disrupted many different systems in the body,” Puck said.

    According to Puck, the findings illustrate the power of deeply studying rare diseases in individual patients: “We may never get another patient just like this one,” she said. “But as a result of studying this one case we were able to learn so much about a critical gene in a critical pathway that hadn’t been appreciated before.”

    The research was supported by the National Institutes of Health, Tata Consultancy Services, the Commonwealth of Pennsylvania, the M.D. Anderson Cancer Center, the Fox Chase Cancer Center, the Jeffrey Modell Foundation, the Lisa and Douglas Goldman Fund and the Michelle Platt-Ross Foundation.

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    Founded in the wake of the gold rush by leaders of the newly established 31st state, the University of California’s flagship campus at Berkeley has become one of the preeminent universities in the world. Its early guiding lights, charged with providing education (both “practical” and “classical”) for the state’s people, gradually established a distinguished faculty (with 22 Nobel laureates to date), a stellar research library, and more than 350 academic programs.

    UC Berkeley Seal

  • richardmitnick 6:45 pm on June 2, 2016 Permalink | Reply
    Tags: A massive approach to finding what's "real" in genome-wide association data, , , Genomics   

    From Broad Institute: “A massive approach to finding what’s “real” in genome-wide association data” 

    Broad Institute

    Broad Institute

    June 2nd, 2016
    Tom Ulrich

    What could we learn if we probed the subtle effects of thousands of DNA variations on gene expression, all at once? Two recent Cell papers hint at how an assay called MPRA could help us get there.

    MPRA could help reveal DNA variants’ subtle influences on diseases and traits. Image: Sigrid Knemeyer.

    Genome-wide association studies (GWAS) have been a boon for geneticists by revealing thousands of genetic variants associated with human disease. At the same time, GWAS are the bane of geneticists because they reveal thousands of genetic variants associated with human disease. Which variants are the drivers, the ones that truly cause or contribute to disease development and progression?

    “With GWAS, you get a set of signals, which can tell you which regions of the genome are associated with a particular disease or trait,” said Vijay Sankaran, a Broad associate member and a pediatric hematologist/oncologist at Dana-Farber/Boston Children’s Cancer and Blood Disorders Center who studies blood cell disorders. “But it’s hard to know which hits are causal hits, and which are just going along for the ride.”

    The picture gets particularly complicated when talking about variants in non-coding DNA, including the vast stretches of DNA containing sequences that control gene expression. By some estimates, between 85 and 90 percent of the variants picked up by GWAS lie in such regions.

    Many scientists are trying to figure out how to connect the dots between non-coding GWAS variants and human biology, health, and ultimately, disease. Three Broad teams, led by Sankaran, Pardis Sabeti, and Broad alum Tarjei Mikkelsen (now with the biotechnology company 10X Genomics), respectively, have focused their efforts on scaling up a staple of the genomics toolkit — the reporter assay — to create a massively parallel reporter assay (MPRA).

    “We want to move from understanding the component pieces of the genome to understanding what changes in those components do,” said Sabeti, an institute member and Harvard computational geneticist and evolutionary biologist, whose lab probes the role genetic variation writ large plays in human and microbial evolution. “We need very sensitive technology to be able to identify these functional changes, particularly if they’re subtle.”

    Access mp4 video here .

    Going massive

    The reporter assay helps scientists sift through GWAS data to find variants that truly affect gene expression or function. A researcher takes a DNA fragment from what may be an enhancer, couples it within a plasmid to a “reporter” gene that provides a readout (e.g., the luciferase gene), and inserts the plasmid into cells. If the readout materializes (e.g., if the cells glow), the enhancer sequence drove expression of the reporter. By running the assay with different variations of the same fragment, a pattern can emerge suggesting whether certain variants affect expression.

    Such classic reporter assays, however, have one major disadvantage: They don’t scale to the level needed to investigate the thousands to tens of thousands of variants that might turn up in a GWAS.

    Mikkelsen and Broad research scientist Alexandre Melnikov worked out the principles of one flavor of MPRA while working in the lab of Broad founding director and president Eric Lander. In a 2012 Nature Biotechnology paper*, they noted that tagging each plasmid with a short, unique DNA barcode provided a second readout. By sequencing and counting the mRNAs produced from each plasmid, they could easily identify the variant(s) with the greatest influence on gene expression and quantify the magnitude of that influence.

    And because each barcode was unique to each plasmid, Mikkelsen and Melnikov’s team could pool and assay thousands of variants simultaneously.

    Homing in on blood cell traits

    Sankaran’s lab is the latest to make use of Mikkelsen and Melnikov’s MPRA system, harnessing it to scrutinize more than 2,750 non-coding variants in 75 GWAS hits linked to red blood cell traits. And as he, Mikkelsen, and co-first authors Jacob Ulirsch and Satish Nandakumar reported** in Cell, MPRA data pointed to 32 hits that actually had some impact on gene expression. They then used additional computational and functional assays to further probe the effects of a subset of these variants on red blood cell traits, as a result revealing that several known genes may have heretofore-unrecognized roles in blood cell development.

    “One of the unexpected lessons we learned was that many of the variants tweaked a master blood development regulator, GATA1,” said Ulirsch, a staff scientist in Sankaran’s lab. “There was a common pattern. Going one by one, variant by variant, we would never have been able to see this.”

    Clockwise from top left: Pardis Sabeti, Vijay Sankaran, Satish
    Nandakumar, Ryan Tewhey, Jacob Ulirsch. Photo: Megan Purdum

    Building MPRA 2.0

    While Mikkelsen and Melnikov’s original method is quite powerful, Sabeti’s lab wanted to see if they could make it even more robust.

    “The original version of MPRA is limited in how many variants you can test,” said Ryan Tewhey, a postdoctoral fellow in Sabeti’s lab. “We wanted to know, can you expand this technology out? Can you test tens of thousands of variants at once? And can you make it more sensitive?”

    Tewhey, Sabeti, and their team doubled the length of each DNA barcode and upped the number of barcodes to as many as 350 per variant. They then used their enhanced assay to study more than 32,000 possible B cell regulatory variants identified by the 1000 Genomes Project, deeply characterizing one associated with risk of ankylosing spondylitis (an autoimmune disease). They also highlighted another 842 candidate variants, including 53 particularly promising ones associated with human traits and diseases.

    As they discussed in their own Cell paper***, the added barcodes reduced the noise in their data and increased the assay’s overall sensitivity.

    “With more barcodes you can start to detect more subtle changes in expression, including changes that might arise from differences between alleles,” Tewhey added.
    Another view into regulation

    MPRA isn’t the only approach for pulling causal needles out of GWAS haystacks, and Tewhey is realistic that it won’t be a panacea for studying all of the cell’s mechanisms for regulating expression.

    “For promoters and enhancers, we know it works well,” he said. “For things related to long distance connectivity or the genome’s shape, we’re not as confident. ”

    Sankaran points out that MPRA really shines in its ability to find themes in genetic variation that researchers can marry to other genetic, structural, or functional data.

    “When you start to get all these independent pieces together, you get a real fine view of what’s important,” he said.

    Papers cited:

    Melnikov A, Murugan A, et al. Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay. Nature Biotechnology. February 26, 2012. DOI: 10.1038/nbt.2137

    Ulirsch JC, Nandakumar SK, et al. Systematic functional dissection of common genetic variation affecting red blood cell traits. Cell. June 2, 2016. DOI: 10:1016/j.Cell.2016.04.048

    Tewhey R, Kotliar D, et al. Direct identification of hundreds of expression-modulating variants using a multiplexed reporter assay. Cell. June 2, 2016. DOI: 10:1016/j.cell.2016.04.027

    *Science paper:
    Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay

    **Science paper:
    Systematic Functional Dissection of Common Genetic Variation Affecting Red Blood Cell Traits
    ***Science paper:
    Direct Identification of Hundreds of Expression-Modulating Variants using a Multiplexed Reporter Assay

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    Broad Institute Campus

    The Eli and Edythe L. Broad Institute of Harvard and MIT is founded on two core beliefs:

    This generation has a historic opportunity and responsibility to transform medicine by using systematic approaches in the biological sciences to dramatically accelerate the understanding and treatment of disease.
    To fulfill this mission, we need new kinds of research institutions, with a deeply collaborative spirit across disciplines and organizations, and having the capacity to tackle ambitious challenges.

    The Broad Institute is essentially an “experiment” in a new way of doing science, empowering this generation of researchers to:

    Act nimbly. Encouraging creativity often means moving quickly, and taking risks on new approaches and structures that often defy conventional wisdom.
    Work boldly. Meeting the biomedical challenges of this generation requires the capacity to mount projects at any scale — from a single individual to teams of hundreds of scientists.
    Share openly. Seizing scientific opportunities requires creating methods, tools and massive data sets — and making them available to the entire scientific community to rapidly accelerate biomedical advancement.
    Reach globally. Biomedicine should address the medical challenges of the entire world, not just advanced economies, and include scientists in developing countries as equal partners whose knowledge and experience are critical to driving progress.

    Harvard University

    MIT Widget

  • richardmitnick 11:41 am on December 22, 2015 Permalink | Reply
    Tags: , Genomics,   

    From Harvard: “Researchers help cells forget who they are” 

    Harvard University

    Harvard University

    December 21, 2015
    Hannah Robbins, Harvard Stem Cell Institute Communications

    Erasing a cell’s memory makes it easier to manipulate them into becoming another type of cell

    Induced pluripotent stem cell colonies generated after researchers at Harvard Stem Cell Institute suppressed the CAF1 gene. Photo by Sihem Cheloufi

    They say we can’t escape our past — no matter how much we change, we still have the memory of what came before. The same can be said of our cells.

    Mature cells, such as skin or blood cells, have a cellular “memory,” or record of how the cell changed as it developed from an uncommitted embryonic cell into a specialized adult cell. Now, Harvard Stem Cell Institute researchers at Massachusetts General Hospital (MGH), in collaboration with scientists from the Institutes of Molecular Biotechnology (IMBA) and Molecular Pathology (IMP) in Vienna, have identified genes that, when suppressed effectively, erase a cell’s memory, making it more susceptible to reprogramming and, consequently, making the process of reprogramming quicker and more efficient.

    The study was recently published in Nature.

    “We began this work because we wanted to know why a skin cell is a skin cell, and why does it not change its identity the next day, or the next month, or a year later?” said co-senior author Konrad Hochedlinger, an HSCI principal faculty member at MGH and Harvard’s Department of Stem Cell and Regenerative Biology, and a world expert in cellular reprogramming.

    Every cell in the human body has the same genome, or DNA blueprint, explained Hochedlinger, and it is how those genes are turned on and off during development that determines what kind of adult cell each becomes. By manipulating those genes and introducing new factors, scientists can unlock dormant parts of an adult cell’s genome and reprogram it into another cell type.

    However, “a skin cell knows it is a skin cell,” said IMBA’s Josef Penninger, even after scientists reprogram those skin cells into induced pluripotent stem cells (iPS cells) — a process that would ideally require a cell to “forget” its identity before assuming a new one.

    Cellular memory is often conserved, acting as a roadblock to reprogramming. “We wanted to find out which factors stabilize this memory and what mechanism prevents iPS cells from forming,” Penninger said.

    To identify potential factors, the team established a genetic library targeting known chromatin regulators — genes that control the packaging and bookmarking of DNA, and are involved in creating cellular memory.

    Hochedlinger and Sihem Cheloufi, co-first author and a postdoc in Hochedlinger’s lab, designed a screening approach that tested each of these factors.

    Of the 615 factors screened, the researchers identified four chromatin regulators, three of which had not yet been described, as potential roadblocks to reprogramming. In comparison to the three- to fourfold increase seen by suppressing previously known roadblock factors, inhibiting the newly described chromatin assembly factor 1 (CAF1) made the process 50- to 200-fold more efficient. Moreover, in the absence of CAF1, reprogramming turned out to be much faster: While the process normally takes nine days, the researchers could detect the first iPS cell after four days.

    “The CAF1 complex ensures that during DNA replication and cell division, daughter cells keep their memory, which is encoded on the histones that the DNA is wrapped around,” said Ulrich Elling, a co-first author from IMBA. “When we block CAF1, daughter cells fail to wrap their DNA the same way, lose this information, and covert into blank sheets of paper. In this state, they respond more sensitively to signals from the outside, meaning we can manipulate them much more easily.”

    By suppressing CAF1 the researchers were also able to facilitate the conversion of one type of adult cell directly into another, skipping the intermediary step of forming iPS cells, via a process called direct reprogramming, or transdifferentiation. Thus, CAF1 appears to act as a general guardian of cell identity whose depletion facilitates both the interconversion of one adult cell type to another as well as the conversion of specialized cells into iPS cells.

    In finding CAF1, the researchers identified a complex that allows cell memory to be erased and rewritten. “The cells forget who they are, making it easier to trick them into becoming another type of cell,” said Cheloufi.

    CAF1 may provide a general key to facilitate the “reprogramming” of cells to model disease and test therapeutic agents, IMP’s Johannes Zuber explained. “The best-case scenario,” he said, “is that with this insight, we hold a universal key in our hands that will allow us to model cells at will.”

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    Harvard University campus

    Harvard is the oldest institution of higher education in the United States, established in 1636 by vote of the Great and General Court of the Massachusetts Bay Colony. It was named after the College’s first benefactor, the young minister John Harvard of Charlestown, who upon his death in 1638 left his library and half his estate to the institution. A statue of John Harvard stands today in front of University Hall in Harvard Yard, and is perhaps the University’s best known landmark.

    Harvard University has 12 degree-granting Schools in addition to the Radcliffe Institute for Advanced Study. The University has grown from nine students with a single master to an enrollment of more than 20,000 degree candidates including undergraduate, graduate, and professional students. There are more than 360,000 living alumni in the U.S. and over 190 other countries.

  • richardmitnick 5:49 pm on December 7, 2015 Permalink | Reply
    Tags: , , Genomics   

    From Caltech: “Unlocking the Chemistry of Life” 

    Caltech Logo

    Jessica Stoller-Conrad

    No image credit

    In just the span of an average lifetime, science has made leaps and bounds in our understanding of the human genome and its role in heredity and health—from the first insights about DNA structure in the 1950s to the rapid, inexpensive sequencing technologies of today. However, the 20,000 genes of the human genome are more than DNA; they also encode proteins to carry out the countless functions that are key to our existence. And we know much less about how this collection of proteins supports the essential functions of life.

    In order to understand the role each of these proteins plays in human health—and what goes wrong when disease occurs—biologists need to figure out what these proteins are and how they function. Several decades ago, biologists realized that to answer these questions on the scale of the thousands of proteins in the human body, they would have to leave the comfort of their own discipline to get some help from a standard analytical-chemistry technique: mass spectrometry. Since 2006, Caltech’s Proteome Exploration Laboratory (PEL) has been building on this approach to bridge the gap between biology and chemistry, in the process unlocking important insights about how the human body works.

    Scientists can easily sequence an entire genome in just a day or two, but sequencing a proteome—all of the proteins encoded by a genome—is a much greater challenge says Ray Deshaies, protein biologist and founder of the PEL. “One challenge is the amount of protein. If you want to sequence a person’s DNA from a few of their cheek cells, you first amplify—or make copies of—the DNA so that you’ll have a lot of it to analyze. However, there is no such thing as protein amplification,” Deshaies says. “The number of protein molecules in the cells that you have is the number that you have, so you must use a very sensitive technique to identify those very few molecules.” The best means available for doing this today is called shotgun mass spectrometry, Deshaies says. In general, mass spectrometry allows researchers to identify the amount and types of molecules that are present in a biological sample by separating and analyzing the molecules as gas ions, based on mass and charge; shotgun mass spectrometry—a combination of several techniques—applies this separation process specifically to digested, broken-down proteins, allowing researchers to identify the types and amounts of proteins that are present in a heterogeneous mixture.

    The first step of shotgun mass spectroscopy entails digesting a mixture of proteins into smaller fragments called peptides. The peptides are then separated based on their physical properties, and then they are sprayed into a mass spectrometer and blasted apart via collisions with gas molecules such as helium or nitrogen—a process that creates a unique fragmentation pattern for each peptide. This pattern, or “fingerprint,” of each peptide’s fragmentation can then be searched on a database and used to identify the protein this peptide came from.

    download mp4 video here.

    “Up until this technique was invented, people had to take a mixture of proteins, run a current through a polyacrylamide gel to separate the proteins by size, stain the proteins, and then physically cut the stained bands out of the gel to have each individual protein species sequenced,” says Deshaies. “But mass spectrometry technology has gotten so good that we can now cast a broader net by sequencing everything, then use data analysis to figure out what specific information is of interest after the dust settles down.”

    Deshaies began using this shotgun mass spectrometry in the late 1990s, but because the technology was still very new, all of the protein analysis had to be done at the outside laboratories that were inventing the methodology.

    In 2001, after realizing the potential of this field-changing technology, he and colleague Barbara Wold, the Bren Professor of Molecular Biology, applied for and received a Department of Energy grant for their very own mass spectrometer. When the instrument arrived on campus, demand began to surge. “Barbara and I were first just doing experiments for our own labs, but then other people on campus wanted us to help them apply this technology to their research problems,” Deshaies says.

    So he and Wold began campaigning for a larger, ongoing center where anyone could begin using mass spectrometry resources for protein research. In 2006, Deshaies and then chair of the Division of Biology (now the Division of Biology and Biological Engineering) Elliot Meyerowitz petitioned the Gordon and Betty Moore Foundation to secure funding for a formal Proteome Exploration Laboratory, as part of the foundation’s commitment to Caltech.

    The influx of cash dramatically expanded the capabilities and resources that were available to the PEL, allowing it to purchase the best and fastest mass spectrometry instruments available. But just as importantly, it also meant that the PEL could expand its human resources, Deshaies adds. Mostly students were running the instruments in the Deshaies lab, he says, so when they graduated or moved on, gaps were left in expertise. Sonja Hess came to Caltech in 2007 to fill that gap as director of the PEL.

    Hess, who came from a proteomics lab at the National Institutes of Health, knew the challenges of running an interdisciplinary center such as the PEL. Although the field of proteomics holds great promise for understanding big questions in many fields, including biology and medicine, mass spectrometry is still a highly technical method involving analytical chemistry and data science—and it’s a technique that many biologists were never trained in. Conversely, many chemists and mass spectrometry technicians don’t necessarily understand how to apply the technique to biological processes.

    By encouraging dialogue between these two sides, Hess says that the PEL crosses that barrier, helping apply mass spectrometry techniques to diverse research questions from more than 20 laboratories on campus. Creating this interdisciplinary and resource-rich environment has enabled a wide breadth of discoveries, says Hess. One major user of the PEL, chemist David Tirrell, has used the center for many collaborations involving a technique he developed with former colleagues Erin Schuman and Daniela Dieterich called BONCAT (for “bioorthogonal noncanonical amino-acid tagging”). BONCAT uses synthetic molecules that are not normally found in proteins in nature and that carry particular chemical tags. When these artificial amino acids are incubated with certain cells, they are taken up by the cells and incorporated into all newly formed proteins in those cells.

    The tags then allow researchers to identify and pull out proteins from the cells, thus enabling them to wash away all of the other untagged proteins from other cells that aren’t of interest. When this method is combined with mass spectrometry techniques, it enables researchers to achieve specificity in their results and determine which proteins are produced in a particular subset of cells during a particular time. “In my own laboratory, we work at making sure the method is adapted appropriately to the specifics of a biological problem. But we rely on collaborations with other laboratories to help us understand what the demands on the method are and what kinds of questions would be interesting to people in those fields,” Tirrell says.

    For example, Tirrell collaborated with biologist Paul Sternberg and the PEL, using BONCAT and mass spectrometry to analyze specific proteins from a few cells within a whole organism, a feat that had never been accomplished before. Using the nematode C. elegans, Sternberg and his team applied the BONCAT technique to tag proteins in the 20 cells of the worm’s pharynx, and then used the PEL resources to analyze proteome-wide information from just those 20 cells. The results, including identification of proteins that were not previously associated with the pharynx, were published in PNAS in 2014.

    The team is now trying to target the experiment to a single pair of neurons that help the worm to sense and avoid harmful chemicals—a first step in learning which proteins are essential to producing this responsive behavior. But analyzing protein information from just two cells is a difficult experiment, says Tirrell. “The challenge comes in separating out the proteins that are made in those two cells from the proteins in the rest of the hundreds of cells in the worm’s body. You’re only interested in two cells, but to get the proteins from those two cells, you’re essentially trying to wash away everything else— about 500 times as much ‘junk’ protein as the protein that you’re really interested in,” he says. “We’re working on these separation methods now because the ultimate experiment would be to find a way to use BONCAT and mass spec to pull out proteomic information from a single cell in an animal.”

    This next step is a big one, but Tirrell says that an advantage of the PEL is that the laboratory’s staff can focus on optimizing the very technical mass spectrometry aspects of an experiment, while researchers using the PEL can focus more holistically on the question they’re trying to answer. This was also true for biologist Mitch Guttman, who asked the laboratory to help him develop a mass spectrometry–based technique for identifying the proteins that hitchhike on a class of RNA genes called lncRNAs. Long noncoding RNAs—or lncRNAs (pronounced “link RNAs”) for short—are abundant in the human genome, but scientists know very little about how they work or what they do.

    Although it’s known that protein-coding genes start out as DNA, which is transcribed into RNA, which is then translated into the gene product, a protein, lncRNAs are never translated into proteins. Instead, they’re thought to act as scaffolds, corralling important proteins and bringing them to where they’re needed in the cell. In a study published in April 2015 in Nature, Guttman used a specific example of a lncRNA, a gene called Xist, to learn more about these hitchhiking proteins.

    “The big challenge to doing this was technical; we’ve never had a way to identify what proteins are actually interacting with a lncRNA molecule. By working with the PEL, we were able to develop a method based on mass spectrometry to actually purify and identify this complex of proteins interacting with a lncRNA in living cells,”Guttman says. “Once we had that information, we could really start to ask ourselves questions about these proteins and how are they working.”

    Using this new method, called RNA antisense purification with mass spectrometry (RAP-MS), Guttman’s lab determined that 10 proteins associate with the lncRNA Xist, and that three of those 10 are essential to the gene’s function—inactivating the second X chromosome in women, a necessary process that, if interrupted, results in the death of female embryos early in development. Guttman’s findings marked the first time that anyone had uncovered the detailed mechanism of action for an lncRNA gene. For decades, other research groups had been trying to solve this problem; however, the collaborative development of RAP-MS in the PEL provided the missing piece.

    Even Deshaies, who began doing shotgun mass spectrometry experiments in his own laboratory, now exclusively uses the PEL’s resources and says that the laboratory has played an essential support role in his work. He studies the normal balance of proteins in a cell and how this balance changes during disease. In a 2013 study published in Cell, his laboratory focused on a dynamic network of protein complexes called SCF complexes, which go through cycles of assembly and dissociation in a cell, depending on when they are needed.

    Because there was no insight into how these complexes form and disassemble, Deshaies and his colleagues used the PEL to quantitatively monitor how this protein network’s dynamics were changing within cells. They determined that SCF complexes are normally very stable, but in the presence of a protein called Cand1 they become very dynamic and rapidly exchange subunits. Because some components of the SCF complex have been implicated in the development of human diseases such as cancers, work is now being done to see if Cand1 holds promise as a target for a cancer therapeutic.

    Although Deshaies says that the PEL resources have become invaluable to his work, he adds that what makes the laboratory unique is how it benefits the entire institute—a factor that he hopes will encourage further support for its mission. “The value of the PEL is not just about what it contributes to my lab or to Dave Tirrell’s lab or to anyone else’s,” he says. “It’s about the breadth of PEL’s impact—the 20 or so labs that are bringing in samples and using this operation every year to do important work, like solving the mechanism of X-chromosome inactivation in females.”

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    The California Institute of Technology (commonly referred to as Caltech) is a private research university located in Pasadena, California, United States. Caltech has six academic divisions with strong emphases on science and engineering. Its 124-acre (50 ha) primary campus is located approximately 11 mi (18 km) northeast of downtown Los Angeles. “The mission of the California Institute of Technology is to expand human knowledge and benefit society through research integrated with education. We investigate the most challenging, fundamental problems in science and technology in a singularly collegial, interdisciplinary atmosphere, while educating outstanding students to become creative members of society.”
    Caltech buildings

  • richardmitnick 11:12 am on October 19, 2015 Permalink | Reply
    Tags: 4D, , Genomics,   

    From U Washington: “Researchers win $12-million to study the human genome in 4-D” 

    U Washington

    University of Washington

    Michael McCarthy

    A computer-generated three-dimensional model of the yeast genome, which UW researchers described in a paper in the journal Nature in 2010.

    In order to fit within the nucleus of a cell, the human genome must bend, fold and coil into an unimaginably compact shape – and still function. This is no mean feat: The human genome is about 6.5 feet long, and the average cell nucleus is only 6 to10 micrometers (one-millionth of a meter) in diameter.

    How this happens and the genome’s three-dimensional shape within the nucleus are unknown. Nor is it known how the shape changes over time – the fourth dimension – as a cell develops, grows and goes about its specialized functions.

    “There’s a tendency to talk about the genome as a linear sequence and to forget about the fact that it’s folded,” said Dr. Jay Shendure, University of Washington associate professor of genome sciences and investigator with the Howard Hughes Medical Institute.

    William Noble, left, and Jay Shendure will co-direct the UW Center for Nuclear Organization and Function.

    “To understand how the different parts of the genome talk to each other to control gene expression, we need to understand how the different elements are arranged in relation to each other in three-dimensional space.”

    To puzzle out this information and its effect on cell function in health and disease, UW researchers will join peers at five other academic institutions to create the Nuclear Organization and Function Interdisciplinary Consortium.

    Underwriting the consortium is the National Institutes of Health’s 4D Nucleome program. The UW was awarded $12 million over five years to conduct research in its new Center for Nuclear Organization and Function. Shendure and William Stafford Noble, a professor of genome sciences and computer science, will co-lead.

    UW researchers will first develop tools to work out the three- and four-dimensional architecture of the nucleome and to create computer models that predict changes in the architecture as cells grow, divide and differentiate into different types.

    The results of this work will then be tested in mouse and human cell lines and, if confirmed, be used to understand how changes in nuclear architecture affect development of normal and abnormal heart muscle.

    All tools and data developed by the project will be shared with researchers in and outside of the 4D Nucleome network of researchers and with the public.

    Other investigators who will be working on the project include: Cole Trapnell, assistant professor of genome sciences; Christine Disteche, professor of pathology; Zhijun Duan, research assistant professor of medicine (hematology); and Dr. Charles Murry, professor of pathology and interim director of the UW Institute of Institute for Stem Cell and Regenerative Medicine.

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    The University of Washington is one of the world’s preeminent public universities. Our impact on individuals, on our region, and on the world is profound — whether we are launching young people into a boundless future or confronting the grand challenges of our time through undaunted research and scholarship. Ranked number 10 in the world in Shanghai Jiao Tong University rankings and educating more than 54,000 students annually, our students and faculty work together to turn ideas into impact and in the process transform lives and our world. For more about our impact on the world, every day.

    So what defines us — the students, faculty and community members at the University of Washington? Above all, it’s our belief in possibility and our unshakable optimism. It’s a connection to others, both near and far. It’s a hunger that pushes us to tackle challenges and pursue progress. It’s the conviction that together we can create a world of good. Join us on the journey.

  • richardmitnick 12:28 pm on October 16, 2015 Permalink | Reply
    Tags: , , Genomics, MIT Whitehead Institute   

    From Broad Institute: “Screen of human genome reveals set of genes essential for cellular viability” 

    Broad Institute

    Broad Institute

    October 15th, 2015

    Whitehead Institute MIT

    Whitehead Institute Communications

    Using two complementary analytical approaches, scientists at Whitehead Institute of MIT and Broad Institute of MIT and Harvard have for the first time identified the universe of genes in the human genome essential for the survival and proliferation of human cell lines or cultured human cells.

    Their findings and the materials they developed in conducting the research will not only serve as invaluable resources for the global research community but should also have application in the discovery of drug-targetable genetic vulnerabilities in a variety of human cancers.

    Scientists have long known the essential genes in microorganisms, such as yeast, whose genomes are smaller and more easily manipulated. Most common yeast strains, for example, are haploid, meaning that genes exist in single copies, making it fairly simple for researchers to eliminate or “knock out” individual genes and assess the impact on the organism. However, owing to their greater complexity, diploid mammalian genomes, including the human genome, have been resistant to such knockout techniques—including RNA interference, which is hampered by off-target effects and incomplete gene silencing.

    Diploid cells have two homologous copies of each chromosome.

    Now, however, through use of the breakthrough CRISPR (for clustered regularly interspersed short palindromic repeats) genome editing system , researchers in the labs of Whitehead Member David Sabatini and Broad Institute Director Eric Lander have been able to generate a genome-wide library of single-guide RNAs (sgRNAs) to screen for and identify the genes required for cellular viability.

    Diagram of the CRISPR prokaryotic viral defense mechanism

    The sgRNA library targeted slightly more than 18,000 genes, of which approximately 10% proved to be essential. These findings are reported online this week in the journal Science.

    “This is the first report of human cell-essential genes,” says Tim Wang, a graduate student in the Sabatini and Lander labs and first author of the Science paper. “This answers a question people have been asking for quite a long time.”

    As might have been expected, Wang says that many of the essential genes are involved in fundamental biological processes, including DNA replication, RNA transcription, and translation of messenger RNA. But, as Wang also notes, approximately 300 of these essential genes are of a class not previously characterized, are largely located in the cellular compartment known as the nucleolus, and are associated with RNA processing. Wang says the precise function of these genes is the subject of future investigation.

    Nucleus. The nucleolus is contained within the cell nucleus.

    To validate the results of the CRISPR screens, the group took the added step of screening for essential genes in a unique line of haploid human cells. Using an approach known as gene-trap mutagenesis (a method pioneered in part by former Whitehead Fellow Thijn Brummelkamp) in the haploid cells and comparing it to the CRISPR results, the researchers found significant, consistent overlap in the gene sets found to be essential. In a final step, the group tested their approaches in cell lines derived from two cancers, chronic myelogenous leukemia (CML) and Burkitt’s lymphoma, both of which have been extensively studied. The novel method not only identified the essentiality of the known genes—in the case of CML, it hit on the BCR and ABL1 genes, whose translocation is the target of the successful drug Gleevec—but also highlighted additional genes that may be therapeutic targets in these cancers.

    “The ability to zero in on the essential genes in the highly complex human system will give us new insight into how diseases, such as cancer, continue to resist efforts to defeat them,” Lander says.

    Wang, Lander, and Sabatini are enthusiastic about the potential applications of their work, as it should accelerate the identification of cancer drug targets while enhancing our understanding of the evolution of drug resistance, a major contributor to therapeutic failure. The researchers attribute this vast potential to the rigor that CRISPR brings to human genetics.

    “This is really the first time we can reliably, accurately, and systematically study genetics in mammalian cells,” Sabatini says. “It’s remarkable how well it’s working.”

    This work was supported by the National Institutes of Health (grant CA103866), the National Human Genome Research Institute (grant 2U54HG003067-10), the National Science Foundation, the MIT Whitaker Health Sciences Fund, and the Howard Hughes Medical Institute.

    About Whitehead Institute

    The Whitehead Institute is a world-renowned non-profit research institution dedicated to improving human health through basic biomedical research. Wholly independent in its governance, finances, and research programs, Whitehead shares a close affiliation with Massachusetts Institute of Technology through its faculty, who hold joint MIT appointments. For more information about the Whitehead Institute, go to wi.mit.edu.

    See the full article here .

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition

    Broad Institute Campus

    The Eli and Edythe L. Broad Institute of Harvard and MIT is founded on two core beliefs:

    This generation has a historic opportunity and responsibility to transform medicine by using systematic approaches in the biological sciences to dramatically accelerate the understanding and treatment of disease.
    To fulfill this mission, we need new kinds of research institutions, with a deeply collaborative spirit across disciplines and organizations, and having the capacity to tackle ambitious challenges.

    The Broad Institute is essentially an “experiment” in a new way of doing science, empowering this generation of researchers to:

    Act nimbly. Encouraging creativity often means moving quickly, and taking risks on new approaches and structures that often defy conventional wisdom.
    Work boldly. Meeting the biomedical challenges of this generation requires the capacity to mount projects at any scale — from a single individual to teams of hundreds of scientists.
    Share openly. Seizing scientific opportunities requires creating methods, tools and massive data sets — and making them available to the entire scientific community to rapidly accelerate biomedical advancement.
    Reach globally. Biomedicine should address the medical challenges of the entire world, not just advanced economies, and include scientists in developing countries as equal partners whose knowledge and experience are critical to driving progress.

    Harvard University

    MIT Widget

  • richardmitnick 10:16 am on October 15, 2015 Permalink | Reply
    Tags: , Genomics   

    From The Uncovering Genome Mysteries project at WCG: “Analyzing a wealth of data about the natural world” 

    New WCG Logo

    14 Oct 2015
    Wim Degrave, Ph.D.
    Laboratório de Genômica Funcional e Bioinformática Instituto Oswaldo Cruz – Fiocruz

    The Uncovering Genome Mysteries project has already amassed data on over 200 million proteins, with the goal of understanding the common features of life everywhere on earth. There are tens of millions of calculations still to run, but the team is also making preparations for analysis and eventual publication of the data.


    For almost a year now, Uncovering Genome Mysteries has been comparing protein sequences derived from the genomes of nearly all living organisms analyzed to date. Thanks to the volunteers that contribute computer time to World Community Grid, more than 34 million results have been returned with data on functional identification and protein similarities. Along with our collaborators in Australia, we’ve paid particular attention to microorganisms from different ecosystems, with special emphasis on marine organisms. More than 200 million proteins have been compared thus far, during the equivalent of 15,000 years of computation. The resulting data are sent to our computer servers at the Fiocruz Foundation in Rio de Janeiro, Brazil and now also to the University of New South Wales, Sydney, Australia. A last set of around 20 million protein sequences, determined over the last year, is now being added to the dataset and will be run on World Community Grid in the coming months.

    However, the task of functional mapping and comparison between proteins from all these organisms does not end there. Our team of scientists is, in the meantime, investing more efforts to optimize the algorithms for further analysis and representation of the data generated by World Community Grid volunteers, and preparing for the database systems that will make the results available to the scientific community. Once our data is public, we expect that the scientific community’s understanding of the intricate network of life will gain a completely new perspective, and that results will also contribute to the development of many new applications in health, agriculture and life sciences in general.

    This project is a cooperation between World Community Grid, the laboratory of Dr. Torsten Thomas and his team from the School of Biotechnology and Biomolecular Sciences & Centre for Marine Bio-Innovation at the University of New South Wales, Sydney, Australia, and our team at the Laboratory for Functional Genomics and Bioinformatics, at the Oswaldo Cruz Foundation – Fiocruz, in Brazil.

    See the full article here.

    Please help promote STEM in your local schools.
    STEM Icon

    Stem Education Coalition

    World Community Grid (WCG) brings people together from across the globe to create the largest non-profit computing grid benefiting humanity. It does this by pooling surplus computer processing power. We believe that innovation combined with visionary scientific research and large-scale volunteerism can help make the planet smarter. Our success depends on like-minded individuals – like you.”

    WCG projects run on BOINC software from UC Berkeley.

    BOINC is a leader in the field(s) of Distributed Computing, Grid Computing and Citizen Cyberscience.BOINC is more properly the Berkeley Open Infrastructure for Network Computing.

    BOINC WallPaper


    “Download and install secure, free software that captures your computer’s spare power when it is on, but idle. You will then be a World Community Grid volunteer. It’s that simple!” You can download the software at either WCG or BOINC.

    Please visit the project pages-
    Outsmart Ebola together

    Outsmart Ebola Together

    Mapping Cancer Markers

    Uncovering Genome Mysteries
    Uncovering Genome Mysteries

    Say No to Schistosoma

    GO Fight Against Malaria

    Drug Search for Leishmaniasis

    Computing for Clean Water

    The Clean Energy Project

    Discovering Dengue Drugs – Together

    Help Cure Muscular Dystrophy

    Help Fight Childhood Cancer

    Help Conquer Cancer

    Human Proteome Folding


    World Community Grid is a social initiative of IBM Corporation
    IBM Corporation

    IBM – Smarter Planet

  • richardmitnick 10:52 am on August 21, 2015 Permalink | Reply
    Tags: , Genomics,   

    From UCSC: Seagate gift supports UC Santa Cruz research on genomic data storage 

    UC Santa Cruz

    UC Santa Cruz

    August 20, 2015
    Tim Stephens

    Researchers in the Baskin School of Engineering at UC Santa Cruz are working with industry partner Seagate Technologies on new ways to structure and store massive amounts of genomic data. Seagate has donated data storage devices with a total capacity of 2.5 petabytes to support this effort.

    “This gift provides the basis for a major research program on storage of genomic data,” said Andy Hospodor, executive director of the Storage Systems Research Center (SSRC) at UC Santa Cruz.

    “Seagate is pleased to be a part of this important research effort. The storage requirements for genomics are staggering and the potential for medical breakthroughs even larger,” said Mark Re, senior vice president and CTO at Seagate.

    The gift, valued at $250,000, includes 1 petabyte of Seagate’s new Kinetic disk drives for object-based storage, plus an additional 1.5 petabytes of traditional Seagate SATA disk drives for use in existing clusters within the UC Santa Cruz Genomics Institute.

    Ethan Miller, professor of computer science, directs the Center for Research in Storage Systems (CRSS). (Photo by Elena Zhukova)

    Large-scale test bed

    “This gives us a large-scale test bed that we can use to explore the organization of data for large-scale disk-based storage systems. We need to develop better ways to store and organize the vast quantities of data we’re generating,” said Ethan Miller, professor of computer science and director of the Center for Research in Storage Systems (CRSS) at UCSC.

    Miller and other storage systems researchers at UC Santa Cruz work closely with industry partners such as Seagate, and several of the center’s alumni and graduate students have been working at Seagate on the company’s latest disk technology. The Seagate storage donation will support research on new ways to structure and store genomic data using object stores and newly proposed open-source standards (APIs) for genomic data that are being developed by the Global Alliance for Genomics and Health.

    “Genomic data storage is one of several areas of emerging interest where we’ll be looking at using Seagate’s new intelligent disks to build large-scale storage systems,” Miller said.

    Genomics Institute

    The donation also adds over a petabyte of storage capacity to the genomics data storage cluster maintained by the UC Santa Cruz Genomics Institute at the San Diego Supercomputing Center. For Benedict Paten, a research scientist at the Genomics Institute, it’s all about speeding up the processing of genomic data.

    “We in genomics know that we have a big data problem,” Paten said. “We need to be able to compute on much larger volumes of data than we have before. The amount of genomic data is growing exponentially, and we haven’t been keeping up.”

    Part of the solution, he said, is distributed processing of large data sets in which the processing is done where the data are stored, instead of downloading the data over a network for processing. “Now we can put a lot of disks on the compute nodes for efficient distributed computation over large amounts of data. This donation is really important for our big data genomics efforts at UC Santa Cruz,” Paten said.

    See the full article here.

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition
    The University of California, Santa Cruz, opened in 1965 and grew, one college at a time, to its current (2008-09) enrollment of more than 16,000 students. Undergraduates pursue more than 60 majors supervised by divisional deans of humanities, physical & biological sciences, social sciences, and arts. Graduate students work toward graduate certificates, master’s degrees, or doctoral degrees in more than 30 academic fields under the supervision of the divisional and graduate deans. The dean of the Jack Baskin School of Engineering oversees the campus’s undergraduate and graduate engineering programs.

  • richardmitnick 11:02 am on July 23, 2015 Permalink | Reply
    Tags: , Genomics,   

    From UCSC: “Keck Foundation awards UC Santa Cruz $2 million for human genome variation project” 

    UC Santa Cruz

    UC Santa Cruz

    July 22, 2015
    Tim Stephens

    The UC Santa Cruz Genomics Institute has received a $2 million grant from the W. M. Keck Foundation for ongoing research to develop a comprehensive map of human genetic variation. The Human Genome Variation Map will be a valuable new resource for medical researchers, as well as for basic research on human evolution and diversity.

    Human Genome Variation Map

    The Keck grant provides funding over two years for UC Santa Cruz researchers to create a full-scale map, building on the results of a one-year pilot project funded by the Simons Foundation.

    “We’ve been experimenting with pilot regions of the genome and evaluating a variety of methods. The next steps will be to take it from a prototype to a full-scale genome reference that we can release to the community,” said Benedict Paten, a research scientist at the Genomics Institute and co-principal investigator of the project.

    Benedict Paten (Photo by Summer Stiegman)

    The Human Genome Variation Map is needed to overcome the limitations of using a single reference sequence for the human genome. Currently, new data from sequencing human genomes is analyzed by mapping the new sequences to one reference set of 24 human chromosomes to identify variants. But this approach leads to biases and mapping ambiguities, and some variants simply cannot be described with respect to the reference genome, according to David Haussler, distinguished professor of biomolecular engineering and scientific director of the Genomics Institute at UC Santa Cruz.

    Global Alliance

    Haussler and Paten are coordinating their work on the new map with the Global Alliance for Genomics and Health (GA4GH), which involves more than 300 collaborating institutions that have agreed to work together to enable secure sharing of genomic and clinical data. The overall vision of the global alliance includes a genomics platform based on something akin to the planned Human Genome Variation Map, along with open-source software tools to enable researchers to mine the data for new scientific and medical breakthroughs. In the long run, the map will be used to identify genomic variants encountered in precision medical care as well, Haussler said.

    The UCSC team has been collaborating with leading genomics researchers at other institutions to develop the map, which Paten began working on in 2014 as co-chair of the GA4GH Reference Variation Task Team. The new Human Genome Variation Map will replace the current assortment of isolated, incompatible databases of human genetic variation with a single, fundamental representation formalized as a very large mathematical graph. The clean mathematical formulation is a major strength of this new approach, Paten said.

    The primary reference genome is a linear sequence of DNA bases (represented by the letters A, C, T, and G). To build the Human Genome Variation Map, each new genome will be merged into the reference genome at the points where it matches the primary sequence, with variations appearing as additional alternate paths in the map.

    Mathematical structure

    This mathematical graph-based structure will augment the existing human reference genome with all common human variations, providing a means to name, identify, and analyze variations precisely and reproducibly. “The original human reference genome project gave us a detailed picture of one human genome. This map will give us a detailed picture of the world’s variety of human genomes,” Paten said.

    In the spirit of the original human genome project, the Human Genome Variation Map will be publicly and freely available to all. Haussler’s team at UC Santa Cruz made the first human genome sequence publicly available on the Internet 15 years ago. This new project has many parallels with that earlier work, in which UCSC genomics researchers assembled and posted the first human genome sequence and went on to create the widely used UCSC Genome Browser.

    “This is an infrastructure project for genomics that everyone agrees is important,” Paten said. “It is ambitious, and it requires a fundamental shift from thinking of the reference as one sequence to thinking of it as this structure that incorporates all variation. But now is the time to do it. We need to build a model that works, and make it easy enough to use to get community acceptance.”

    The UC Santa Cruz Genomics Institute is a fundraising priority of the $300-million Campaign for UC Santa Cruz.

    W. M. Keck Foundation

    Based in Los Angeles, the W. M. Keck Foundation was established in 1954 by the late W. M. Keck, founder of the Superior Oil Company. The Foundation’s grant making is focused primarily on pioneering efforts in the areas of medical, science and engineering research. The Foundation also maintains an undergraduate education program that promotes distinctive learning and research experiences for students in the sciences and in the liberal arts, and a Southern California Grant Program that provides support for the Los Angeles community, with a special emphasis on children and youth from low-income families, special needs populations and safety-net services. For more information, please visit www. wmkeck.org.

    See the full article here.

    Please help promote STEM in your local schools.

    STEM Icon

    Stem Education Coalition
    The University of California, Santa Cruz, opened in 1965 and grew, one college at a time, to its current (2008-09) enrollment of more than 16,000 students. Undergraduates pursue more than 60 majors supervised by divisional deans of humanities, physical & biological sciences, social sciences, and arts. Graduate students work toward graduate certificates, master’s degrees, or doctoral degrees in more than 30 academic fields under the supervision of the divisional and graduate deans. The dean of the Jack Baskin School of Engineering oversees the campus’s undergraduate and graduate engineering programs.

Compose new post
Next post/Next comment
Previous post/Previous comment
Show/Hide comments
Go to top
Go to login
Show/Hide help
shift + esc
%d bloggers like this: