{"id":497607,"date":"2024-01-26T10:03:36","date_gmt":"2024-01-26T13:03:36","guid":{"rendered":"https:\/\/revistapesquisa.fapesp.br\/?p=497607"},"modified":"2024-02-06T11:11:55","modified_gmt":"2024-02-06T14:11:55","slug":"the-era-of-whole-genome-sequencing","status":"publish","type":"post","link":"https:\/\/revistapesquisa.fapesp.br\/en\/the-era-of-whole-genome-sequencing\/","title":{"rendered":"The era of whole genome sequencing"},"content":{"rendered":"<p>Imagine if you were on a first date and wanted to check if your dinner partner really does have Italian ancestry as they claim. What if all you had to do to find out was to rub a cotton swab around the rim of their glass while they were distracted, put the DNA sample in a pocket sequencer, connect it to your cell phone, and have an answer within minutes. If you think that sounds like science fiction, you may be in for a surprise. Miniature devices that read genetic material in real time already exist and can be used to identify disease-causing pathogens. The gadget is part of the genome revolution that is allowing complete DNA sequencing to be performed in shorter and shorter time frames, with fewer errors and at ever-decreasing costs.<\/p>\n<p>Anyone who was an adult 20 years ago probably remembers the genome sequencing era, when the complete sequences of a series of organisms were released for the first time, with an emphasis on the Human Genome Project (HGP), which announced an almost complete human sequence in 2001. Major initiatives \u2014 some in Brazil funded by FAPESP (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/results-over-the-past-decade\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue<\/em> <em>n\u00ba 174<\/em><\/a>) \u2014 revealed the genetic material of organisms such as the bacteria <em>Xylella fastidiosa<\/em>, which damages orange groves, and various forms of cancer, offering a boost to genetics laboratories and developing potential in bioinformatics.<\/p>\n<p>Since then, the data has been shared with the scientific community and research has made it possible to identify a multitude of genes linked to health problems, but with an as-yet limited impact on patients&#8217; lives (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/legacies-of-the-genome\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue<\/em> <em>n\u00ba<\/em>\u00a0<em>284<\/em><\/a>), especially for complex diseases, such as diabetes and cancer, which involve the action of many DNA segments responsible for the production of proteins.<\/p>\n<p>\u201cThe Human Genome Project inaugurated modern genetics,\u201d says bioinformatician Pedro Galante, head of the bioinformatics laboratory at the Teaching and Research Institute of S\u00edrio-Liban\u00eas Hospital (IEP-HSL) in S\u00e3o Paulo. \u201cWe discovered many genes associated with a vast range of traits.\u201d Carlos Menck, a molecular biologist from the Institute of Biomedical Sciences at the University of S\u00e3o Paulo (ICB-USP), reports that the use of next-generation sequencing around 10 years ago made a huge difference in the diagnosis of the disease Xeroderma pigmentosum, patients with which cannot be exposed to sunlight (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/from-the-lab-to-the-field\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue<\/em> <em>n\u00ba<\/em>\u00a0<\/a><em>323 and sidebar<\/em>). \u201cWe will soon be publishing an article with the mutations detected in 150 patients,\u201d he says. He also sees the approaching possibility that this knowledge will give rise to treatments based on gene editing or regulation using tools such as CRISPR-Cas9 and RNA interference.<\/p>\n<div class=\"box\"><\/p>\n<details>\n<summary><strong>\u25ba New oncology<br \/>\n(click to learn more)<\/strong><\/summary>\n<p>Pedro Galante from IEP-HSL points out that oncology is at a much higher level than it was just a few years ago and that more advances are coming quickly. It is now possible to direct aggressive \u2014 and extremely expensive \u2014 treatments to the patients who are most likely to benefit based on the mutation profile of the tumors and the patient&#8217;s immune system, among other factors. \u201cThis is thanks to discoveries made over the last five years.\u201d<\/p>\n<p>One example, published in the scientific journal Cancers this year, was the discovery of 19 highly active olfactory receptor genes, normally involved in the detection of odors, inside acute myeloid leukemia cells \u2014 a type of cancerous cell for which there is no good treatment. It was a surprising finding that could improve early diagnosis of the disease and possibly reveal potential therapeutic targets.<\/p>\n<p>Skin tumors among xeroderma pigmentosa patients have greater insertion of transposable elements, known as retrotransposons, according to an article by a group led by Carlos Menck from ICB-USP, published in the journal Carcinogenesis in May. \u201cWe were the first to show this, thanks to a bioinformatics tool developed by Pedro Galante,\u201d says Menck. The study indicates that these insertions cause instability in the genome and may be the cause of the tumors, as well as identifying an enzyme with an important role in maintaining stability. \u201cWe are now investigating the mechanism of action.\u201d<\/p>\n<\/details>\n<p><\/div>\n<p>The way we look at the genome is changing radically, focusing not only on a very small percentage of genes that were previously considered essential but also on the complete DNA sequence, and comparing a large number of individuals and even species. It is an exciting time. In May, Galante attended The Biology of Genomes conference at the Cold Spring Harbor Laboratory in the USA, the most renowned in the field. He recalls that in the closing lecture, American geneticist Evan Eichler of the University of Washington stated that he had been waiting years for the field to arrive at the moment in which it now finds itself. There is great anticipation of the major advances that genomic research could provide, especially in medicine.<\/p>\n<div id=\"attachment_497610\" style=\"max-width: 810px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-497610 size-full\" src=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-800-04.jpg\" alt=\"\" width=\"800\" height=\"716\" srcset=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-800-04.jpg 800w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-800-04-250x224.jpg 250w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-800-04-700x627.jpg 700w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-800-04-120x107.jpg 120w\" sizes=\"auto, (max-width: 800px) 100vw, 800px\" \/><p class=\"wp-caption-text\"><span class=\"media-credits-inline\">L\u00e9o Ramos Chaves\/Revista Pesquisa FAPESP<\/span>Sequencing machines: A flow cell, into which samples are inserted in the Illumina NovaSeq (<em>below<\/em>); with the Sanger method, DNA passes through thin threads (<em>right<\/em>)<span class=\"media-credits\">L\u00e9o Ramos Chaves\/Revista Pesquisa FAPESP<\/span><\/p><\/div>\n<p>One of the limitations of the HGP was that despite having collected DNA from around 20 volunteers, roughly 70% of the sequenced genome came from just one person, a North American of European and African descent from the city of Buffalo in the state of New York. This means that genetic variants that differ between one population and another may not be represented in the sequenced genome that is still used as a reference to this day. This is the genome that researchers compare against their results to identify the function and variation of genes detected in different situations.<\/p>\n<p>Now, the <a href=\"https:\/\/humanpangenome.org\/\" target=\"_blank\" rel=\"noopener\">Human Pangenome<\/a> Reference Consortium (HPRC) intends to resolve this issue by completely sequencing at least 350 people from all over the planet, to ensure scientists have access to more than just one reference. \u201cIt would be much better if we had genome references from the Brazilian population to compare to what we see in our patients, because common variants here may not be present in the North American population or vice versa,\u201d explains Galante, who is applying to join the consortium in the hope of contributing through his bioinformatics knowledge.<\/p>\n<p>The plan is to carry out telomere-to-telomere (T2T) sequencing, which uses the ends of the chromosomes, whose highly repetitive sequences make them difficult to disentangle using previous methods. To do this, the strategy has been to combine techniques. Devices made by British company Oxford Nanopore and American company Pacific Biosciences can read very long stretches of DNA, up to millions of bases long. They are very efficient for genome assembly. \u201cLong reads allow us to determine the genome\u2019s outline,\u201d explains geneticist Gl\u00f3ria Franco of the Federal University of Minas Gerais (UFMG). Not surprisingly, long reading was highlighted in an <a href=\"https:\/\/www.nature.com\/articles\/s41592-022-01759-x\" target=\"_blank\" rel=\"noopener\">editorial published in the journal <em>Nature <\/em>in January this year<\/a> as the method of the year for 2022. Because the error rate is higher, geneticists refine the sequences by using shorter readings made by machines from the Californian company Illumina, which allow them to look at smaller sections with greater precision.<\/p>\n<div class=\"box\"><\/p>\n<details>\n<summary><strong>\u25ba Autism<br \/>\n(click to learn more)<\/strong><\/summary>\n<p>Genome studies related to autism have highlighted gene variants associated with the disorder and developmental delays, according to a study published in the journal Nature Genetics in 2022 in which Maria Rita Passos-Bueno, from USP\u2019s Center for Human Genome Studies, participated. The approach shows promise for better understanding the neurobiological basis of autism spectrum disorder (ASD).<\/p>\n<p>In the quest to understand hereditary factors and the effects of parental age on children born with ASD, an analysis that considers three generations of each family rather than broader population studies seems promising. An article written by CEGH-USP scientists on an analysis of 33 families, published in the journal Genetics this year, praised the approach \u2014 although the sample size was too small to draw any major conclusions.<\/p>\n<\/details>\n<p><\/div>\n<p>Until recently, genomes were sequenced in small fragments that needed to be assembled like a puzzle that may have overlapping parts (<em>see infographic below<\/em>). Repetitive genome sections, which are highly frequent, are especially difficult to include in the assembly and used to be left out \u2014 in the representation of the nitrogenous bases that make up DNA, such as A, T, C, and G, there could be a long string of ATATATATATAT, for example.<\/p>\n<\/div><div class='overflow-responsive-img' style='text-align:center'><picture data-tablet=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-1-DESK-ING.png\" data-tablet_size=\"1939x1213\" alt=\"Os grandes marcos do sequenciamento\">\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-1-DESK-ING.png\" media=\"(min-width: 1920px)\" \/>\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-1-DESK-ING.png\" media=\"(min-width: 1140px)\" \/>\n    <img decoding=\"async\" class=\"responsive-img\" src=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-1-MOBILE-ING.png\" \/>\n  <\/picture><span class=\"embed media-credits-inline\">Alexandre Affonso \/ Revista Pesquisa FAPESP<\/span><\/div><div class=\"post-content sequence\">\n<p>The initiatives go far beyond human genomes, <a href=\"https:\/\/www.science.org\/content\/article\/eleven-science-stories-likely-make-big-news-2023\" target=\"_blank\" rel=\"noopener\">as noted by the journal <em>Science <\/em>in a January article<\/a> highlighting the topics it expected to make the news this year. The <a href=\"https:\/\/www.earthbiogenome.org\/\" target=\"_blank\" rel=\"noopener\">Earth Biogenome Project (EBP)<\/a>, a consortium of researchers from several countries launched in 2022, is attempting to sequence all species \u2014 including the large proportion still unknown to science, especially unicellular organism and small invertebrates \u2014 over the next 10 years. The objective is to \u201cknow, use, and conserve biodiversity.\u201d This umbrella also covers other projects, such as the sequencing of all mammals (Zoonomia, <a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/zoonomia-project-compares-genomes-in-search-of-answers-about-evolution-and-health\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue n\u00ba<\/em>\u00a0<em>328<\/em><\/a>) and primates (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/primate-genetic-material-could-contribute-to-precision-medicine\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue<\/em> <em>n\u00ba<\/em>\u00a0<em>329<\/em><\/a>).<\/p>\n<p>Marcela Uliano-Silva, a Brazilian bioinformatician from the Wellcome Sanger Institute in the UK, is part of the <a href=\"https:\/\/www.darwintreeoflife.org\/\" target=\"_blank\" rel=\"noopener\">Darwin Tree of Life project<\/a>, which is sequencing the genome of all organisms in the UK. She explains why such a broad perspective is needed: \u201cEverything in biology is investigated on the basis of comparison.\u201d There are around 70,000 species, of which one thousand are almost sequenced already. \u201cWe have a total of around 2,700 genomes at some stage in the sequencing process,\u201d she estimates. As the task progresses, the data is made immediately available and can be analyzed by anyone interested. \u201cThe first large sequencing block was of Lepidoptera, the family of butterflies and moths,\u201d says the bioinformatician.<\/p>\n<div class=\"box\"><\/p>\n<details>\n<summary><strong>\u25ba Diabetes and metformin<br \/>\n(click to learn more)<\/strong><\/summary>\n<p>The most commonly used oral treatment for controlling blood glucose levels in people with type 2 diabetes is metformin, but its mode of action is still a mystery and only protein-coding genes have been studied in this context. A team led by Gl\u00f3ria Franco at UFMG looked for long, noncoding RNA molecules (lncRNA), as described in a 2022 article published in the scientific journal Non-coding RNA. The study\u2019s still exploratory conclusion that the drug regulates critical forms of lncRNA that can affect the response to treatment, as well as cell proliferation and cell energy metabolism, could contribute to future investigations into how it works.<\/p>\n<p>Franco participated in the first genome projects with DNA from the parasite Schistosoma mansoni, which causes schistosomiasis, and is now interested in the relationship between lncRNA and cancer and drug responses.<\/p>\n<\/details>\n<p><\/div>\n<p>In search of diversity beyond genetics, Uliano-Silva is also a member of the EBP&#8217;s Justice, Equity, Diversity, and Inclusion commission, which is seeking to prevent \u201cscientific neocolonization\u201d as a result of the dominance of the wealthiest countries in large international consortiums. As part of this role, she has been contacted by Brazilian researchers interested in participating in this type of project and suggests that similar initiatives should be organized in Brazil.<\/p>\n<p>One early-stage project in the country plans to sequence all Brazilian tetrapods (vertebrates with four limbs), an initiative started by primatologist Jean Boubli of the University of Salford, UK, in partnership with a group from the Center for Human Genome Studies at USP (CEGH-USP). \u201cWe use an Illumina NovaSeq 6000 sequencer,\u201d explains geneticist Maria Rita Passos-Bueno, referring to one of the most modern devices. Although CEGH&#8217;s focus is on rare genetic diseases, she is excited about the new venture. \u201cWe are organizing a collection from the USP Zoology Museum, where they will check the quality of the samples before sending them to us.\u201d An initial test successfully sequenced 70 bird samples. \u201cI want to use data from nonhuman primates to understand the human genome,\u201d she says, returning to her main interest. \u201cIf we find a variant in humans that is common in other animals, there is a good chance that it will not be behind the problems,\u201d she explains.<\/p>\n<div id=\"attachment_497618\" style=\"max-width: 1150px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-497618 size-full\" src=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01.jpg\" alt=\"\" width=\"1140\" height=\"658\" srcset=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01.jpg 1140w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01-250x144.jpg 250w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01-700x404.jpg 700w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01-290x166.jpg 290w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-01-120x69.jpg 120w\" sizes=\"auto, (max-width: 1140px) 100vw, 1140px\" \/><p class=\"wp-caption-text\"><span class=\"media-credits-inline\">Ana Cotta\u2009\/\u2009Wikimedia Commons | Frank Fox\u2009\/\u2009Wikimedia Commons | Eduardo Cesar\u2009\/\u2009Revista Pesquisa FAPESP | Michael Wolf\u2009\/\u2009Wikimedia Commons | Zeynel Cebeci\u2009\/\u2009Wikimedia Commons | rufus46\u2009\/\u2009Wikimedia Commons<\/span>The ocelot, the maned wolf, protozoan, rice, tomato, and a butterfly: very different organisms can have similar genes and transposable elements contribute to variations<span class=\"media-credits\">Ana Cotta\u2009\/\u2009Wikimedia Commons | Frank Fox\u2009\/\u2009Wikimedia Commons | Eduardo Cesar\u2009\/\u2009Revista Pesquisa FAPESP | Michael Wolf\u2009\/\u2009Wikimedia Commons | Zeynel Cebeci\u2009\/\u2009Wikimedia Commons | rufus46\u2009\/\u2009Wikimedia Commons<\/span><\/p><\/div>\n<div class=\"box\"><\/p>\n<details>\n<summary><strong>\u25ba Agrigenomics<br \/>\n(click to learn more)<\/strong><\/summary>\n<p>Pangenomics has already influenced humankind\u2019s most widely eaten crops. It is well-known that some plants naturally undergo duplications of their entire genome, resulting in them having several copies of DNA in their cells. This process increases the chances of transposable elements \u2014 and therefore new responses to environmental conditions \u2014 being generated, according to an article published in the journal Genome Biology in 2021 by Brazilian biologist Rafael Della Coletta, a PhD student at the University of Minnesota, USA.<\/p>\n<p>Transposons are responsible for changes such as repressing the functioning of the gene that causes maize to flower when the days are longer, promoting aluminum tolerance in rice, and generating the oval shape of Roma tomatoes. The article explains that identifying the causal relationship between changes in phenotype (observable characteristics of an organism) and genetic novelties, which can be achieved by gene editing based on agricultural production needs, would represent a new era in the domestication of edible plants.<\/p>\n<\/details>\n<p><\/div>\n<p>Obtaining an infinite number of complete genomes is not even the most important step forward in modern genomics. The crucial advance was the realization that focusing on genes that produce proteins restricts attention to just 1.2% of human DNA. When it comes to these genes, people are 99.9% identical. The majority of the genome consists of sequences that control and orchestrate gene activity and are the real reason for differences between organisms, largely through the action of RNA molecules with regulatory functions. This is one of the concepts of the field known as \u201cevo-devo\u201d (evolutionary developmental biology, <a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/one-mold-and-many-forms\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue<\/em> <em>n\u00ba<\/em>\u00a0<em>152<\/em><\/a>): throughout development, adjustments in the construction of the same structures can generate very different results, such as a person&#8217;s hand and a bat&#8217;s wing. \u201cThe hardware is the same, what changes is the software,\u201d explains molecular geneticist Paulo Amaral, from the Insper Institute of Education and Research. \u201cThe sea sponge, which we use as a luffa, has very similar genes to ours.\u201d<\/p>\n<\/div><div class='overflow-responsive-img' style='text-align:center'><picture data-tablet=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-3-DESK-INGLES.png\" data-tablet_size=\"1939x851\" alt=\"Do gene ao genoma\">\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-3-DESK-INGLES.png\" media=\"(min-width: 1920px)\" \/>\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-3-DESK-INGLES.png\" media=\"(min-width: 1140px)\" \/>\n    <img decoding=\"async\" class=\"responsive-img\" src=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-3-MOBILE-INGLES.png\" \/>\n  <\/picture><span class=\"embed media-credits-inline\">Alexandre Affonso \/ Revista Pesquisa FAPESP<\/span><\/div><div class=\"post-content sequence\">\n<p>The discovery that small RNA molecules turn off genes (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/from-slave-to-master\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue n\u00ba<\/em>\u00a0<em>133<\/em><\/a>) won Americans Andrew Fire and Craig Mello the Nobel Prize in 2006. With the new techniques that have emerged since then, its function is increasingly under the spotlight. \u201cFor the first time, we are producing complete genomes, including the noncoding parts and the RNA they produce,\u201d says Amaral.<\/p>\n<p>This year, in partnership with Australian geneticist John Mattick of the University of New South Wales in Australia, he published the book <a href=\"https:\/\/www.taylorfrancis.com\/books\/oa-mono\/10.1201\/9781003109242\/rna-epicenter-genetic-information-john-mattick-paulo-amaral\" target=\"_blank\" rel=\"noopener\"><em>RNA \u2013 The epicenter of genetic information<\/em> (CRC Press)<\/a>. The basis for the book, which was later expanded and updated, was a review that resulted from three months holed up in the library during his PhD, defended in 2011 at the University of Queensland, also in Australia, at what was at the time Mattick&#8217;s laboratory. It provides an overview of the history of molecular biology since the nineteenth century, focusing on the dilemma that attracted Amaral even as an undergraduate: if it was considered that the main function of RNA was to translate genes into proteins, then why does most of the human genome represent instructions for RNA molecules that do not make proteins? Compelling examples include the Xist gene, which produces long RNAs responsible for silencing one of the two X chromosomes in female mammalian cells and is involved in diseases such as cancer. Together with a group led by immunologist Helder Nakaya of Hospital Israelita Albert Einstein in S\u00e3o Paulo, Amaral has been investigating the action of noncoding RNA in immunological responses to vaccines and in cardiovascular and neurological diseases. In the end, the book elevates RNA to the position of a \u201ccomputational engine of the cell, development, cognition, and evolution.\u201d \u201cWe are already writing updates for a second edition,\u201d he reveals. Scientific discoveries are advancing faster than the editorial process.<\/p>\n<p>Transposable elements, also known as transposons \u2014 which tend to contain repeated sequences \u2014 are also gaining increasing prominence (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/copies-that-make-a-difference\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue n\u00ba<\/em>\u00a0<em>246<\/em><\/a>), having initially been described in the 1940s by American geneticist Barbara McClintock as responsible for the color variation in corn kernels. These replicated sections can jump and insert themselves into another part of the DNA strand, quickly influencing neighboring genes, causing changes and generating new features.<\/p>\n<picture data-tablet=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-2-DESK-ING.png\" data-tablet_size=\"1089x783\" alt=\"Quantos genes?\">\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-2-DESK-ING.png\" media=\"(min-width: 1920px)\" \/>\n    <source srcset=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-2-DESK-ING.png\" media=\"(min-width: 1140px)\" \/>\n    <img decoding=\"async\" class=\"responsive-img\" src=\"\/wp-content\/uploads\/2023\/11\/rpf-pangenoma-2023-07-info-2-MOBILE-ING.png\" \/>\n  <\/picture><span class=\"embed media-credits-inline\">Alexandre Affonso \/ Revista Pesquisa FAPESP<\/span>\n<p>A recent article by Galante&#8217;s group that was published on the bioRxiv repository in February investigated the origin of regulatory RNAs using transposable elements in primates, known as retrocopies. Five of the 17 studied molecules are the same in all primates, while two are specific to the human genome and may be involved in essential biological processes, such as metabolism, communication between cells, and the development of various types of cancer. \u201cThe line that separates transposable elements from genes is getting thinner and thinner,\u201d says botanist Marie-Anne van Sluys, who believes the topic does not receive as much attention as it deserves, although she has focused on it for decades and is currently leading a study on genes, genomes, and transposition elements in sugarcane and their association in interaction with pathogens. A study by her group published on bioRxiv in 2020 identified a transposable element capable of modulating development and gene expression in tobacco plants in response to stress. \u201cIn mammals, it is already clear that they stand out as drivers of diversification.\u201d<\/p>\n<p>The results also include epigenetic markers (such as methylation patterns) that regulate DNA functions without altering the sequence and reflect how genes interact with the environment (<em>see infographic above<\/em>). They could be the cause of certain types of tumors and detecting them in DNA circulating in the bloodstream has emerged as a promising and practical diagnostic tool in recent years, known as a liquid biopsy (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/blood-holds-more-clues-about-cancer\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP <em>issue n\u00ba<\/em>\u00a0<em>253<\/em><\/a>).<\/p>\n<div class=\"box\"><\/p>\n<details>\n<summary><strong>\u25ba Brazilian genomics<br \/>\n(click to learn more)<\/strong><\/summary>\n<p>Brazilian initiatives funded by FAPESP have kept the country in line with advances being made abroad. The sequencing of the bacteria Xyllella fastidiosa, which started in 1997, involved 35 laboratories and 191 researchers from institutions in S\u00e3o Paulo as part of the Network for Nucleotide Sequencing and Analysis (ONSA). It was the first sequencing of an organism that causes disease in plants and was thus of commercial interest, and it made the cover of Nature on July 13, 2000.<\/p>\n<p>At the beginning of the century, the sugarcane genome, cancer genome, and the genome of the bacteria Xanthomonas citri, which causes citrus canker, were also completed.<\/p>\n<p>The results, which were fundamental to the establishment of contemporary scientific culture, involved the creation of dozens of laboratories, coordination between research groups, the establishment of shared databases, and the development of bioinformatics. It also inspired the formation of new research groups nationwide.<\/p>\n<p>\u201cThe formation of the ONSA network was a daring and risky initiative that ended up turning out really well and generating great results. Building a methodology capable of delivering a sequence was extremely important and helped qualify a lot of people in this area,\u201d said geneticist Marcio de Castro, scientific director of FAPESP, in a recent interview (<a href=\"https:\/\/revistapesquisa.fapesp.br\/en\/marcio-de-castro-a-boost-for-bold-research\/\" target=\"_blank\" rel=\"noopener\"><em>see<\/em> Pesquisa FAPESP<em> issue n\u00ba 328<\/em><\/a>).<\/p>\n<\/details>\n<p><\/div>\n<p><strong>The urgency of bioinformatics<br \/>\n<\/strong>Highly specialized professionals called bioinformaticians are needed to process and interpret such immense volumes of data. Marcela Uliano-Silva is keenly aware of this need and has just developed an open-access program for assembling mitochondrial genomes, something that did not yet exist, according to an article published in the scientific journal <em>BMC Bioinformatics <\/em>in July. Mitochondrial genomes are a special type of DNA \u2014 they result from an ancient symbiosis with bacteria, are circular, and are usually transmitted from mothers to children. She warns that it is essential for biologists to learn programming because software development is so crucial to analyzing and visualizing data.<\/p>\n<div id=\"attachment_497622\" style=\"max-width: 1150px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-497622 size-full\" src=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-02.jpg\" alt=\"\" width=\"1140\" height=\"579\" srcset=\"https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-02.jpg 1140w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-02-250x127.jpg 250w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-02-700x356.jpg 700w, https:\/\/revistapesquisa.fapesp.br\/wp-content\/uploads\/2023\/11\/RPF-pangenomica-2023-08-1140-02-120x61.jpg 120w\" sizes=\"auto, (max-width: 1140px) 100vw, 1140px\" \/><p class=\"wp-caption-text\"><span class=\"media-credits-inline\">Steve Wilson\u2009\/\u2009Wikimedia Commons | Ivar Leidus\u2009\/\u2009Wikimedia Commons | Oceancetaceen\u2009\/\u2009Wikimedia Commons | Andrew Mercer\u2009\/\u2009Wikimedia Commons | H. Zell\u2009\/\u2009Wikimedia Commons\u2002<\/span>Tamarin monkeys, bees, porpoises, bats, marsupials, sea sponges: regulation during development is behind some of the differences<span class=\"media-credits\">Steve Wilson\u2009\/\u2009Wikimedia Commons | Ivar Leidus\u2009\/\u2009Wikimedia Commons | Oceancetaceen\u2009\/\u2009Wikimedia Commons | Andrew Mercer\u2009\/\u2009Wikimedia Commons | H. Zell\u2009\/\u2009Wikimedia Commons\u2002<\/span><\/p><\/div>\n<p>Galante adds that the training needed is not usually available in Brazilian undergraduate courses. Becoming a bioinformatician, he says, often requires going back to school to learn statistics and mathematics, something that most people with a background in biological sciences are not eager to do. \u201cThe statistics and computing demands of analyzing genomic data are increasingly complex,\u201d he stresses. Demand is also rising in the private sector, and Galante points out that the PhD students working in his laboratory who master this area are almost guaranteed employment in clinical analysis and oncology labs or companies wishing to develop computational technologies. \u201cIn five short years, an entire ecosystem of industries that require bioinformaticians was formed in the country.\u201d<\/p>\n<p>Technological advances make sense, and reach their full potential, through sound scientific questions. But this is a two-way relationship, highlights Carlos Menck. \u201cWhen we are looking for a way to solve a problem, sometimes we learn something that changes the scientific question.\u201d Many researchers in health-related fields who spend most of their time in the lab doing basic research, as is the case for Menck, have a clear objective: to solve patients&#8217; problems. While searching for solutions to practical difficulties regarding diagnoses or how to deliver the drug to the intended destination, for example, they uncover molecular modes of action, and as they investigate how the DNA works, they may find unexpected applied solutions.<\/p>\n<p class=\"bibliografia separador-bibliografia\"><strong>Projects<br \/>\n1.<\/strong> HUG-CELL &#8211; Human Genome and Stem Cell Research Center (<a href=\"https:\/\/bv.fapesp.br\/pt\/auxilios\/58578\/cegh-cel-centro-de-estudos-do-genoma-humano-e-de-celulas-tronco\/?q=13\/08028-1\" target=\"_blank\" rel=\"noopener\">n\u00ba 13\/08028-1<\/a>); <strong>Grant Mechanism<\/strong> Research, Innovation, and Dissemination Center (RIDC); <strong>Principal Investigator<\/strong> Mayana Zatz (USP); <strong>Investment<\/strong> R$55,474,011.98.<br \/>\n<strong>2.<\/strong> The role of DNA damage and mitochondrial function in vascular, immune system, and neurological aging (DNA MoVINg) (<a href=\"https:\/\/bv.fapesp.br\/pt\/auxilios\/106138\/papel-de-danos-no-dna-e-funcao-mitocondrial-em-envelhecimento-vascular-imune-e-neurologico-dna-movin\/?q=19\/19435-3\" target=\"_blank\" rel=\"noopener\">n\u00ba 19\/19435-3<\/a>); <strong>Grant Mechanism<\/strong> Thematic Project;<strong> Principal Investigator<\/strong> Carlos Frederico Martins Menck (USP); <strong>Investment <\/strong>R$8,100,714.10.<br \/>\n<strong>3.<\/strong> Contribution of genes, genomes, and transposable elements in the interaction between plants and microorganisms: sugarcane case study (<a href=\"https:\/\/bv.fapesp.br\/pt\/auxilios\/98031\/contribuicao-de-genes-genomas-e-elementos-de-transposicao-na-interacao-entre-plantas-e-micro-organis\/?q=16\/17545-8\" target=\"_blank\" rel=\"noopener\">n\u00ba 16\/17545-8<\/a>); <strong>Grant Mechanism<\/strong> Thematic Project; <strong>Program<\/strong> BIOEN; <strong>Principal Investigator<\/strong> Marie-Anne van Sluys (USP); <strong>Investment <\/strong>R$6,106,313.34.<br \/>\n<strong>4.<\/strong> Retroelements: A driving force in the creation of genetic novelties in the genomes of humans and mice (<a href=\"https:\/\/bv.fapesp.br\/pt\/auxilios\/103797\/retroelementos-uma-forca-motriz-criando-novidades-geneticas-no-genoma-humano-e-de-camundongos\/?q=18\/15579-8\">n\u00ba 18\/15579-8<\/a>); <strong>Grant Mechanism<\/strong> Young Investigator Award; <strong>Principal Investigator<\/strong> Pedro Alexandre Favoretto Galante (SBSHSL); <strong>Investment<\/strong> R$2,080,277.52.<br \/>\n<strong>5.<\/strong> Olfactory receptors: Mechanisms of gene expression and signal transduction (<a href=\"https:\/\/bv.fapesp.br\/pt\/auxilios\/96044\/receptores-olfatorios-mecanismos-de-expressao-genica-e-transducao-de-sinal\/?q=16\/24471-0\" target=\"_blank\" rel=\"noopener\">n\u00ba 16\/24471-0<\/a>); <strong>Grant Mechanism<\/strong> Thematic Project;<strong> Principal Investigator<\/strong> Bettina Malnic (USP); <strong>Investment <\/strong>R$1,486,107.99.<br \/>\n<strong>6.<\/strong> Impact of RNA-binding proteins on splicing dysregulation in glioblastoma (<a href=\"https:\/\/bv.fapesp.br\/pt\/bolsas\/178521\/impacto-de-proteinas-de-ligacao-a-rna-na-desregulacao-de-splicing-em-glioblastoma\/?q=17\/19541-2\" target=\"_blank\" rel=\"noopener\">n\u00ba 17\/19541-2<\/a>); <strong>Grant Mechanism<\/strong> Postdoctoral fellowship; <strong>Supervisor<\/strong> Pedro Alexandre Favoretto Galante (SBSHSL); <strong>Beneficiary<\/strong> Gabriela Der Agopian Guardia; <strong>Investment <\/strong>R$321,292.29.<\/p>\n<p class=\"bibliografia\"><strong>Scientific articles<\/strong><strong><br \/>\n<\/strong>MATTICK, J. &amp; AMARAL, P. <a href=\"https:\/\/www.taylorfrancis.com\/books\/oa-mono\/10.1201\/9781003109242\/rna-epicenter-genetic-information-john-mattick-paulo-amaral\" target=\"_blank\" rel=\"noopener\">RNA \u2013 The epicenter of genetic information<\/a>.<strong> Boca Raton: CRC Press<\/strong>, 2023.<strong><br \/>\n<\/strong>GUARDIA, G. D. A. <em>et al<\/em>. <a href=\"https:\/\/www.mdpi.com\/2072-6694\/15\/12\/3073\">Acute myeloid leukemia expresses a specific group of olfactory receptors<\/a>. <strong>Cancers<\/strong><strong>. <\/strong>vol. 15 no. 12 3073. june 6, 2023.<strong><br \/>\n<\/strong>WRIGHT, C. J. <em>et al<\/em>. <a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2023.05.12.540473v1\" target=\"_blank\" rel=\"noopener\">Chromosome evolution in Lepidoptera<\/a>. <strong>bioRxiv<\/strong>. may 14, 2023.<strong><br \/>\n<\/strong>CORRADI, C. <em>et al.<\/em><a href=\"https:\/\/academic.oup.com\/carcin\/advance-article-abstract\/doi\/10.1093\/carcin\/bgad030\/7168604?redirectedFrom=fulltext&amp;login=false\">Mutational signatures and increased retrotransposon insertions in xeroderma pigmentosum variant skin tumors<\/a>. <strong>Carcinogenesis<\/strong><strong>. <\/strong>bgad030. may 17, 2023.<strong><br \/>\n<\/strong>MERCURI, R. L. <em>et al.<\/em><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2023.02.24.529967v1\" target=\"_blank\" rel=\"noopener\">Retro-miRs: Novel and functional miRNAs originated from mRNA retrotransposition<\/a>. <strong>bioRxiv<\/strong><strong>.<\/strong> feb. 25, 2023.<strong><br \/>\n<\/strong>CONCEI\u00c7\u00c3O, I. M. C. A. da, <em>et al.<\/em><a href=\"https:\/\/www.mdpi.com\/2311-553X\/8\/5\/68\">Metformin treatment modulates long non-coding RNA 2 isoforms expression in human cells<\/a>. <strong>Non-coding RNA<\/strong>. vol. 8, no. 5, 68. oct. 12, 2022.<strong><br \/>\n<\/strong>MULHAIR, P. O. <em>et al.<\/em><a href=\"https:\/\/genome.cshlp.org\/content\/33\/1\/32\" target=\"_blank\" rel=\"noopener\">Diversity, duplication, and genomic organization of homeobox genes in Lepidoptera<\/a>. <strong>Genome Research<\/strong><strong>. <\/strong>vol. 33 pp. 32\u201344. 2023.<strong><br \/>\n<\/strong>ULIANO-SILVA, M. <em>et al<\/em>. <a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2022.12.23.521667v1\" target=\"_blank\" rel=\"noopener\">MitoHiFi: A python pipeline for mitochondrial genome assembly from PacBio High Fidelity reads<\/a>. <strong>BMC Bioinformatics<\/strong><strong>. <\/strong>vol. 24, 288. july 18, 2023.<strong><br \/>\n<\/strong>DELLA COLETTA, R. <em>et al.<\/em><a href=\"https:\/\/genomebiology.biomedcentral.com\/articles\/10.1186\/s13059-020-02224-8#Abs1\" target=\"_blank\" rel=\"noopener\">How the pan-genome is changing crop genomics and improvement<\/a>. <strong>Genome Biology<\/strong><strong>. <\/strong>vol. 22, no. 3. jan. 4, 2021.<strong><br \/>\n<\/strong>FU, J. M. <em>et al.<\/em><a href=\"https:\/\/www.nature.com\/articles\/s41588-022-01104-0#data-availability\" target=\"_blank\" rel=\"noopener\">Rare coding variation provides insight into the genetic architecture and phenotypic context of autism<\/a>. <strong>Nature Genetics.<\/strong> vol. 54, pp. 1320\u201331. aug. 18, 2022.<strong><br \/>\n<\/strong>COSTA, C. I. S. <em>et al<\/em>. <a href=\"https:\/\/www.nature.com\/articles\/s41431-023-01398-6\" target=\"_blank\" rel=\"noopener\">Three generation families: Analysis of de novo variants in autism<\/a>. <strong>European Journal of Human Genetics<\/strong>. june 6, 2023.<strong><br \/>\n<\/strong>HERNANDES-ROSA, J. <em>et al<\/em>. <a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2020.08.15.252312v1\" target=\"_blank\" rel=\"noopener\">Evidence-based gene expression modulation correlates with transposable element knock-down<\/a>. <strong>bioRxiv<\/strong><strong>. <\/strong>aug. 15, 2020.<\/p>\n","protected":false},"excerpt":{"rendered":"New techniques are changing the way genetic material is understood and promising to transform our understanding of diseases, human genetic diversity, food production, and evolutionary processes","protected":false},"author":3,"featured_media":497626,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[156,159],"tags":[206,209,237,247],"coauthors":[1601],"class_list":["post-497607","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-cover","category-science","tag-biodiversity","tag-biology","tag-genetics","tag-medicine","position_at_home-sumario"],"acf":[],"_links":{"self":[{"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/posts\/497607","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/comments?post=497607"}],"version-history":[{"count":15,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/posts\/497607\/revisions"}],"predecessor-version":[{"id":503194,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/posts\/497607\/revisions\/503194"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/media\/497626"}],"wp:attachment":[{"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/media?parent=497607"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/categories?post=497607"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/tags?post=497607"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/revistapesquisa.fapesp.br\/en\/wp-json\/wp\/v2\/coauthors?post=497607"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}