TY - JOUR
T1 - The draft genome sequence of cork oak
AU - Ramos, António Marcos
AU - Usié, Ana
AU - Barbosa, Pedro
AU - Barros, Pedro M.
AU - Capote, Tiago
AU - Chaves, Inês
AU - Simões, Fernanda
AU - Abreu, Isabel
AU - Carrasquinho, Isabel
AU - Faro, Carlos
AU - Guimarães, Joana B.
AU - Mendonça, Diogo
AU - Nóbrega, Filomena
AU - Rodrigues, Leandra
AU - Saibo, Nelson J.M.
AU - Varela, Maria Carolina
AU - Egas, Conceição
AU - Matos, José
AU - Miguel, Célia M.
AU - Oliveira, M. Margarida
AU - Ricardo, Cândido P.
AU - Gonçalves, Sónia
PY - 2018/05/22
Y1 - 2018/05/22
AB - Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species.
UR - http://www.scopus.com/inward/record.url?scp=85047644050&partnerID=8YFLogxK
U2 - 10.1038/sdata.2018.69
DO - 10.1038/sdata.2018.69
M3 - Article
C2 - 29786699
AN - SCOPUS:85047644050
SN - 2052-4463
VL - 5
JO - Scientific Data
JF - Scientific Data
M1 - 180069
ER -