|Title||A ranking system for reference libraries of DNA barcodes: application to marine fish species from Portugal.|
|Publication Type||Journal Article|
|Authors||Costa, FO, Landi, M, Martins, R, Costa, MH, Costa, ME, Carneiro, M, Alves, MJ, Steinke, D, Carvalho, GR|
|Year of Publication||2012|
|Keywords||Animals, Aquatic Organisms, DNA Barcoding, Taxonomic, DNA, Mitochondrial, Electron Transport Complex IV, Fishes, Gene Library, Phylogeny, Phylogeography, Portugal|
BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to the screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from a RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal.METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A-E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% taxonomically congruent barcode clusters within our dataset, but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E).CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work.
|Alternate Journal||PLoS ONE|
|PubMed Central ID||PMC3338485|