GenBank Accession Number

WBOY
Published: 2016-06-07 15:31:55

Accession numbers are identifiers for sequence records, for example P123456. An accession can carry a version number, written as a "." followed by a number, for example P123456.2. Versioning helps distinguish older from newer versions of a sequence and makes it possible to track exactly which sequence was used in an analysis.
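As a rough illustration of the versioning convention, the following Python sketch separates an identifier into its accession and optional version. The function name `split_accession` is hypothetical and not part of any library. The only rule it assumes is the one stated above, namely that a version is a "." followed by digits at the end of the identifier.

```python
import re

# Assumption: a versioned identifier is "<accession>.<digits>", nothing more.
_VERSIONED = re.compile(r"(?P<base>[^.]+)\.(?P<version>[0-9]+)")


def split_accession(identifier):
    """Return (accession, version); version is None when no suffix is present."""
    match = _VERSIONED.fullmatch(identifier)
    if match:
        return match.group("base"), int(match.group("version"))
    # No recognisable ".N" suffix: treat the whole string as unversioned.
    return identifier, None


print(split_accession("P123456.2"))
print(split_accession("P123456"))
```

Keeping the version as a separate integer makes it straightforward to compare two records of the same accession and to record which one an analysis actually used.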

NCBI Reference Sequences (RefSeq) have their own syntax.

Accessions are allocated in batches to the different sequence repositories: DDBJ, the EMBL Database, and NCBI. Table 1 shows the format of some unversioned accession numbers.

Table 1: Some Accession Number Formats

Database           POSIX Regular Expression                                              Perl Regular Expression
RefSeq             [[:alpha:]]{2}_[[:digit:]]{6,9} or NZ_[[:alpha:]]{4}[[:digit:]]{6,9}  [A-Z]{2}_\d{6,9} or NZ_[A-Z]{4}\d{6,9}
Swissprot          [OPQ][[:digit:]][[:alnum:]]{3}[[:digit:]]                             [OPQ]\d[A-Z0-9]{3}\d
GenBank/EMBL/DDBJ  [[:alpha:]][[:digit:]]{5} or [[:alpha:]]{2}[[:digit:]]{6}             [A-Z]\d{5} or [A-Z]{2}\d{6}
PRF                [[:digit:]]{6,7}[[:alpha:]]                                           \d{6,7}[A-Z]
PDB                [[:digit:]][[:alpha:]]{3}                                             \d[A-Z]{3}
MMDB               [[:digit:]]{4}                                                        \d{4}
GenBank GI         [[:digit:]]{5,}                                                       \d{5,}
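The patterns in Table 1 can be tried out directly. The Python sketch below is only an illustration, and the names `PATTERNS` and `candidate_databases` are hypothetical. It uses the Perl-style column, which Python's `re` module accepts unchanged. The Swissprot entry is a mechanical translation of the POSIX column, so treat it as an assumption. Because several formats overlap, the function returns every database whose pattern matches rather than a single answer.

```python
import re

# Perl-style patterns from Table 1. The Swissprot pattern is translated from
# the POSIX column (assumption), mapping [:alnum:] to [A-Z0-9].
PATTERNS = {
    "RefSeq": r"[A-Z]{2}_\d{6,9}|NZ_[A-Z]{4}\d{6,9}",
    "Swissprot": r"[OPQ]\d[A-Z0-9]{3}\d",
    "GenBank/EMBL/DDBJ": r"[A-Z]\d{5}|[A-Z]{2}\d{6}",
    "PRF": r"\d{6,7}[A-Z]",
    "PDB": r"\d[A-Z]{3}",
    "MMDB": r"\d{4}",
    "GenBank GI": r"\d{5,}",
}

# re.ASCII keeps \d restricted to 0-9, matching the intent of the table.
COMPILED = {name: re.compile(p, re.ASCII) for name, p in PATTERNS.items()}


def candidate_databases(accession):
    """List every database whose pattern matches the whole accession string."""
    return [name for name, rx in COMPILED.items() if rx.fullmatch(accession)]


for acc in ("NM_000546", "U49845", "P12345", "1ABC", "12345"):
    print(acc, candidate_databases(acc))
```

An identifier such as P12345 satisfies both the Swissprot pattern and the one-letter GenBank/EMBL/DDBJ pattern, so format alone cannot always tell the source database. These expressions cover unversioned accessions only, so any version suffix should be removed before matching.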

