首页 诗词 字典 板报 句子 名言 友答 励志 学校 网站地图
当前位置: 首页 > 教程频道 > 企业软件 > 行业软件 >

NCBI参照序列RefSeq

2012-10-07 
NCBI参考序列RefSeq关于RefSeq的基本信息,可以参照一下几篇文章【开启传送门~!@#¥%……&*】http://liucheng.na

NCBI参考序列RefSeq

关于RefSeq的基本信息,可以参照一下几篇文章【开启传送门~!@#¥%……&*】

http://liucheng.name/381/

http://www.biosino.org/pages/ncbi-10.htm

官方版本:http://www.ncbi.nlm.nih.gov/RefSeq/RSfaq.html

?

不过可能我现在更关注与RefSeq的格式说明,这一阶段的失败教训提醒我,数据分析的时候一定要搞清楚各个数据项的意义。

方便查阅

AccessionMoleculeMethod?@Note说明?AC_123456GenomicMixedAlternate complete genomic molecule. This prefix is used for records that are provided to reflect an alternate assembly or annotation. Primarily used for viral, prokaryotic records.?基因组序列,主要是病毒、原核生物。AP_123456ProteinMixedProtein products; alternate protein record. This prefix is used for records that are provided to reflect an alternate assembly or annotation. The AP_ prefix was originally designated for bacterial proteins but this usage was changed.?蛋白序列,AP_原本只用于细菌的蛋白。NC_123456GenomicMixedComplete genomic molecules including genomes, chromosomes, organelles, plasmids.?全基因组序列,包括细胞器的、质粒等NG_123456GenomicMixedIncomplete genomic region; supplied to support the?NCBI?genome annotation pipeline. Represents either non-transcribed pseudogenes, or larger regions representing a gene cluster that is difficult to annotate via automatic methods.?不完整的基因组序列,NM_123456
NM_123456789mRNAMixedTranscript products; mature messenger RNA (mRNA) transcripts.?成熟的mRNANP_123456
NP_123456789ProteinMixedProtein products; primarily full-length precursor products but may include some partial proteins and mature peptide products.?全长蛋白序列。但也有可能包括非全长的蛋白或成熟的多肽序列。NR_123456RNAMixedNon-coding transcripts including structural RNAs, transcribed pseudogenes, and others.?不编码的RNA,假基因或其它NT_123456GenomicAutomatedIntermediate genomic assemblies of BAC and/or Whole Genome Shotgun sequence data.?BAC法或鸟枪法得到的基因组序列NW_123456
NW_123456789GenomicAutomatedIntermediate genomic assemblies of BAC or Whole Genome Shotgun sequence data.?BAC法或鸟枪法得到的基因组序列NZ_ABCD12345678GenomicAutomatedA collection of whole genome shotgun sequence data for a project. Accessions are not tracked between releases. The first four characters following the underscore (e.g. 'ABCD') identifies a genome project.?'ABCD'代表的是具体的基因组计划XM_123456
XM_123456789mRNAAutomatedTranscript products; model mRNA provided by a genome annotation process; sequence corresponds to the genomic contig.?转录序列XP_123456
XP_123456789ProteinAutomatedProtein products; model proteins provided by a genome annotation process; sequence corresponds to the genomic contig.?蛋白序列XR_123456RNAAutomatedTranscript products; model non-coding transcripts provided by a genome annotation process; sequence corresponds to the genomic contig.?不编码的转录序列,YP_123456
YP_123456789ProteinMixedProtein products; no corresponding transcript record provided. Primarily used for bacterial, viral, and mitochondrial records.?蛋白序列,没有对应的转录序列。用于细菌、病毒和线粒体ZP_12345678ProteinAutomatedProtein products; annotated on NZ_ accessions (often via computational methods).?蛋白序列。来自对应的NZ_开头的核酸序列。NS_123456GenomicAutomatedGenomic records that represent an assembly which does not reflect the structure of a real biological molecule. The assembly may represent an unordered assembly of unplaced scaffolds, or it may represent an assembly of DNA sequences generated from a biological sample that may not represent a single organism.?比较复杂

@ Method:???
Mixed: indicates the process flow includes both automated processing and expert review for some of the records; curation analysis may be provided either by NCBI staff or collaborators.由专家手动检查过的
Automated: indicates records that are not individually reviewed; updates are released in bulk for a genome.自动注释的

For more:http://www.ncbi.nlm.nih.gov/RefSeq/key.html#accession

热点排行