Report for Falco peregrinus Annotation Release 100
The RefSeq genome records for Falco peregrinus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline. This page provides reports on input data used in the pipeline, intermediate alignment results, and statistics on the annotation products.
For details on the workflow used to generate the data please visit the annotation pipeline description page.
The annotation products are available in the sequence databases and on the FTP site.
Annotation Release information
Annotation release ID: 100Date of Entrez queries for transcripts and proteins: Aug 06 2013
Date of submission of annotation to the public databases: Aug 08 2013
Software version: 5.1
Assemblies
The following assemblies were included in this annotation run:Assembly name | Assembly accession | Submitter | Assembly date | Reference/Alternate | Assembly content |
---|---|---|---|---|---|
F_peregrinus_v1.0 | GCF_000337955.1 | BGI | 02-05-2013 | Reference | 1 assembled chromosomes; unplaced scaffolds |
Gene and feature statistics
Counts and length of annotated features are provided below for each assembly unit.Feature counts
Feature | Primary Assembly F_peregrinus_v1.0 |
---|---|
Gene | 14,720 |
protein-coding | 14,451 |
non-coding | 228 |
pseudogene | 41 |
mRNA | 16,079 |
fully-supported | 9,993 |
with > 5% ab initio | 3,411 |
partial | 795 |
known RefSeq (NM_) | 0 |
model RefSeq (XM_) | 16,079 |
model RefSeq (XM_) with correction | 853 |
Other RNA | 238 |
fully-supported | 40 |
with > 5% ab initio | 0 |
partial | 0 |
known RefSeq (NR_) | 0 |
model RefSeq (XR_) | 40 |
model RefSeq (XR_) with correction | 0 |
CDSs | 16,084 |
fully-supported | 9,993 |
with > 5% ab initio | 3,810 |
partial | 795 |
known RefSeq (NM_) | 0 |
model RefSeq (XM_) | 16,079 |
model RefSeq (XM_) with correction | 853 |
Detailed reports
Feature lengths
Masking of genomic sequence
Transcript and protein alignments and gene prediction are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly.
Assembly name | Assembly accession | % Masked with RepeatMasker | % Masked with WindowMasker |
---|---|---|---|
F_peregrinus_v1.0 | GCF_000337955.1 | 4.16% | 17.91% |
Transcript and Protein alignments
The annotation pipeline relies heavily on the alignment of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, the gene prediction software.