Report for Falco peregrinus Annotation Release 100

The RefSeq genome records for Falco peregrinus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline. This page provides reports on input data used in the pipeline, intermediate alignment results, and statistics on the annotation products.

For details on the workflow used to generate the data please visit the annotation pipeline description page.

The annotation products are available in the sequence databases and on the FTP site.


Annotation Release information

Annotation release ID: 100
Date of Entrez queries for transcripts and proteins: Aug 06 2013
Date of submission of annotation to the public databases: Aug 08 2013
Software version: 5.1

Assemblies

The following assemblies were included in this annotation run:
Assembly name Assembly accession Submitter Assembly date Reference/Alternate Assembly content
F_peregrinus_v1.0 GCF_000337955.1 BGI 02-05-2013 Reference 1 assembled chromosomes; unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly unit.

Feature counts

Feature Primary Assembly
F_peregrinus_v1.0
Gene 14,720
  protein-coding 14,451
  non-coding 228
  pseudogene 41
mRNA 16,079
  fully-supported 9,993
  with > 5% ab initio 3,411
  partial 795
  known RefSeq (NM_) 0
  model RefSeq (XM_) 16,079
  model RefSeq (XM_) with correction 853
Other RNA 238
  fully-supported 40
  with > 5% ab initio 0
  partial 0
  known RefSeq (NR_) 0
  model RefSeq (XR_) 40
  model RefSeq (XR_) with correction 0
CDSs 16,084
  fully-supported 9,993
  with > 5% ab initio 3,810
  partial 795
  known RefSeq (NM_) 0
  model RefSeq (XM_) 16,079
  model RefSeq (XM_) with correction 853

Detailed reports

Feature lengths

Masking of genomic sequence

Transcript and protein alignments and gene prediction are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly.

Assembly name Assembly accession % Masked with RepeatMasker % Masked with WindowMasker
F_peregrinus_v1.0 GCF_000337955.1 4.16% 17.91%
For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker.

Transcript and Protein alignments

The annotation pipeline relies heavily on the alignment of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, the gene prediction software.

Transcript alignments

RNA-Seq alignments

No RNA-Seq data was used in this annotation

Protein alignments

External link. Please review our privacy policy.