ORGANISM NAME: Camelina sativa ORGANISM COMMON NAME: false flax TAXID: 90675 ANNOTATION RELEASE NAME: NCBI Camelina sativa Annotation Release 100 ANNOTATION EVIDENCE FREEZE DATE: 24 November 2014 ANNOTATION RELEASE DATE: 1 December 2014 ANNOTATION REPORT: http://www.ncbi.nlm.nih.gov/genome/annotation_euk/Camelina_sativa/100/ ANNOTATED ASSEMBLIES: * REFERENCE: ASSEMBLY NAME: Cs ASSEMBLY ACCESSION: GCF_000633955.1 ASSEMBLY SUBMITTER: Agriculture & AgriFood Canada ASSEMBLY DATE: 24 April 2014 ASSEMBLY TYPE: Haploid NUMBER OF ASSEMBLY-UNITS: 1 ====================================================================== Note: May 19, 2015 - update to the GFF3 files in the GFF directory. In both top-level and scaffold GFF3, indels in the coding regions are now represented by overlaps or short introns. * The top-level file (*_top_level.gff3.gz) in the GFF directory was corrected to reflect an intended software release 6.2 change in how transcripts with indels are represented compared to the genome. Insertions in transcripts within the coding region are now represented by a small overlap between the two halves of a split exon, and deletions within the coding region are represented by very short introns between the two halves of an exon. This allows software to properly interpret the reading frame. Note that the conceptual sequence of the feature can still differ from the transcript or protein sequence because of mismatches, gaps, and when overlapping genome sequence does not match the sequence of an insertion. * The scaffold file (*_scaffolds.gff3.gz) was corrected to remove unintended small overlaps or very short introns for indels in the transcripts' untranslated regions. ====================================================================== Note: May 4, 2015 - update for protein.gbk.gz The protein.gbk.gz file in the protein directory was regenerated to correct an error in the source for the protein (DBSOURCE and coded_by fields). For some proteins, the scaffold on which the protein is annotated was used as the source rather than the transcript. The full list of changes includes: * corrected DBSOURCE to RefSeq transcripts instead of scaffolds * removed unexpected PUBLICATIONS propagated from the scaffolds * corrected CDS feature annotation, including: coded_by to range on the RefSeq transcripts instead of scaffolds corrected CDS range (if different than the genome length) removed 'Derived by automated computational analysis using gene prediction method: Gnomon' note on CDS removed /inference and /exception on CDS removed some db_xrefs that are present on genome but missing on known RefSeq (NP_s) used native known RefSeq markup for gene names and such when they differ from what was annotated on the genome added CDS feature to organelle proteins