Drosophila species assemblies counts of Dmel gene matches
tBLASTn of Dmel proteins x other species
27 October 2005
Dros. euGenes Dupl. UCSC
Assembly matches genes matches
-----------------------------------------
Dmel r4.1 13472 935 13449 (source genes)
Dsim 2004 13246 1231 13024
Dyak 2004 13329 1276 13279
Dere 2004.x 13304 1251 --
Dere 2005.0801 13328 1141 13362
Dana 2004.x 13082 1514 12943
Dana 2005.0801 13074 1528 13098
Dpse 1 12858 1815 12834
Dmoj 2004.x 12695 2878 12502
Dmoj 2005.0801 12675 1946 12627
Dvir 2004.x 12721 1974 12619
Dvir 2005.0801 12709 1389 12731
Dgri 2005.0801 12585 2877 12635
Dgri 2005.0829 12615 2502 --
----------------------------------------
Columns are counts unique gene matches on genome assemblies.
Duplicate genes are matches to more than one location on assembly.
Notes:
The UCSC Dmel protein reference set (13449 genes with 18941 proteins)
differs from the euGenes protein ref. set (single longest protein
from each of 13472 genes) so these two are not strictly comparable.
UCSC and euGenes are not using same 2004 assemblies.
Loss of 10 to 20 gene matches for Dana, Dmoj, Dvir newer vs older
assemblies seems to be only low quality matches (Dvir, Dmoj checked)
Don Gilbert, 28 Oct 2005
|