Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= DDB0202532
(717 letters)
Database: /home/dicty1/resource/WorkingDBs//nr-clean
2,329,665 sequences; 788,375,511 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|54658838|gb|EAL37451.1| hypothetical protein Chro.30077 [Cryp... 136 2e-30
gi|46228303|gb|EAK89202.1| hypothetical protein with 4 transmemb... 135 6e-30
gi|56497588|emb|CAI00101.1| conserved hypothetical protein [Plas... 129 3e-28
gi|56519381|emb|CAH75495.1| conserved hypothetical protein [Plas... 129 4e-28
gi|23498780|emb|CAD50850.1| hypothetical protein [Plasmodium fal... 126 3e-27
gi|23490733|gb|EAA22438.1| hypothetical protein [Plasmodium yoel... 77 1e-12
gi|23619520|ref|NP_705482.1| hypothetical protein [Plasmodium fa... 77 2e-12
gi|56513961|emb|CAH86598.1| hypothetical protein PC302073.00.0 [... 76 3e-12
gi|56526585|emb|CAH89026.1| conserved hypothetical protein [Plas... 76 4e-12
gi|56499773|emb|CAH97091.1| conserved hypothetical protein [Plas... 75 5e-12
gi|23484006|gb|EAA19486.1| ERYTHROCYTE MEMBRANE PROTEIN PFEMP3 [... 75 5e-12
gi|23490734|gb|EAA22439.1| hypothetical protein [Plasmodium yoel... 71 1e-10
gi|56507955|emb|CAH87264.1| hypothetical protein PC302397.00.0 [... 62 5e-08
gi|48891474|ref|ZP_00324981.1| COG4782: Uncharacterized protein ... 52 5e-05
gi|24213740|ref|NP_711221.1| hypothetical protein LA1040 [Leptos... 51 1e-04
gi|45658461|ref|YP_002547.1| hypothetical protein LIC12624 [Lept... 49 4e-04
gi|13470451|ref|NP_102019.1| hypothetical protein mll0159 [Mesor... 47 0.002
gi|17427839|emb|CAD14529.1| CONSERVED HYPOTHETICAL PROTEIN [Rals... 47 0.002
gi|13470449|ref|NP_102017.1| hypothetical protein mll0158 [Mesor... 47 0.003
gi|17934312|ref|NP_531102.1| hypothetical protein Atu0397 [Agrob... 45 0.008
gi|10444251|gb|AAG17817.1| haem lyase [Naegleria gruberi] >gi|11... 45 0.008
gi|15155313|gb|AAK86212.1| AGR_C_699p [Agrobacterium tumefaciens... 45 0.008
gi|16262766|ref|NP_435559.1| hypothetical protein SMa0599 [Sinor... 45 0.010
gi|52008217|ref|ZP_00335594.1| COG0668: Small-conductance mechan... 44 0.013
gi|23347117|gb|AAN29277.1| lipoprotein, putative [Brucella suis ... 44 0.022
gi|36959002|gb|AAQ87427.1| Hypothetical protein RNGR00301 [Rhizo... 44 0.022
gi|23025044|ref|ZP_00064224.1| COG2211: Na+/melibiose symporter ... 43 0.037
gi|16264069|ref|NP_436861.1| hypothetical protein SMb20335 [Sino... 42 0.049
gi|49475541|ref|YP_033582.1| hypothetical protein BH07710 [Barto... 42 0.064
gi|14133569|gb|AAK54067.1| galactose permease [Lactobacillus bre... 42 0.064
gi|57242518|ref|ZP_00370456.1| competence locus E (comE3) [Campy... 42 0.083
gi|15965400|ref|NP_385753.1| hypothetical protein SMc00955 [Sino... 41 0.11
gi|34534287|dbj|BAC86958.1| unnamed protein product [Homo sapiens] 41 0.11
gi|49474175|ref|YP_032217.1| hypothetical protein BQ05560 [Barto... 41 0.14
gi|52009316|ref|ZP_00336680.1| COG4642: Uncharacterized protein ... 40 0.24
gi|17987877|ref|NP_540511.1| hypothetical protein BMEI1594 [Bruc... 40 0.24
gi|45915397|ref|ZP_00197078.1| COG4782: Uncharacterized protein ... 40 0.24
gi|49647997|emb|CAG82450.1| unnamed protein product [Yarrowia li... 40 0.32
gi|23612229|ref|NP_703809.1| hypothetical protein [Plasmodium fa... 39 0.41
gi|15899328|ref|NP_343933.1| hypothetical protein SSO2601 [Sulfo... 39 0.41
gi|29346015|ref|NP_809518.1| putative polysaccharide export prot... 39 0.41
gi|56680431|gb|AAV97097.1| lipoprotein, putative [Silicibacter p... 39 0.54
gi|36958719|gb|AAQ87187.1| Hypothetical protein RNGR00163 [Rhizo... 39 0.54
gi|17933991|ref|NP_530781.1| hypothetical protein Atu0072 [Agrob... 39 0.71
gi|15642876|ref|NP_227917.1| hypothetical protein TM0101 [Thermo... 39 0.71
gi|49146488|ref|YP_026062.1| NADH dehydrogenase subunit 6 [Aleur... 39 0.71
gi|27383091|ref|NP_774620.1| hypothetical protein blr7980 [Brady... 38 0.92
>gi|54658838|gb|EAL37451.1| hypothetical protein Chro.30077
[Cryptosporidium hominis]
Length = 1172
Score = 136 bits (342), Expect = 2e-30
Identities = 82/232 (35%), Positives = 118/232 (50%), Gaps = 20/232 (8%)
Query: 415 NWISSQ---SNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLY 471
NWI S + E ++FIHGY++ + DAL++ Q +A G+FP+YIKP VF+WPS S L Y
Sbjct: 746 NWIHSDGLHAAEALIFIHGYNNTIMDALRQVGQMIAFGNFPSYIKPMVFSWPSGNSFLEY 805
Query: 472 WCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKRKPI 531
+ A A H L + I L GIR++HIM HSMGTR F++SF K+ +
Sbjct: 806 FKARKSAESPHTHNSLYQLILGLKNRGIRHIHIMTHSMGTRLFIQSFPKLLSENLLERCE 865
Query: 532 VYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYIN-KINLTNLIFLNP 590
+ D+ IN K+ + ++ FLNP
Sbjct: 866 QHDHRSSIDPGNYLGLRHSETQFG---------------DEINPIINEKVQIVSMTFLNP 910
Query: 591 DYEINTFKND-YGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGI 641
DY ++ F N + LR YC IT+Y D +D A+K + I +R LG N+FG+
Sbjct: 911 DYYLDDFINKAFPMLRQYCNLITMYGDSQDGALKWSEIIQGRRALGCNVFGL 962
Score = 81.6 bits (200), Expect = 7e-14
Identities = 52/165 (31%), Positives = 80/165 (47%), Gaps = 16/165 (9%)
Query: 245 DGRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSW 304
+GRPHG G W + Y GE+L GFW++G P+GPF++ E + S + L++ G ++
Sbjct: 353 EGRPHGFGRWREDDYYGEILVGFWKNGHPVGPFKTRECRSGSGFICLKL-----GFCRTE 407
Query: 305 LDRIPLNIGVASIECCVSGNFFKGYPKVSMIKGPDLCKCQNRCTCIQSLLDKKYYRHIDD 364
+ P G A IEC VSG FF+G+P + P + NR I + + K +DD
Sbjct: 408 ANLTPNLYGYADIECSVSGQFFRGFPLCHIYPAPSI--GMNRSGSIFARIGAKGKAALDD 465
Query: 365 DKTITSIVVSLDKKMDALAI--SGFKPLHPKNKTVSIEIGRQENV 407
KKM S FK + K+K + ++E +
Sbjct: 466 -------TTQKGKKMVGAVFNSSWFKKIRKKDKDKKTTLEKKEGI 503
Score = 56.2 bits (134), Expect = 3e-06
Identities = 27/70 (38%), Positives = 47/70 (66%), Gaps = 1/70 (1%)
Query: 649 LDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVGDVFRF 707
LD+D+ID ++ N+ H+ +N+NR +++DL +L+V+ KRA R++RL K G+V+ +
Sbjct: 1101 LDMDVIDQSFIEQNVGTMRHNNWNLNREVIEDLRELVVSRKRAYQRSTRLDKREGNVWVY 1160
Query: 708 SILPSTVVVV 717
I PS V +
Sbjct: 1161 RIAPSCVTSI 1170
>gi|46228303|gb|EAK89202.1| hypothetical protein with 4
transmembrane domains, possible unusual phyletic
distribution [Cryptosporidium parvum]
Length = 1173
Score = 135 bits (339), Expect = 6e-30
Identities = 81/232 (34%), Positives = 118/232 (49%), Gaps = 20/232 (8%)
Query: 415 NWISSQ---SNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLY 471
NWI S + E ++FIHGY++ + DAL++ Q +A G+FP+YIKP VF+WPS S L Y
Sbjct: 747 NWIHSDGLHAAEALIFIHGYNNTIMDALRQVGQMIAFGNFPSYIKPMVFSWPSGNSFLEY 806
Query: 472 WCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKRKPI 531
+ A A H L + I L GIR++HIM HSMGTR F++SF K+ +
Sbjct: 807 FKARKSAESPHTHNSLYQLILGLKNRGIRHIHIMTHSMGTRLFIQSFPKLLSENLLERCE 866
Query: 532 VYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYIN-KINLTNLIFLNP 590
+ D+ IN K+ + ++ FLNP
Sbjct: 867 QHDHRSSIDPGNYLGLRHSETQFG---------------DEINPIINEKVQIVSMTFLNP 911
Query: 591 DYEINTFKND-YGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGI 641
DY ++ F N + LR YC IT+Y D +D A+K + I ++ LG N+FG+
Sbjct: 912 DYYLDDFINKAFPMLRQYCNLITMYGDSQDGALKWSEIIQGRKALGCNVFGL 963
Score = 82.0 bits (201), Expect = 6e-14
Identities = 52/165 (31%), Positives = 81/165 (48%), Gaps = 16/165 (9%)
Query: 245 DGRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSW 304
+GRPHG G W + Y GE+L GFW++G P+GPF++ E + S + L++ G ++
Sbjct: 354 EGRPHGFGRWREDDYYGEILVGFWKNGHPVGPFKTRECRSGSGFICLKL-----GFCRTE 408
Query: 305 LDRIPLNIGVASIECCVSGNFFKGYPKVSMIKGPDLCKCQNRCTCIQSLLDKKYYRHIDD 364
+ P G A IEC VSG FF+G+P + P + NR I + + K +DD
Sbjct: 409 ANLTPNLYGYADIECSVSGQFFRGFPLCHIYPAPSI--GMNRSGSIFARIGAKGKAALDD 466
Query: 365 DKTITSIVVSLDKKMDALAI--SGFKPLHPKNKTVSIEIGRQENV 407
KKM S FK + K+K + ++E++
Sbjct: 467 -------TTQKGKKMVGAVFNSSWFKKIRKKDKDKKTALEKKESI 504
Score = 56.2 bits (134), Expect = 3e-06
Identities = 27/70 (38%), Positives = 47/70 (66%), Gaps = 1/70 (1%)
Query: 649 LDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVGDVFRF 707
LD+D+ID ++ N+ H+ +N+NR +++DL +L+V+ KRA R++RL K G+V+ +
Sbjct: 1102 LDMDVIDQSFIEQNVGTMRHNNWNLNREVIEDLRELVVSRKRAYQRSTRLDKREGNVWVY 1161
Query: 708 SILPSTVVVV 717
I PS V +
Sbjct: 1162 RIAPSCVTSI 1171
>gi|56497588|emb|CAI00101.1| conserved hypothetical protein
[Plasmodium berghei]
Length = 1497
Score = 129 bits (324), Expect = 3e-28
Identities = 77/241 (31%), Positives = 121/241 (49%), Gaps = 42/241 (17%)
Query: 405 ENVQKLIVDNNWISSQ---SNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFN 461
+N+++L ++ W+ + S E ++FIHGY+ +AL+ Q + G+FPNYIK F+FN
Sbjct: 1046 DNIKELSLEG-WVRCELAGSLEAVIFIHGYNTSHLEALQILGQMASFGNFPNYIKLFLFN 1104
Query: 462 WPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKI 521
WPS + L ++ A + + + H + F+ +L +GI+ +HI+ HSMGTR FL SF I
Sbjct: 1105 WPSGKNILEFFIARENSKNKEIHHAFKSFLNTLRNNGIKQIHIITHSMGTRMFLLSFHDI 1164
Query: 522 KKAFAKRKPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKIN 581
K F E+D NK+
Sbjct: 1165 VKGEL-------------------------------------FSNIDEEDNVENNKNKMK 1187
Query: 582 LTNLIFLNPDYEINTFKN-DYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFG 640
L L +NP+Y +N F N +Y LR+YC I++Y D D+A+K A + ++LG NIF
Sbjct: 1188 LITLTMMNPEYSLNDFVNKEYIFLRSYCTVISIYCDSNDKALKWAEIFSGTKSLGKNIFD 1247
Query: 641 I 641
+
Sbjct: 1248 L 1248
Score = 74.7 bits (182), Expect = 9e-12
Identities = 36/87 (41%), Positives = 51/87 (58%), Gaps = 5/87 (5%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
GRPHG G W +GE+L G+W GIP+GPF+ + T S + ++I YG
Sbjct: 360 GRPHGFGYWRGIHSEGEVLIGYWYHGIPVGPFKCRDLKTGSGFMCIKIGYGHTS-----C 414
Query: 306 DRIPLNIGVASIECCVSGNFFKGYPKV 332
+ L IG+A ECCVSG F++ +P+V
Sbjct: 415 EPNDLKIGLADTECCVSGAFYRTFPRV 441
Score = 62.4 bits (150), Expect = 5e-08
Identities = 32/70 (45%), Positives = 49/70 (69%), Gaps = 1/70 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D+ LDVD+IDT L SN+ HS++++NR +++D+ +LIVT KRA RTSRL + G
Sbjct: 1418 DNRDWLDVDVIDTTWLGSNVHTLRHSYWSLNREIIEDIRELIVTRKRARQRTSRLDRREG 1477
Query: 703 DVFRFSILPS 712
+V+ + + PS
Sbjct: 1478 NVWVYRVAPS 1487
>gi|56519381|emb|CAH75495.1| conserved hypothetical protein
[Plasmodium chabaudi]
Length = 609
Score = 129 bits (323), Expect = 4e-28
Identities = 82/290 (28%), Positives = 141/290 (48%), Gaps = 55/290 (18%)
Query: 405 ENVQKLIVDNNWISSQ---SNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFN 461
+N+++L ++ W+ + S E ++FIHGY+ +AL+ Q + G+FPNYIK F+FN
Sbjct: 159 DNIKELSLEG-WVRCELAGSLEAVIFIHGYNTSHLEALQILGQMASFGNFPNYIKLFLFN 217
Query: 462 WPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKI 521
WPS + L ++ A + + + H + F+++L +GI+ +HI+ HSMGTR FL +F I
Sbjct: 218 WPSGKNMLEFFIARENSKNKEIHHAFKSFLDTLRNNGIKQIHIITHSMGTRMFLLAFHDI 277
Query: 522 KKAFAKRKPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKIN 581
K F E+D NK+
Sbjct: 278 VKGGL-------------------------------------FSTIDEEDNVENNQNKMK 300
Query: 582 LTNLIFLNPDYEINTFKN-DYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFG 640
L L +NP+Y +N F N +Y LR+YC I++Y D D+A+K A + ++LG N+F
Sbjct: 301 LITLTMMNPEYSLNDFVNKEYIFLRSYCTVISIYCDSNDKALKWAEIFSGTKSLGKNVF- 359
Query: 641 IVDDDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKR 690
D+++ ++S ++ F+ N+L D + + + K+
Sbjct: 360 ---------DLNVSKNNFSKWSISSSENNVFDSNKL---DCYSIQIDNKK 397
Score = 62.4 bits (150), Expect = 5e-08
Identities = 32/70 (45%), Positives = 49/70 (69%), Gaps = 1/70 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D+ LDVD+IDT L SN+ HS++++NR +++D+ +LIVT KRA RTSRL + G
Sbjct: 530 DNRDWLDVDVIDTTWLGSNVHTLRHSYWSLNREIIEDIRELIVTRKRARQRTSRLDRREG 589
Query: 703 DVFRFSILPS 712
+V+ + + PS
Sbjct: 590 NVWVYRVAPS 599
>gi|23498780|emb|CAD50850.1| hypothetical protein [Plasmodium
falciparum 3D7]
ref|NP_704042.1| hypothetical protein [Plasmodium falciparum 3D7]
Length = 2310
Score = 126 bits (316), Expect = 3e-27
Identities = 74/241 (30%), Positives = 123/241 (50%), Gaps = 42/241 (17%)
Query: 405 ENVQKLIVDNNWISSQSN---EGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFN 461
+++++L +D W+ + E ++FIHGY+ +AL+ Q + G+FPNYIK F+FN
Sbjct: 1859 DDIKELSLDG-WVKCELAGCLEAVIFIHGYNTSHLEALQILGQMASFGNFPNYIKLFLFN 1917
Query: 462 WPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKI 521
WPS + L ++ A + + H + F+++L +GIR +HI+ HSMGTR FL +F I
Sbjct: 1918 WPSGKNLLEFFIAKDNSQNKKVHHAFKSFLDTLRNNGIRQIHIITHSMGTRMFLLAFHDI 1977
Query: 522 KKAFAKRKPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKIN 581
+ F E++K KY NK+
Sbjct: 1978 VSSEL-------------------------------------FSTIEEKEKDEKYQNKMK 2000
Query: 582 LTNLIFLNPDYEINTFKN-DYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFG 640
L L +NP+Y ++ F N +Y LR++C I++Y D D+A+K A + ++LG N+F
Sbjct: 2001 LITLTMMNPEYYLSDFVNKEYIFLRSFCTVISIYCDSNDKALKWAEIFSGTKSLGKNVFD 2060
Query: 641 I 641
+
Sbjct: 2061 L 2061
Score = 77.4 bits (189), Expect = 1e-12
Identities = 37/87 (42%), Positives = 54/87 (61%), Gaps = 5/87 (5%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
GRPHG G W + +GE+L G+W GIP+GPF+ + T S + ++I G GK+
Sbjct: 1009 GRPHGFGYWRGINLEGEVLIGYWYHGIPVGPFKCRDFKTGSGFMCIKI-----GYGKTNC 1063
Query: 306 DRIPLNIGVASIECCVSGNFFKGYPKV 332
+ L IG+A ECCVSG F++ +P+V
Sbjct: 1064 ELNDLEIGLADTECCVSGAFYRTFPRV 1090
Score = 62.4 bits (150), Expect = 5e-08
Identities = 32/70 (45%), Positives = 49/70 (69%), Gaps = 1/70 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D+ LDVD+IDT L SN+ HS++++NR +++D+ +LIVT KRA RTSRL + G
Sbjct: 2231 DNRDWLDVDVIDTTWLGSNVHTLRHSYWSLNREIIEDIRELIVTRKRARQRTSRLDRREG 2290
Query: 703 DVFRFSILPS 712
+V+ + + PS
Sbjct: 2291 NVWVYRVAPS 2300
>gi|23490733|gb|EAA22438.1| hypothetical protein [Plasmodium yoelii
yoelii]
Length = 1535
Score = 77.4 bits (189), Expect = 1e-12
Identities = 63/205 (30%), Positives = 93/205 (44%), Gaps = 15/205 (7%)
Query: 160 FIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPYIMRCIFSITGAITLNVDKDQWVRGE 219
++FN Y+KQ I + S S I + V +C F T +K Q +
Sbjct: 658 YLFNTLYKKQDAIKNEDIESNTK--SAIGYNVVIKKEKCSFFFTKFFR---NKKQTHKKN 712
Query: 220 SNRFTIKQKNLFQKPSTCIYEGPL-LDGRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFE 278
+ R +IK + Y G L GRPHG G W +GE+L G+W GIP+GPF+
Sbjct: 713 TKRTSIK--TIEMDFYIIFYIGELDRKGRPHGFGYWRGIHSEGEVLIGYWYHGIPVGPFK 770
Query: 279 SMENDTRSLLVNLRIIYGTNGGGKSWLDRIPLNIGVASIECCVSGNFFKGYPKVSMIKGP 338
+ T S + ++I YG + L IG+A ECCVSG F++ +P+V
Sbjct: 771 CRDLKTGSGFMCIKIGYGHTS-----CEPNDLKIGLADTECCVSGAFYRTFPRVVFYNLN 825
Query: 339 DLCKCQNRCTCIQSLLDKK--YYRH 361
+ + + +DKK YY H
Sbjct: 826 LTLQDSKKKITKKKKMDKKNNYYNH 850
Score = 65.9 bits (159), Expect = 4e-09
Identities = 31/97 (31%), Positives = 58/97 (58%), Gaps = 4/97 (4%)
Query: 405 ENVQKLIVDNNWISSQ---SNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFN 461
+++++L ++ W+ + S E ++FIHGY+ +AL+ Q + G+FPNYIK F+FN
Sbjct: 1433 DDIKELSLEG-WVRCELAGSLEAVIFIHGYNTSHLEALQILGQMASFGNFPNYIKLFLFN 1491
Query: 462 WPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQSG 498
WPS + L ++ A + + + H + F+ +L +G
Sbjct: 1492 WPSGKNMLEFFIARENSKNKEIHHAFKSFLNTLRNNG 1528
>gi|23619520|ref|NP_705482.1| hypothetical protein [Plasmodium
falciparum 3D7]
emb|CAD52719.1| hypothetical protein [Plasmodium falciparum 3D7]
Length = 2126
Score = 77.0 bits (188), Expect = 2e-12
Identities = 39/85 (45%), Positives = 49/85 (56%), Gaps = 4/85 (4%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
GRPHG G W++ GE L GFW G P+GPF S E + SL VN R+ + GK W
Sbjct: 630 GRPHGYGEWIEDHSYGEKLRGFWYHGYPVGPFISQEIGSGSLFVNTRVGFAA-CVGKDWS 688
Query: 306 DRIPLNIGVASIECCVSGNFFKGYP 330
D + GVA EC +SG+FF +P
Sbjct: 689 D---VRYGVACTECSISGHFFNDFP 710
Score = 68.2 bits (165), Expect = 8e-10
Identities = 50/190 (26%), Positives = 86/190 (44%), Gaps = 25/190 (13%)
Query: 353 LLDKKYYRHIDDDKTITSIVVSLDKKMDALAISGFKPLHP----KNKTVSI--------E 400
L KKY ++ +++ I +I + ++ + P +NKT++ E
Sbjct: 1444 LRSKKYKKNYKNERIIGTIGNKMRNLNESYGNFNTPTISPTNIKRNKTIAQVFAEEDEWE 1503
Query: 401 IGRQENVQKLIVDNNW--------ISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFP 452
R ++ QK+++D W ++ E +++IHGY+ L + A ++ P
Sbjct: 1504 QFRHQHKQKIVIDG-WQSLSLKQNLNFMPEEILIYIHGYNVKLNHGCSQLAHLVSFSKLP 1562
Query: 453 NYIKPFVFNWPSST----SPLLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHS 508
YI+PFVF+W + S L Y A + FI+ L SGI+N+HI+ HS
Sbjct: 1563 AYIQPFVFHWEGAMWGAFSALSYPVAKKRTEMTILGNSFRTFIKELINSGIKNVHIISHS 1622
Query: 509 MGTRFFLRSF 518
G+R F F
Sbjct: 1623 CGSRLFFNGF 1632
Score = 57.0 bits (136), Expect = 2e-06
Identities = 29/93 (31%), Positives = 57/93 (61%), Gaps = 1/93 (1%)
Query: 623 KMAFRITQKRNLGNNIFGIVDDDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLH 682
K +F+++ ++ N+ I D LD+D+IDT +++N+ HSF+ + R ++DD+
Sbjct: 2029 KKSFKMSNQKFKKNDTVYISFDKYAWLDMDVIDTTFVETNVDFLKHSFYQVKREIIDDIR 2088
Query: 683 DLIVTGKRAMDRTSRL-KSVGDVFRFSILPSTV 714
+++++ RA +R SRL + G+VF + P+ V
Sbjct: 2089 EVLISNIRAHERVSRLDRRRGNVFVLRVAPAGV 2121
Score = 50.1 bits (118), Expect = 2e-04
Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 1/78 (1%)
Query: 563 GFVQQPEQDKSLKYINKINLTNLIFLNPDYEINTF-KNDYGELRNYCPRITVYADHRDEA 621
G V+ K+ K +I + +I LNPDY ++TF + D+ LR++C I +Y D RD+A
Sbjct: 1772 GNVKTNTDKKNKKQKKQIIVKTVILLNPDYPLDTFLEKDFFMLRSHCNHIVMYGDTRDQA 1831
Query: 622 IKMAFRITQKRNLGNNIF 639
+ + +++ LG IF
Sbjct: 1832 LTYSETWNREKCLGKRIF 1849
>gi|56513961|emb|CAH86598.1| hypothetical protein PC302073.00.0
[Plasmodium chabaudi]
Length = 255
Score = 76.3 bits (186), Expect = 3e-12
Identities = 62/205 (30%), Positives = 93/205 (45%), Gaps = 16/205 (7%)
Query: 160 FIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPYIMRCIFSITGAITLNVDKDQWVRGE 219
++FN Y+KQ I + S S I + V +C F T K + +
Sbjct: 5 YLFNTLYKKQDTIKHEDIESNTK--SAIGYNVVIKKEKCSFFFTNFFRKK--KQTYKKKI 60
Query: 220 SNRFTIKQKNLFQKPSTCIYEGPL-LDGRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFE 278
N+ N + Y G L GRPHG G W +GE+L G+W GIP+GPF+
Sbjct: 61 KNKIKGIDMNFY----IIFYIGELDRKGRPHGFGYWRGIHSEGEVLIGYWYHGIPVGPFK 116
Query: 279 SMENDTRSLLVNLRIIYGTNGGGKSWLDRIPLNIGVASIECCVSGNFFKGYPKVSMIKGP 338
+ T S + ++I G G++ + L IG+A ECCVSG F++ +P+V
Sbjct: 117 CRDLKTGSGFMCIKI-----GYGRTSCEPNDLTIGLADTECCVSGAFYRTFPRVVFYNLN 171
Query: 339 DLCKCQNRCTCIQSLLDKK--YYRH 361
+ + T + LDKK Y +H
Sbjct: 172 LTLQNNKKKTNRKKNLDKKPNYSKH 196
>gi|56526585|emb|CAH89026.1| conserved hypothetical protein
[Plasmodium chabaudi]
Length = 1498
Score = 75.9 bits (185), Expect = 4e-12
Identities = 41/100 (41%), Positives = 56/100 (56%), Gaps = 5/100 (5%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
G+PHG G W++ GE L GFW G P+GPF S E T SL VN R+ + GK W
Sbjct: 328 GKPHGYGEWIEDHSFGEKLRGFWFHGYPVGPFISQEVGTGSLFVNTRVGFAA-CIGKDWS 386
Query: 306 DRIPLNIGVASIECCVSGNFFKGYPKVSMIKGPDLCKCQN 345
D + GV+ EC +SG+FF +P ++ P + K +N
Sbjct: 387 D---VRYGVSCTECSISGHFFNDFP-LTHFFNPKIAKNEN 422
Score = 69.3 bits (168), Expect = 4e-10
Identities = 52/187 (27%), Positives = 90/187 (47%), Gaps = 29/187 (15%)
Query: 363 DDDKTITSIVVSLDKKMDALA------ISGFKPLHPKNKTVSI--------EIGRQENVQ 408
++ K S+V + D++A ++GF +NKT++ E R + +
Sbjct: 866 ENTKIFNSLVKKMKNLNDSIASFNGANLTGFNS--QRNKTIAQVFAEEDEWENYRHHHKE 923
Query: 409 KLIVDNNW--ISSQSN------EGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVF 460
+++VD W +S + N E +++IHGY+ L + A ++ P+YI+PFVF
Sbjct: 924 EIVVDG-WQPLSLKRNLNIIPDEIVIYIHGYNVRLHHGCSQLAHLVSFSKLPSYIQPFVF 982
Query: 461 NWPSSTSPLLYWCAHSVASDNDNHRDL----QKFIESLGQSGIRNLHIMCHSMGTRFFLR 516
+W S + ++ VA L +KFI+ L GI+N+HI+ HS G+R F
Sbjct: 983 HWEGSMWGIFSALSYPVAKKRSEMAILGDSFKKFIQDLINWGIKNVHIISHSCGSRLFFN 1042
Query: 517 SFSKIKK 523
F+ K
Sbjct: 1043 GFASCVK 1049
Score = 52.4 bits (124), Expect = 5e-05
Identities = 25/72 (34%), Positives = 46/72 (63%), Gaps = 1/72 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D LD+D+IDT +++N+ HSF+ + R ++DD+ +++++ RA +R SRL + G
Sbjct: 1422 DKYAWLDMDVIDTTFVETNVDFLKHSFYQVKREIIDDIREILISNVRAHERVSRLDRRRG 1481
Query: 703 DVFRFSILPSTV 714
+V+ I P+ V
Sbjct: 1482 NVYVLRIAPAGV 1493
Score = 48.9 bits (115), Expect = 5e-04
Identities = 23/63 (36%), Positives = 41/63 (64%), Gaps = 1/63 (1%)
Query: 578 NKINLTNLIFLNPDYEINTF-KNDYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGN 636
N+I + +I LNPDY ++ F + D+ LR++C I +Y D RD+A++ + +++ LG
Sbjct: 1169 NQIIVKTIILLNPDYSLDKFLETDFFLLRSHCNHIVMYGDTRDQALRYSETWNREKCLGK 1228
Query: 637 NIF 639
+IF
Sbjct: 1229 SIF 1231
>gi|56499773|emb|CAH97091.1| conserved hypothetical protein
[Plasmodium berghei]
Length = 1497
Score = 75.5 bits (184), Expect = 5e-12
Identities = 41/100 (41%), Positives = 56/100 (56%), Gaps = 5/100 (5%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
G+PHG G W++ GE L GFW G P+GPF S E T SL VN R+ + GK W
Sbjct: 329 GKPHGYGEWIEDHSFGEKLRGFWFHGYPVGPFISQEVGTGSLFVNTRVGFAA-CIGKDWS 387
Query: 306 DRIPLNIGVASIECCVSGNFFKGYPKVSMIKGPDLCKCQN 345
D + GV+ EC +SG+FF +P ++ P + K +N
Sbjct: 388 D---VRYGVSCTECSISGHFFSDFP-LTHFFNPKISKNEN 423
Score = 69.7 bits (169), Expect = 3e-10
Identities = 56/218 (25%), Positives = 95/218 (42%), Gaps = 30/218 (13%)
Query: 325 FFKGYPKVSMIKGPDLCKCQNRCTCIQSLLDKKYYRHIDDDKTITSIVVSLDKKMDALAI 384
F K Y ++ K +N T Q + ++ K S+V + D++A
Sbjct: 837 FIKSYRTNGLLNASSNSKNENEITSKQPPKN-------ENTKIFNSLVKKMKNLNDSIAS 889
Query: 385 SGFKPLHP----KNKTVSIEIGRQENVQKL-------IVDNNW--ISSQSN------EGI 425
L P +NKT++ ++ + IV + W +S + N E +
Sbjct: 890 FNGANLTPFNSQRNKTIAQVFAEEDEWENYRHYHKEEIVVDGWQPLSLKRNLNIIPDEIV 949
Query: 426 LFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHR 485
++IHGY+ L + A ++ P+YI+PFVF+W S + ++ VA
Sbjct: 950 IYIHGYNVKLHHGCSQLAHLVSFSKLPSYIQPFVFHWEGSMWGIFSALSYPVAKKRSEMA 1009
Query: 486 DL----QKFIESLGQSGIRNLHIMCHSMGTRFFLRSFS 519
L +KFI+ L GI+N+HI+ HS G+R F F+
Sbjct: 1010 ILGDSFKKFIQDLINWGIKNVHIISHSCGSRLFFNGFA 1047
Score = 53.5 bits (127), Expect = 2e-05
Identities = 26/72 (36%), Positives = 46/72 (63%), Gaps = 1/72 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D LD+D+IDT +++N+ HSF+ + R ++DD+ +++++ RA +R SRL + G
Sbjct: 1421 DKYAWLDMDVIDTTFVETNVDFLKHSFYQVKREIIDDIREILISNVRAHERVSRLDRRRG 1480
Query: 703 DVFRFSILPSTV 714
+VF I P+ V
Sbjct: 1481 NVFVLRIAPAGV 1492
Score = 47.8 bits (112), Expect = 0.001
Identities = 27/83 (32%), Positives = 48/83 (57%), Gaps = 8/83 (9%)
Query: 565 VQQPEQDKSLKYIN-------KINLTNLIFLNPDYEINTF-KNDYGELRNYCPRITVYAD 616
VQ+ E +K K N ++ + +I LNPDY ++ F + D+ LR++C I +Y D
Sbjct: 1152 VQKKELNKKKKINNNKKNKKKQVIVKTIILLNPDYPLDKFLEKDFFLLRSHCNHIVMYGD 1211
Query: 617 HRDEAIKMAFRITQKRNLGNNIF 639
RD+A++ + +++ LG +IF
Sbjct: 1212 TRDQALRYSETWNREKCLGKSIF 1234
>gi|23484006|gb|EAA19486.1| ERYTHROCYTE MEMBRANE PROTEIN PFEMP3
[Plasmodium yoelii yoelii]
Length = 1709
Score = 75.5 bits (184), Expect = 5e-12
Identities = 41/100 (41%), Positives = 56/100 (56%), Gaps = 5/100 (5%)
Query: 246 GRPHGIGTWMDTSYQGELLTGFWEDGIPLGPFESMENDTRSLLVNLRIIYGTNGGGKSWL 305
G+PHG G W++ GE L GFW G P+GPF S E T SL VN R+ + GK W
Sbjct: 558 GKPHGYGEWIEDHSFGEKLRGFWFHGYPVGPFISQEVGTGSLFVNTRVGFAA-CIGKDWS 616
Query: 306 DRIPLNIGVASIECCVSGNFFKGYPKVSMIKGPDLCKCQN 345
D + GV+ EC +SG+FF +P ++ P + K +N
Sbjct: 617 D---VRYGVSCTECSISGHFFSDFP-LTHFFNPKISKNEN 652
Score = 66.2 bits (160), Expect = 3e-09
Identities = 48/180 (26%), Positives = 83/180 (45%), Gaps = 23/180 (12%)
Query: 363 DDDKTITSIVVSLDKKMDALAISGFKPLHP----KNKTVSIEIGRQENVQKL-------I 411
++ K S+V + D++A L P +NKT++ ++ + I
Sbjct: 1098 ENTKIFNSLVKKMKNLNDSIASFNGANLTPFNSQRNKTIAQVFAEEDEWENYRHYHKEEI 1157
Query: 412 VDNNW--ISSQSN------EGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWP 463
V + W +S + N E +++IHGY+ L + A ++ P+YI+PFVF+W
Sbjct: 1158 VVDGWQPLSLKRNLNIIPDEIVIYIHGYNVKLHHGCSQLAHLVSFSKLPSYIQPFVFHWE 1217
Query: 464 SSTSPLLYWCAHSVASDNDNHRDL----QKFIESLGQSGIRNLHIMCHSMGTRFFLRSFS 519
+ ++ VA L +KFI+ L GI+N+HI+ HS G+R F F+
Sbjct: 1218 GYMWGIFSALSYPVAKKRSEMAILGDSFKKFIQDLINWGIKNVHIISHSCGSRLFFNGFA 1277
Score = 50.1 bits (118), Expect = 2e-04
Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 1/70 (1%)
Query: 571 DKSLKYINKINLTNLIFLNPDYEINTF-KNDYGELRNYCPRITVYADHRDEAIKMAFRIT 629
+K K N+I + +I LNPDY ++ F + D+ LR++C I +Y D RD+A++ +
Sbjct: 1394 NKKKKKKNQIIVKTIILLNPDYPLDKFLEKDFFLLRSHCNHIVMYGDTRDQALRYSETWN 1453
Query: 630 QKRNLGNNIF 639
+++ LG +IF
Sbjct: 1454 REKCLGKSIF 1463
>gi|23490734|gb|EAA22439.1| hypothetical protein [Plasmodium yoelii
yoelii]
Length = 416
Score = 70.9 bits (172), Expect = 1e-10
Identities = 47/149 (31%), Positives = 67/149 (44%), Gaps = 38/149 (25%)
Query: 494 LGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKRKPIVYXXXXXXXXXXXXXXXXXXXX 553
L +GI+ +HI+ HSMGTR FL SF I KA
Sbjct: 53 LSFAGIKQIHIITHSMGTRMFLLSFHDIVKADL--------------------------- 85
Query: 554 XXXXXXXXXGFVQQPEQDKSLKYINKINLTNLIFLNPDYEINTFKN-DYGELRNYCPRIT 612
F E+D NK+ L L +NP+Y +N F N +Y LR+YC I+
Sbjct: 86 ----------FSNIDEEDNVENNKNKMKLITLTMMNPEYSLNDFVNKEYIFLRSYCTVIS 135
Query: 613 VYADHRDEAIKMAFRITQKRNLGNNIFGI 641
+Y D D+A+K A + ++LG N+F +
Sbjct: 136 IYCDSNDKALKWAEIFSGTKSLGKNVFDL 164
Score = 62.4 bits (150), Expect = 5e-08
Identities = 32/70 (45%), Positives = 49/70 (69%), Gaps = 1/70 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D+ LDVD+IDT L SN+ HS++++NR +++D+ +LIVT KRA RTSRL + G
Sbjct: 337 DNRDWLDVDVIDTTWLGSNVHTLRHSYWSLNREIIEDIRELIVTRKRARQRTSRLDRREG 396
Query: 703 DVFRFSILPS 712
+V+ + + PS
Sbjct: 397 NVWVYRVAPS 406
>gi|56507955|emb|CAH87264.1| hypothetical protein PC302397.00.0
[Plasmodium chabaudi]
Length = 112
Score = 62.4 bits (150), Expect = 5e-08
Identities = 32/70 (45%), Positives = 49/70 (69%), Gaps = 1/70 (1%)
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRL-KSVG 702
D+ LDVD+IDT L SN+ HS++++NR +++D+ +LIVT KRA RTSRL + G
Sbjct: 33 DNRDWLDVDVIDTTWLGSNVHTLRHSYWSLNREIIEDIRELIVTRKRARQRTSRLDRREG 92
Query: 703 DVFRFSILPS 712
+V+ + + PS
Sbjct: 93 NVWVYRVAPS 102
>gi|48891474|ref|ZP_00324981.1| COG4782: Uncharacterized protein
conserved in bacteria [Trichodesmium erythraeum IMS101]
Length = 661
Score = 52.4 bits (124), Expect = 5e-05
Identities = 67/296 (22%), Positives = 116/296 (38%), Gaps = 64/296 (21%)
Query: 405 ENVQKLIVDNNWISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPS 464
EN+ + + D+ I+ +S ++F+HGY+ + +DA R AQ P I F ++WPS
Sbjct: 417 ENINEELKDHE-INERS--ALVFVHGYNVNFEDAAIRAAQMGFDLQVPG-ITAF-YSWPS 471
Query: 465 STSPLLYWCAHSVASDNDNHRDLQKFIESLGQ-SGIRNLHIMCHSMGTRFFLRSFSKIKK 523
Y AS + + + +F+ +L + + I +HI+ HSMG R LR+ +I
Sbjct: 472 QGKLSAY--PVDEASIEASEKYMTEFLLNLAEKTDIEKIHIIAHSMGNRGLLRAVQRI-- 527
Query: 524 AFAKRKPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKINLT 583
++ I I
Sbjct: 528 -----------------------------------------------ISQVQTITNIAFG 540
Query: 584 NLIFLNPDYEINTFKNDYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGIVD 643
+I PD +I+ FK R T+Y +D+A+ + I Q G F V
Sbjct: 541 QIILAAPDVDIDLFKELAKGYHQLAERTTLYISSKDKALATSALIHQHGRAG--FFPPVT 598
Query: 644 DDGGMLDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRLK 699
G +D + +D ++ H +F RL+++D+ DL++ R RL+
Sbjct: 599 VVEG---IDTVKVSKID--LTLLGHGYFADARLVLEDIRDLLINNTSPGQRRGRLE 649
>gi|24213740|ref|NP_711221.1| hypothetical protein LA1040
[Leptospira interrogans serovar Lai str. 56601]
gb|AAN48239.1| conserved hypothetical protein [Leptospira interrogans serovar lai
str. 56601]
Length = 360
Score = 50.8 bits (120), Expect = 1e-04
Identities = 62/295 (21%), Positives = 118/295 (39%), Gaps = 78/295 (26%)
Query: 422 NEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSP------------- 468
+E ++F+HG++ ++A+ R Q FP K +F WP+
Sbjct: 124 DEILVFVHGFNVKFEEAILRGGQIRFDLKFPG--KMIIFTWPAGNEEVGLVSQVLLNQIL 181
Query: 469 LLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKR 528
L +++S + ++ + FI L +G + +H++ HSMG + L + S+I K
Sbjct: 182 LKKTYEKNLSSAKASKKEFKSFINYLQNAG-KKIHLIVHSMGHQVVLPALSEIGK----- 235
Query: 529 KPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKINLTNLIFL 588
E DK L + LI
Sbjct: 236 ----------------------------------------ETDKPL-------IQELILN 248
Query: 589 NPDYEINTFKNDYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGIVDDDGGM 648
PD++ F+ L RIT+Y D A++++ + Q LG+ V +G
Sbjct: 249 APDFDSAEFRLISDSLIKSSKRITLYCSPGDNALQISASLNQGSRLGS----CVPIEG-- 302
Query: 649 LDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRLKSVGD 703
D+++ +DS++ H +++ +R ++ D++ +++ G RA R KS G+
Sbjct: 303 --FDVVNVNPVDSSLISIGHGYYS-SRPLLTDIYQILL-GVRAEKRLFIRKSSGN 353
>gi|45658461|ref|YP_002547.1| hypothetical protein LIC12624
[Leptospira interrogans serovar Copenhageni str. Fiocruz
L1-130]
gb|AAS71184.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
Length = 360
Score = 49.3 bits (116), Expect = 4e-04
Identities = 60/295 (20%), Positives = 116/295 (38%), Gaps = 78/295 (26%)
Query: 422 NEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSP------------- 468
+E ++F+HG++ ++A+ R Q FP K +F WP+
Sbjct: 124 DEILVFVHGFNVKFEEAILRGGQIRFDLKFPG--KMIIFTWPAGNEEVGLVSQVLLNQIL 181
Query: 469 LLYWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKR 528
L +++S + ++ + FI L +G + +H++ HSMG + L + S+I K
Sbjct: 182 LKKTYEKNLSSAKASKKEFKSFINYLQNAG-KKIHLIVHSMGHQVVLPALSEIGK----- 235
Query: 529 KPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKINLTNLIFL 588
E DK L + LI
Sbjct: 236 ----------------------------------------ETDKPL-------IQELILN 248
Query: 589 NPDYEINTFKNDYGELRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGIVDDDGGM 648
PD++ F+ L RIT+Y D A++++ + Q LG+ +
Sbjct: 249 APDFDSAEFRLISDSLIKSSKRITLYCSPGDNALQISASLNQGSRLGS--------CAPI 300
Query: 649 LDVDIIDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRLKSVGD 703
D+++ +DS++ H +++ +R ++ D++ +++ G RA R KS G+
Sbjct: 301 EGFDVVNVNPVDSSLISIGHGYYS-SRPLLTDIYQILL-GVRAEKRLFIRKSSGN 353
>gi|13470451|ref|NP_102019.1| hypothetical protein mll0159
[Mesorhizobium loti MAFF303099]
dbj|BAB47805.1| mll0159 [Mesorhizobium loti MAFF303099]
Length = 368
Score = 47.4 bits (111), Expect = 0.002
Identities = 28/95 (29%), Positives = 46/95 (47%), Gaps = 4/95 (4%)
Query: 417 ISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHS 476
I+++ ++F+HGY+ DA+ R Q + +P P +F+W S Y +
Sbjct: 135 IAARGGRVMVFVHGYNTGFDDAVYRLTQIVHDSGYPG--TPVLFSWASGAKTTDY--VYD 190
Query: 477 VASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGT 511
S + L+ + L QSG R + I+ HSMGT
Sbjct: 191 KESASAARDQLEVTLRMLQQSGARRIDIVAHSMGT 225
>gi|17427839|emb|CAD14529.1| CONSERVED HYPOTHETICAL PROTEIN
[Ralstonia solanacearum]
ref|NP_518948.1| hypothetical protein RSc0827 [Ralstonia solanacearum GMI1000]
Length = 270
Score = 47.4 bits (111), Expect = 0.002
Identities = 63/288 (21%), Positives = 108/288 (36%), Gaps = 64/288 (22%)
Query: 417 ISSQSNEGIL-FIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAH 475
I SN+ +L F+HGY+ DA +R AQ FP F+WPS L Y
Sbjct: 32 IQISSNKSMLVFVHGYNVSFADAARRTAQMTYDLAFPGV--AVFFSWPSQNQLLAYTLDE 89
Query: 476 SVASDNDNHRDLQKFI-ESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKRKPIVYX 534
H L+ F+ E L S N++++ HSMG R ++ + +
Sbjct: 90 QSIEWAQPH--LEHFLRELLNNSSADNIYLIAHSMGNRALTKTLVSLAGS---------- 137
Query: 535 XXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQPEQDKSLKYINKINLTNLIFLNPDYEI 594
P+ + +K +I PD +
Sbjct: 138 --------------------------------DPQAVQRIK--------EVILAAPDVDA 157
Query: 595 NTFKNDYGE-LRNYCPRITVYADHRDEAIKMAFRITQKRNLGNNIFGIVDDDGGMLDVDI 653
+ F + L +T+YA D A+ M+ ++ G + IV +G ++
Sbjct: 158 DVFVDQIAPGLAKLGAPVTLYASSSDRALMMSKKVHGGARAGESGDHIVVVNG----IET 213
Query: 654 IDTGDLDSNMSERHHSFFNINRLMVDDLHDLIVTGKRAMDRTSRLKSV 701
+D + D+++ HS++ R ++ D+ L+ RA R LKS+
Sbjct: 214 VDASNADTDLI--GHSYYGDRRSILADMFYLVRNDTRAAQRFG-LKSI 258
>gi|13470449|ref|NP_102017.1| hypothetical protein mll0158
[Mesorhizobium loti MAFF303099]
dbj|BAB47803.1| mll0158 [Mesorhizobium loti MAFF303099]
Length = 308
Score = 46.6 bits (109), Expect = 0.003
Identities = 25/95 (26%), Positives = 48/95 (50%), Gaps = 4/95 (4%)
Query: 417 ISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHS 476
I+ + + ++F+HG+++ D + R Q +P P +F+W SS Y +
Sbjct: 47 IAMRGDRALVFVHGFNNGFDDGVYRLTQIAHDTKYPG--TPVLFSWASSAKTTGY--IYD 102
Query: 477 VASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGT 511
S N DL+ + L ++ ++++ I+ HSMGT
Sbjct: 103 KDSANAARDDLEATLRMLAKTRVKSIDIIAHSMGT 137
>gi|17934312|ref|NP_531102.1| hypothetical protein Atu0397
[Agrobacterium tumefaciens str. C58]
gb|AAL41418.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58]
pir||AD2625 conserved hypothetical protein Atu0397 [imported] - Agrobacterium
tumefaciens (strain C58, Dupont)
Length = 440
Score = 45.1 bits (105), Expect = 0.008
Identities = 35/120 (29%), Positives = 60/120 (49%), Gaps = 13/120 (10%)
Query: 396 TVSIEIGRQENVQKLIVDNNWISSQSNEG---ILFIHGYDHDLKDALKRFAQFLALGHFP 452
TV++E R E K +WIS + ++F+HG+++ ++++ RFAQ +
Sbjct: 101 TVAVEPIRSEAETK-----SWISKHIKKDRRVLVFVHGFNNRYEESVYRFAQIVY--DSG 153
Query: 453 NYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQ-SGIRNLHIMCHSMGT 511
+ + P VF WPS S Y + S N + L++ + L + I + +M HSMGT
Sbjct: 154 SDVVPVVFTWPSRASIFDY--NYDKESTNYSRDALEEMLTRLARDKSIGEVTVMAHSMGT 211
>gi|10444251|gb|AAG17817.1| haem lyase [Naegleria gruberi]
ref|NP_066539.1| haem lyase [Naegleria gruberi]
Length = 474
Score = 45.1 bits (105), Expect = 0.008
Identities = 33/113 (29%), Positives = 58/113 (51%), Gaps = 18/113 (15%)
Query: 93 ILWFLFFMIVYGV-----YLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAII--- 144
I WF+FF+ ++ + Y++L I+ I++F L F+ F+ + F II
Sbjct: 178 IYWFIFFLFIHKINKINTYIYLYIF--------IIDFFLFFFLKFNNFQSIHIFKIIYIN 229
Query: 145 VRIVFLEVYSTLYLFFIFNLYYEKQYYIPL-ISLGSGVLVFSIICFF-VYPYI 195
++I F+ +Y L F++N+ K+Y I I L + + I+ FF +Y YI
Sbjct: 230 LQIDFIIIYIIFSLIFLYNIQKNKKYNIKFNIVLVVNISIAIIVFFFTIYIYI 282
>gi|15155313|gb|AAK86212.1| AGR_C_699p [Agrobacterium tumefaciens
str. C58]
pir||C97407 hypothetical protein AGR_C_699 [imported] - Agrobacterium
tumefaciens (strain C58, Cereon)
ref|NP_353427.1| hypothetical protein AGR_C_699 [Agrobacterium tumefaciens str. C58]
Length = 441
Score = 45.1 bits (105), Expect = 0.008
Identities = 35/120 (29%), Positives = 60/120 (49%), Gaps = 13/120 (10%)
Query: 396 TVSIEIGRQENVQKLIVDNNWISSQSNEG---ILFIHGYDHDLKDALKRFAQFLALGHFP 452
TV++E R E K +WIS + ++F+HG+++ ++++ RFAQ +
Sbjct: 102 TVAVEPIRSEAETK-----SWISKHIKKDRRVLVFVHGFNNRYEESVYRFAQIVY--DSG 154
Query: 453 NYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQ-SGIRNLHIMCHSMGT 511
+ + P VF WPS S Y + S N + L++ + L + I + +M HSMGT
Sbjct: 155 SDVVPVVFTWPSRASIFDY--NYDKESTNYSRDALEEMLTRLARDKSIGEVTVMAHSMGT 212
>gi|16262766|ref|NP_435559.1| hypothetical protein SMa0599
[Sinorhizobium meliloti 1021]
gb|AAK64971.1| hypothetical protein SMa0599 [Sinorhizobium meliloti 1021]
pir||A95301 hypothetical protein SMa0599 [imported] - Sinorhizobium meliloti
(strain 1021) magaplasmid pSymA
Length = 488
Score = 44.7 bits (104), Expect = 0.010
Identities = 29/110 (26%), Positives = 53/110 (47%), Gaps = 6/110 (5%)
Query: 414 NNWISSQSNEGIL-FIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYW 472
N +S + +L F+HG+++ +D++ RFAQ + + P + WPS S L Y
Sbjct: 178 NTTVSKSPDRSVLVFVHGFNNRFEDSVYRFAQIVHDSGIKS--APVLVTWPSRGSLLAY- 234
Query: 473 CAHSVASDNDNHRDLQKFIESLGQSG-IRNLHIMCHSMGTRFFLRSFSKI 521
+ S N L+ + L + G ++ + I+ HSMG L + ++
Sbjct: 235 -GYDRESTNYTRNALESLFQYLAEDGEVKEVSILAHSMGNWLTLEALRQM 283
>gi|52008217|ref|ZP_00335594.1| COG0668: Small-conductance
mechanosensitive channel [Thiobacillus denitrificans
ATCC 25259]
Length = 221
Score = 44.3 bits (103), Expect = 0.013
Identities = 34/125 (27%), Positives = 63/125 (50%), Gaps = 13/125 (10%)
Query: 95 WFLFFMIVYGVYLFLQIYHFERIDGY--ILEFILMGFISFSLGLLVPRFA--IIVRIVFL 150
W ++ GV L + HF+ + I F+ G + SL ++ R A I++ +V +
Sbjct: 37 WLFANLVRTGVTKLLDLLHFDSLAEKTGIEAFLKQGNLDISLSRILARLAYWIVIFVVVV 96
Query: 151 EVYSTLYLFFIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPYIMRCIFS------ITG 204
V ++L L + L+ + +YIP I + VLVF ++ V +I R +F+ + G
Sbjct: 97 TVANSLGLHMVAELFNQVVFYIPNIIVAILVLVFGVL---VARFINRLVFAYLNNIGVQG 153
Query: 205 AITLN 209
A+T++
Sbjct: 154 ALTIS 158
>gi|23347117|gb|AAN29277.1| lipoprotein, putative [Brucella suis
1330]
ref|NP_697362.1| lipoprotein, putative [Brucella suis 1330]
Length = 413
Score = 43.5 bits (101), Expect = 0.022
Identities = 37/152 (24%), Positives = 73/152 (47%), Gaps = 15/152 (9%)
Query: 367 TITSIVVSL----DKKMDALAISGFKPLHPKNKTVSIEIGRQENVQKLIVDNNWISS--- 419
++T I+VS+ ++K+ + P +P +I + E + +WI++
Sbjct: 70 SLTDIIVSIPPDRNRKVGEVQWPKRLPPNPLKDFATIAV---EPLHGDAAAQHWINTHLP 126
Query: 420 QSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVAS 479
++ ++FIHG+++ +D++ RFAQ + + P +F WPS S Y + S
Sbjct: 127 RTKRVMIFIHGFNNTFEDSVYRFAQIVHDSGAD--VAPIIFTWPSRASVFDY--NYDKES 182
Query: 480 DNDNHRDLQKFIESLGQS-GIRNLHIMCHSMG 510
N + L+ + + +R++ IM HSMG
Sbjct: 183 TNYSRDALEHVLRVVANDPQVRDVTIMAHSMG 214
>gi|36959002|gb|AAQ87427.1| Hypothetical protein RNGR00301
[Rhizobium sp. NGR234]
Length = 407
Score = 43.5 bits (101), Expect = 0.022
Identities = 32/136 (23%), Positives = 62/136 (45%), Gaps = 11/136 (8%)
Query: 392 PKNKTVSIEIGRQENVQKLIVDNNWISSQSNEG-----ILFIHGYDHDLKDALKRFAQFL 446
P N + + E + + + W+S+ + ++FIHG+++ +DA+ RFAQ
Sbjct: 74 PPNPSTDFATLKAEQIDRTAAEQ-WLSNSVRKSPDRSVLVFIHGFNNHFEDAVFRFAQI- 131
Query: 447 ALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQS-GIRNLHIM 505
+ + P + WPS S L Y + S N +++ + L + ++ + I+
Sbjct: 132 -VHDSGAHSTPVLATWPSRGSLLAY--GYDRESTNYTRNAVERLFQYLARDPEVKEVAIL 188
Query: 506 CHSMGTRFFLRSFSKI 521
HSMG L S ++
Sbjct: 189 AHSMGNWLALESLRQM 204
>gi|23025044|ref|ZP_00064224.1| COG2211: Na+/melibiose symporter and
related transporters [Leuconostoc mesenteroides subsp.
mesenteroides ATCC 8293]
Length = 470
Score = 42.7 bits (99), Expect = 0.037
Identities = 34/148 (22%), Positives = 68/148 (44%), Gaps = 14/148 (9%)
Query: 93 ILW----FLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLL-VPRFAIIVRI 147
+LW ++F+ I + +++F+ + G++ EF ++G IS G++ VP F + ++
Sbjct: 250 LLWLALSYMFYAIANVATTGVLLFYFKFVIGHVTEFAMVGVISMITGIIAVPLFPFLAKV 309
Query: 148 VFLEVYSTLYLFFIFNLYYEKQYYIPLISLGSGVLVFSI-ICFFVYPYIMRCIFSITGAI 206
+ T F + Y+ GS + + + + FF +P + S+ I
Sbjct: 310 M------TRRYVFASGIALMVLAYVMFTIAGSNLWIVGLGLVFFYFPQQL-IFLSVLMTI 362
Query: 207 TLNVDKDQWVRGESNR-FTIKQKNLFQK 233
T +V+ QW G+ N T+ + L K
Sbjct: 363 TDSVEYGQWKNGQRNEAVTLSLRPLLDK 390
>gi|16264069|ref|NP_436861.1| hypothetical protein SMb20335
[Sinorhizobium meliloti 1021]
pir||A95882 conserved hypothetical protein [imported] - Sinorhizobium meliloti
(strain 1021) magaplasmid pSymB
emb|CAC48721.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti 1021]
Length = 382
Score = 42.4 bits (98), Expect = 0.049
Identities = 40/237 (16%), Positives = 85/237 (34%), Gaps = 57/237 (24%)
Query: 392 PKNKTVSIEIGRQENVQKLIVDNNWISSQSNEG--ILFIHGYDHDLKDALKRFAQFLALG 449
P N + R + W + + +G +LF+HG+++ +D + R AQ +
Sbjct: 68 PPNPETDFAVTRVREMASEDEARTWFRAHNKDGHVLLFVHGFNNRYEDGVFRLAQI--VH 125
Query: 450 HFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQS-GIRNLHIMCHS 508
P +F WPS Y + S N + L+ + +L + ++++ I+ HS
Sbjct: 126 DSGAQATPMLFTWPSRARVFDY--NYDKESTNYSRTALEDTLRTLANAPNVKDVTILAHS 183
Query: 509 MGTRFFLRSFSKIKKAFAKRKPIVYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFVQQP 568
MGT + + ++ K P
Sbjct: 184 MGTWLTMEALRQMGIRDGKIAP-------------------------------------- 205
Query: 569 EQDKSLKYINKINLTNLIFLNPDYEINTFKNDYGELRNYCPRITVYADHRDEAIKMA 625
+ N+I +PD +++ F + ++ + P+ T++ D A+ ++
Sbjct: 206 ------------KIENVILASPDIDLDVFAKQWTDMGDERPKFTIFVSQDDRALALS 250
>gi|49475541|ref|YP_033582.1| hypothetical protein BH07710
[Bartonella henselae str. Houston-1]
emb|CAF27571.1| hypothetical protein [Bartonella henselae str. Houston-1]
Length = 356
Score = 42.0 bits (97), Expect = 0.064
Identities = 38/151 (25%), Positives = 67/151 (44%), Gaps = 10/151 (6%)
Query: 385 SGFKPLHPKNKTVSIEIGRQENV----QKLIVDNNWISSQSNEGILFIHGYDHDLKDALK 440
+ +KP H K ++ + + +N Q+L D + E LFIHGY+++ D
Sbjct: 66 NAYKPTHDKY-FAAVALQKYDNKEQFKQQLNADLEKKTPGKREIFLFIHGYNNNFADGTF 124
Query: 441 RFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESLGQSGIR 500
R AQF + + ++WPS+ S LY + S N L + + + ++
Sbjct: 125 RTAQFTY--DYSLNVVTVHYSWPSAGSVPLY--IYDRDSANFARDGLIELLTLISETKAD 180
Query: 501 NLHIMCHSMGTRFFLRSFSKIKKAFAKRKPI 531
+ ++ HSMG + +F + K KPI
Sbjct: 181 QISVIAHSMGNFVIMEAFRTLALQ-GKYKPI 210
>gi|14133569|gb|AAK54067.1| galactose permease [Lactobacillus
brevis]
Length = 474
Score = 42.0 bits (97), Expect = 0.064
Identities = 37/140 (26%), Positives = 66/140 (46%), Gaps = 13/140 (9%)
Query: 86 NEKRGFSILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLL-VPRFAII 144
N++ + L +L F + Y V L +Y+F+ + G ++ ++G I+ LG++ VP F ++
Sbjct: 245 NDQLMWLALSYLLFALGYVVTNSLLLYNFQYVLGAATKYSMVGGITTVLGIISVPLFPVL 304
Query: 145 VRIVFLEVYSTLYLFFIFNLYYEKQYYIPLISLGSGV---LVFSIICFFVYPYIMRCIFS 201
V+ + T ++ + Y+ I G+ V LV + FF YP I +
Sbjct: 305 VKAI------TRKGIYVGGIIMMLVGYLLFIFAGTSVVMTLVADAVFFFPYPMI---FLA 355
Query: 202 ITGAITLNVDKDQWVRGESN 221
IT +V+ QW G N
Sbjct: 356 ALMTITDSVEYGQWKNGVRN 375
>gi|57242518|ref|ZP_00370456.1| competence locus E (comE3)
[Campylobacter upsaliensis RM3195]
gb|EAL53586.1| competence locus E (comE3) [Campylobacter upsaliensis RM3195]
Length = 417
Score = 41.6 bits (96), Expect = 0.083
Identities = 25/98 (25%), Positives = 51/98 (51%), Gaps = 4/98 (4%)
Query: 98 FFMIVYGV-YLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVFLEVYSTL 156
FF + GV Y+FL ++HF + G + IL+ +F L +++P ++ + + +
Sbjct: 280 FFFSILGVFYIFLYLHHFAKFFGIVTNLILINLWTF-LAMIIP-VLYFFPLISYQQFLAI 337
Query: 157 YLFFIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPY 194
+L IF L+Y ++ I G +L + ++ FF + +
Sbjct: 338 FLSLIFVLFYPLALFLHFIGAGF-LLDYFLLEFFAFKF 374
>gi|15965400|ref|NP_385753.1| hypothetical protein SMc00955
[Sinorhizobium meliloti 1021]
emb|CAC46226.1| CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
Length = 391
Score = 41.2 bits (95), Expect = 0.11
Identities = 27/98 (27%), Positives = 49/98 (49%), Gaps = 5/98 (5%)
Query: 425 ILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNH 484
++FIHG+++ +DA+ RFAQ + + P + WPS S L Y + S N
Sbjct: 116 LVFIHGFNNHFEDAVFRFAQIIHDSGARSV--PVLATWPSRGSLLAY--GYDRESTNYTR 171
Query: 485 RDLQKFIESLGQS-GIRNLHIMCHSMGTRFFLRSFSKI 521
+++ + L + ++ + I+ HSMG L S ++
Sbjct: 172 NAVERLFQYLARDPEVKEVSILAHSMGNWLALESLRQM 209
>gi|34534287|dbj|BAC86958.1| unnamed protein product [Homo sapiens]
Length = 130
Score = 41.2 bits (95), Expect = 0.11
Identities = 33/127 (25%), Positives = 59/127 (45%), Gaps = 15/127 (11%)
Query: 97 LFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVFLEVYSTL 156
L+ I +Y+F+ IY F I Y+ +I + +I F + F I +F+ +Y L
Sbjct: 5 LYVYIYMFMYIFIFIYIFIYIYIYVYVYIYL-YIYFYI------FTYIYLYIFIYIYICL 57
Query: 157 YLFFIFNLYYEKQYYIPL-ISLGSGVLVFSIICFFVYPYI-------MRCIFSITGAITL 208
Y++ LY YI L I + + ++ IC +++ Y+ MR + +I G I L
Sbjct: 58 YIYLYIYLYIFTYIYIYLHIFIYLHIFIYICICLYIFIYVFIYINIYMRGLQNIHGNIEL 117
Query: 209 NVDKDQW 215
++
Sbjct: 118 KATNKKY 124
>gi|49474175|ref|YP_032217.1| hypothetical protein BQ05560
[Bartonella quintana str. Toulouse]
emb|CAF26050.1| hypothetical protein [Bartonella quintana str. Toulouse]
Length = 356
Score = 40.8 bits (94), Expect = 0.14
Identities = 40/181 (22%), Positives = 77/181 (42%), Gaps = 17/181 (9%)
Query: 355 DKKYYRHIDDDKTITSIVVSLDKKMDALAISGFKPLHPKNKTVSIEIGRQENVQKLIVDN 414
+K YY +D I + + I+ + P H K ++ + + +N ++
Sbjct: 43 NKVYYNRVD-------IGIPQQHVKGFVEINTYNPTHDKY-FAAVALQKYDNREQFKQQL 94
Query: 415 NWISSQSNEG----ILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLL 470
N + ++G LFIHGY+++ D R AQF + + ++WPS+ S L
Sbjct: 95 NAALEKKSKGKREIFLFIHGYNNNFADGTFRTAQFTY--DYSLNVVAVHYSWPSAGSIPL 152
Query: 471 YWCAHSVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKAFAKRKP 530
Y + S N + + + + ++ + ++ HSMG + +F + K KP
Sbjct: 153 Y--IYDRDSANFARDGMIELLTLISETKADRISVIAHSMGNFVIMEAFRTLALQ-GKYKP 209
Query: 531 I 531
I
Sbjct: 210 I 210
>gi|52009316|ref|ZP_00336680.1| COG4642: Uncharacterized protein
conserved in bacteria [Silicibacter sp. TM1040]
Length = 500
Score = 40.0 bits (92), Expect = 0.24
Identities = 24/62 (38%), Positives = 35/62 (55%), Gaps = 13/62 (20%)
Query: 214 QWVRGESNRFTIKQKNLFQKPSTCIYEGPLLDGRPHGIG--TWMD-TSYQGELLTGFWED 270
QWV GE IK K + + P+ +YEG G+P G+G T+ D +Y+GE W+D
Sbjct: 69 QWVEGE-----IKGKGVARFPNGSVYEGEFSKGKPEGLGKITFADGGTYEGE-----WQD 118
Query: 271 GI 272
G+
Sbjct: 119 GV 120
>gi|17987877|ref|NP_540511.1| hypothetical protein BMEI1594
[Brucella melitensis 16M]
gb|AAL52775.1| Hypothetical Protein [Brucella melitensis 16M]
pir||AD3451 hypothetical protein BMEI1594 [imported] - Brucella melitensis
(strain 16M)
Length = 402
Score = 40.0 bits (92), Expect = 0.24
Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 8/100 (8%)
Query: 415 NWISS---QSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLY 471
+WI++ ++ ++FI G+++ +D++ RFAQ + + P +F WPS S Y
Sbjct: 108 HWINTHLPRTKRVMIFIRGFNNTFEDSVYRFAQIVHDSGAD--VAPIIFTWPSRASVFDY 165
Query: 472 WCAHSVASDNDNHRDLQKFIESLGQS-GIRNLHIMCHSMG 510
+ S N + L+ + + +R++ IM HSMG
Sbjct: 166 --NYDKESTNYSRDALEHVLRVVANDPQVRDVTIMAHSMG 203
>gi|45915397|ref|ZP_00197078.1| COG4782: Uncharacterized protein
conserved in bacteria [Mesorhizobium sp. BNC1]
Length = 388
Score = 40.0 bits (92), Expect = 0.24
Identities = 24/95 (25%), Positives = 47/95 (49%), Gaps = 6/95 (6%)
Query: 417 ISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLY-WCAH 475
I + ++FIHG++ +A+ R Q F + +F+WPS+ + Y + +
Sbjct: 128 IEAHDGRALVFIHGFNTGFDNAVYRMTQIAHDAGFKGTL--ILFSWPSAARIVDYIYDNN 185
Query: 476 SVASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMG 510
S + D DL + + +SG + ++++ HSMG
Sbjct: 186 SATASRDALEDLLRLV---ARSGAQRINVVAHSMG 217
>gi|49647997|emb|CAG82450.1| unnamed protein product [Yarrowia
lipolytica CLIB99]
ref|XP_502130.1| hypothetical protein [Yarrowia lipolytica]
Length = 281
Score = 39.7 bits (91), Expect = 0.32
Identities = 27/113 (23%), Positives = 53/113 (46%), Gaps = 9/113 (7%)
Query: 91 FSILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVFL 150
F IL+F++F+ + + F + F YI FI + FI + + + F I +F+
Sbjct: 18 FHILYFIYFIFSFSYFFFFFFFFF-----YIFIFIFI-FIFIFIFIFIFIFIFIFIFIFI 71
Query: 151 EVYSTLYLF---FIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPYIMRCIF 200
++ +++F FIF ++Y + ++ +L F + YI+ IF
Sbjct: 72 FIFIFIFIFIFIFIFYIFYILYFIFYILYFIFYILYFIFYILYFIFYILYFIF 124
>gi|23612229|ref|NP_703809.1| hypothetical protein [Plasmodium
falciparum 3D7]
emb|CAG25387.1| hypothetical protein [Plasmodium falciparum 3D7]
Length = 1096
Score = 39.3 bits (90), Expect = 0.41
Identities = 34/126 (26%), Positives = 59/126 (45%), Gaps = 8/126 (6%)
Query: 54 SMETIERATELEFTYWIMKKRQWSRIQRGTFTNEKRGF-SILWFLFFMIV-YGVYLFLQI 111
S ++ E + E + I+ + + I RG F K SI +FF+ V Y + +
Sbjct: 226 SFDSYENGNDSEIYFLILHRFIFFYILRGIFYKYKEFLLSISSCIFFLYVFYKIIKIIWF 285
Query: 112 YHFERIDGYILEFILMGFISFSLGLLVPRF-----AIIVRIVFLEVYSTLYLFFIFNLYY 166
+F +I E F +F L + R+ I++ I+ + ++S++ FFIFN+Y
Sbjct: 286 QYFYKIYRKYYEKYSFLF-TFDLSFIYKRYLNDILTILIIILAIIIFSSISFFFIFNIYG 344
Query: 167 EKQYYI 172
E Y I
Sbjct: 345 ESVYII 350
>gi|15899328|ref|NP_343933.1| hypothetical protein SSO2601
[Sulfolobus solfataricus P2]
gb|AAK42723.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
pir||D90433 conserved hypothetical protein [imported] - Sulfolobus solfataricus
Length = 453
Score = 39.3 bits (90), Expect = 0.41
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 90 GFSILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVF 149
G S++W +F M+ YG+ + + I YIL GF++ GLL+ + + F
Sbjct: 187 GTSLIWIVFIMLPYGL-----TFKYVTIPTYILPIFPFGFLNIE-GLLISLLYTGLSVFF 240
Query: 150 LEVYSTLYLFFIFNLYYEKQYYIPLIS 176
S +L F N Y +Y I L S
Sbjct: 241 AYKQSLKFLSFRLNSQYSTKYSIKLRS 267
>gi|29346015|ref|NP_809518.1| putative polysaccharide export protein
[Bacteroides thetaiotaomicron VPI-5482]
gb|AAO75712.1| putative polysaccharide export protein [Bacteroides
thetaiotaomicron VPI-5482]
Length = 481
Score = 39.3 bits (90), Expect = 0.41
Identities = 35/131 (26%), Positives = 59/131 (44%), Gaps = 27/131 (20%)
Query: 93 ILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVFLEV 152
+L FL F I Y++ +G + ++M + FS G P A + + L+
Sbjct: 96 VLEFLIFPIC--------TYYYHNEEGVLPSMVIMLSVIFSAGQN-PAIAYFQKEIKLKK 146
Query: 153 YSTLYLF--------FIFNLYYEKQYYIPLISLGSGVLVFSIICFFVYPYIMRCIFSITG 204
Y L +F + ++YY K Y+ +I+L S L I FFVYPY ++ +
Sbjct: 147 YFYLKVFPKILSFVLVVVSVYYMKSYWGLIIALLSEYLFRLIYSFFVYPYKVKFV----- 201
Query: 205 AITLNVDKDQW 215
+DKD++
Sbjct: 202 -----IDKDKF 207
>gi|56680431|gb|AAV97097.1| lipoprotein, putative [Silicibacter
pomeroyi DSS-3]
ref|YP_169071.1| lipoprotein, putative [Silicibacter pomeroyi DSS-3]
Length = 367
Score = 38.9 bits (89), Expect = 0.54
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 4/99 (4%)
Query: 426 LFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDNDNHR 485
LF+HG++ + R AQ P + V++WPS L Y A+ S
Sbjct: 125 LFVHGFNSTQTETAYRAAQLAHDIEMPGSL--MVYSWPSKGHALGY--AYDADSTVFARD 180
Query: 486 DLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIKKA 524
L+ + LG+ I L ++ HSMG+ + + +I+ A
Sbjct: 181 GLETVLRRLGEQRIDRLAVVAHSMGSFLLMEALRQIEIA 219
>gi|36958719|gb|AAQ87187.1| Hypothetical protein RNGR00163
[Rhizobium sp. NGR234]
Length = 333
Score = 38.9 bits (89), Expect = 0.54
Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 6/92 (6%)
Query: 423 EGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHSVASDND 482
E ++FIHGY++ DA + + N WP+ S L++ +++ ++
Sbjct: 58 EVVVFIHGYNNSFDDAARTTGEICR--SLQNQFVCIALTWPAGGSGGLFF-GYNIDRESS 114
Query: 483 NH--RDLQKFIESLGQS-GIRNLHIMCHSMGT 511
DL+K I + ++ G+ LH++ HS GT
Sbjct: 115 EFSVADLKKAIRIIAEAKGVERLHLLAHSRGT 146
>gi|17933991|ref|NP_530781.1| hypothetical protein Atu0072
[Agrobacterium tumefaciens str. C58]
gb|AAL41097.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58]
gb|AAK85891.1| AGR_C_108p [Agrobacterium tumefaciens str. C58]
pir||AC2585 conserved hypothetical protein Atu0072 [imported] - Agrobacterium
tumefaciens (strain C58, Dupont)
pir||B97367 hypothetical protein AGR_C_108 [imported] - Agrobacterium
tumefaciens (strain C58, Cereon)
ref|NP_353106.1| hypothetical protein AGR_C_108 [Agrobacterium tumefaciens str. C58]
Length = 411
Score = 38.5 bits (88), Expect = 0.71
Identities = 26/106 (24%), Positives = 49/106 (45%), Gaps = 3/106 (2%)
Query: 417 ISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIKPFVFNWPSSTSPLLYWCAHS 476
IS + LF+HGY++ ++AL R AQ A + P VF+WPS + L + A
Sbjct: 157 ISRSGKQIALFVHGYNYSYQEALFRAAQMAADANMDGV--PLVFSWPSQAN-LTGYVADK 213
Query: 477 VASDNDNHRDLQKFIESLGQSGIRNLHIMCHSMGTRFFLRSFSKIK 522
++ + Q+ +++ + HSMG + + +++
Sbjct: 214 ESATYSRDALATLLTDLTRQTPRKSIVVFGHSMGGWLVMEALRQLR 259
>gi|15642876|ref|NP_227917.1| hypothetical protein TM0101
[Thermotoga maritima MSB8]
gb|AAD35195.1| hypothetical protein TM0101 [Thermotoga maritima MSB8]
pir||E72418 hypothetical protein TM0101 - Thermotoga maritima (strain MSB8)
Length = 622
Score = 38.5 bits (88), Expect = 0.71
Identities = 28/101 (27%), Positives = 47/101 (45%), Gaps = 5/101 (4%)
Query: 84 FTNEKRGFSILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAI 143
F N S W++ F +VYG+ L + + + I L+FI+ I F
Sbjct: 246 FENTSPSISNFWWIRF-VVYGILLVIFLRKYRTI----LQFIIAAEILFIWVTKSLYLNT 300
Query: 144 IVRIVFLEVYSTLYLFFIFNLYYEKQYYIPLISLGSGVLVF 184
+ ++F V +++FF F L ++Y PL+SL L+F
Sbjct: 301 VENMIFSTVVVFMFIFFNFILLVRRRYLYPLLSLIFVFLLF 341
>gi|49146488|ref|YP_026062.1| NADH dehydrogenase subunit 6
[Aleurodicus dugesii]
gb|AAS77752.1| NADH dehydrogenase subunit 6 [Aleurodicus dugesii]
Length = 138
Score = 38.5 bits (88), Expect = 0.71
Identities = 31/118 (26%), Positives = 57/118 (48%), Gaps = 12/118 (10%)
Query: 91 FSILWFLFFMIVYGVYLFLQIYHFERIDGYILEFILMGFISFSLGLLVPRFAIIVRIVFL 150
F+ L L F+I Y +++ + IY + Y ++L F+ F GL++ F + IV L
Sbjct: 2 FNPLIMLLFIIFYLIFICIYIYFLVKTCFY--SYVL--FLLFMSGLMII-FMYLCCIVIL 56
Query: 151 EVYSTLYLFFIFNLYYEKQYYIPLISLGSGVLVF------SIICFFVYPYIMRCIFSI 202
E + + FF F ++ YY +++ + L+F ++ F YP + +F I
Sbjct: 57 ESFKFKFFFFFF-FFFLNNYYYMMLNYDNFNLIFYYWEFYNLNLLFYYPLNLMYLFLI 113
>gi|27383091|ref|NP_774620.1| hypothetical protein blr7980
[Bradyrhizobium japonicum USDA 110]
dbj|BAC53245.1| blr7980 [Bradyrhizobium japonicum USDA 110]
Length = 342
Score = 38.1 bits (87), Expect = 0.92
Identities = 34/122 (27%), Positives = 53/122 (42%), Gaps = 13/122 (10%)
Query: 405 ENVQKLIVDNNWISSQSNEGILFIHGYDHDLKDALKRFAQFLALGHFPNYIK----PFVF 460
E VQ I D + ++++HG+ + A L H + IK VF
Sbjct: 114 EPVQAEIGDLLAQGGAGGDVLIYVHGFKQTFETAA------LDAAHLSDGIKFRGRTMVF 167
Query: 461 NWPSSTSPLLYWCAHSVASDNDNHRDLQKFIESL-GQSGIRNLHIMCHSMGTRFFLRSFS 519
+WPS L+ A+ S + D ++ + SL SG +HI+ HSMGT L S
Sbjct: 168 SWPSKAG--LFDYAYDRDSAMWSRDDFERVLSSLVSTSGGGRVHIVAHSMGTMLTLESLR 225
Query: 520 KI 521
++
Sbjct: 226 QL 227
Database: /home/dicty1/resource/WorkingDBs//nr-clean
Posted date: Mar 10, 2005 12:10 PM
Number of letters in database: 788,375,511
Number of sequences in database: 2,329,665
Lambda K H
0.325 0.142 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,089,026,723
Number of Sequences: 2329665
Number of extensions: 48088145
Number of successful extensions: 149063
Number of sequences better than 1.0: 47
Number of HSP's better than 1.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 148925
Number of HSP's gapped (non-prelim): 123
length of query: 717
length of database: 788,375,511
effective HSP length: 135
effective length of query: 582
effective length of database: 473,870,736
effective search space: 275792768352
effective search space used: 275792768352
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 87 (38.1 bits)
BLASTP 2.2.1 [Jul-12-2001]