BLASTP 2.2.23+

Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

RID: UMJ1YAV0012

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           10,635,453 sequences; 3,627,481,469 total letters

Query= Sel3.Plasmodium_vivax.1 Located on Plasmodium_vivax_SaI-1|ctg_7202|2005-09-01|ds-DNA [translate(1)]
Length=322

                                                                    Score   E
Sequences producing significant alignments:                         (Bits)  Value

emb|CAX64123.2| Sel3 protein [Plasmodium falciparum 3D7]            236     1e-62
ref|XP_001349374.1| hypothetical protein [Plasmodium falcipar...    212     2e-55
ref|XP_001348224.1| DNA mismatch repair protein, putative [Pl...    33.1    0.14
gb|AAC47438.1| member of var gene family; implicated in antig...    30.0    1.2
ref|XP_001348751.1| conserved Plasmodium protein, unknown fun...    29.3    2.5
emb|CAX64281.1| MORN repeat protein, putative [Plasmodium fal...    28.9    3.1
ref|XP_001349792.1| hypothetical protein [Plasmodium falcipar...    28.9    3.1
gb|AAO67398.1| erythrocyte membrane protein 1 [Plasmodium fal...    27.7    6.8
gb|AAB06961.1| erythrocyte membrane protein 1 [Plasmodium fal...    27.3    7.7
ref|XP_966219.1| ATP-dependent DEAD box helicase [Plasmodium ...    26.9    9.9

ALIGNMENTS

>emb|CAX64123.2| Sel3 protein [Plasmodium falciparum 3D7]
Length=351

 Score = 236 bits (601), Expect = 1e-62, Method: Compositional matrix adjust.
 Identities = 143/326 (43%), Positives = 206/326 (63%), Gaps = 9/326 (2%)

Query  1    MVLNKVYLLTILVLFYVSTLCVEAGXSKKLHIKLPNEDDDYLGKLINISKKITQYARNNK  60
            M+L KVY+L IL+L  + +  V++G SK+LHIKLP+EDDDYL KLI +   I  Y+ NNK
Sbjct  1    MILKKVYILIILILLSILSRTVDSGUSKQLHIKLPDEDDDYLAKLIGVFNDICTYSINNK  60

Query  61   GKIAKVLATSALSAYSLNWVYQSGVTLQRDPHYSLFVPSNSYINSAIKRVKKNYQV---K  117
             KIAK+L+TSA+S Y++  +Y SG+T +++PHYS F+PS  YI   I            K
Sbjct  61   EKIAKILSTSAVSVYTITSLYNSGITFKKNPHYSFFLPSQKYILKIINNNVNIQNNDIKK  120

Query  118  KYTFKEKKIFERNFHLNNAISQMGQMYVVNNLVNFLNFLPYKWRTKCTYNFCKYKEFEN-  176
                K+  IFER F   + I QM + Y++NN +NF+NFLPYK + +  YNF KYKEF+
Sbjct  121  IDNLKDIPIFERTFTNKHNI-QMSKTYILNNFINFINFLPYKLKKESLYNFSKYKEFDGL  179

Query  177  --VENFFDYIKMVKHEGPILLFQGKLKKQYWIHLPLKYEILKPGDGSLCT--LTFTPLHK  232
              VEN F  ++++ HE  I+++Q K KK YWIHLP KY I+K  D +  +  L F PL+K
Sbjct  180  KYVENPFTQVQVLNHEDNIIVYQAKTKKHYWIHLPFKYNIIKISDDNTTSYILMFIPLYK  239

Query  233  YYSDYTVEIKLVKEKENNNVKLITSVKCNSKNGEEGNSFYINVVKNIAMLLTYDIFEGIN  292
            YYS+Y +++K+     N NV   + +K       + NSF+ +++KNI+  +TYDI   I+
Sbjct  240  YYSNYIIQMKITSNNNNQNVTFSSCIKQEKTEHIQNNSFHYDIIKNISKHITYDIINAID  299

Query  293  NNIHVVYRRNASCGKSMFTRSNAILK  318
            NNI+++Y RN   GK  F  +   L+
Sbjct  300  NNINILYMRNVKPGKYNFINTTNALQ  325

>ref|XP_001349374.1| hypothetical protein [Plasmodium falciparum 3D7]
Length=331

 Score = 212 bits (539), Expect = 2e-55, Method: Compositional matrix adjust.
 Identities = 125/291 (42%), Positives = 179/291 (61%), Gaps = 9/291 (3%)

Query  36   NEDDDYLGKLINISKKITQYARNNKGKIAKVLATSALSAYSLNWVYQSGVTLQRDPHYSL  95
            NEDDDYL KLI +   I  Y+ NNK KIAK+L+TSA+S Y++  +Y SG+T +++PHYS
Sbjct  16   NEDDDYLAKLIGVFNDICTYSINNKEKIAKILSTSAVSVYTITSLYNSGITFKKNPHYSF  75

Query  96   FVPSNSYINSAIKRVKKNYQV---KKYTFKEKKIFERNFHLNNAISQMGQMYVVNNLVNF  152
            F+PS  YI   I            K    K+  IFER F   + I QM + Y++NN +NF
Sbjct  76   FLPSQKYILKIINNNVNIQNNDIKKIDNLKDIPIFERTFTNKHNI-QMSKTYILNNFINF  134

Query  153  LNFLPYKWRTKCTYNFCKYKEFEN---VENFFDYIKMVKHEGPILLFQGKLKKQYWIHLP  209
            +NFLPYK + +  YNF KYKEF+    VEN F  ++++ HE  I+++Q K KK YWIHLP
Sbjct  135  INFLPYKLKKESLYNFSKYKEFDGLKYVENPFTQVQVLNHEDNIIVYQAKTKKHYWIHLP  194

Query  210  LKYEILKPGDGSLCT--LTFTPLHKYYSDYTVEIKLVKEKENNNVKLITSVKCNSKNGEE  267
             KY I+K  D +  +  L F PL+KYYS+Y +++K+     N NV   + +K       +
Sbjct  195  FKYNIIKISDDNTTSYILMFIPLYKYYSNYIIQMKITSNNNNQNVTFSSCIKQEKTEHIQ  254

Query  268  GNSFYINVVKNIAMLLTYDIFEGINNNIHVVYRRNASCGKSMFTRSNAILK  318
             NSF+ +++KNI+  +TYDI   I+NNI+++Y RN   GK  F  +   L+
Sbjct  255  NNSFHYDIIKNISKHITYDIINAIDNNINILYMRNVKPGKYNFINTTNALQ  305

>ref|XP_001348224.1| DNA mismatch repair protein, putative [Plasmodium falciparum 3D7]
 gb|AAN36663.1| DNA mismatch repair protein, putative [Plasmodium falciparum 3D7]
Length=1515

 Score = 33.1 bits (74), Expect = 0.14, Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 22/34 (64%), Gaps = 0/34 (0%)

Query  94    SLFVPSNSYINSAIKRVKKNYQVKKYTFKEKKIF  127
             SLF     YINS I+ +K+NY V K   +EK+IF
Sbjct  1112  SLFREQAYYINSVIEEIKENYSVDKRPTREKEIF  1145

>gb|AAC47438.1| member of var gene family; implicated in antigenic variation [Plasmodium falciparum]
Length=2647

 Score = 30.0 bits (66), Expect = 1.2, Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 28/46 (60%), Gaps = 6/46 (13%)

Query  229   PLHKYYSDYTVEIKLVKEKENNN--VKLITSVKC----NSKNGEEG  268
             P  +++ D   +I    +K N+N  VKL+ SVKC    NS+NG+EG
Sbjct  1946  PRLRFFVDLIRQIAATIDKGNHNGLVKLVKSVKCNCGNNSQNGKEG  1991

>ref|XP_001348751.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7]
 gb|AAN37190.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum 3D7]
Length=1632

 Score = 29.3 bits (64), Expect = 2.5, Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 4/68 (5%)

Query  57    RNNKGKIAK--VLATSALSAYSLNWVYQSGVTLQRDPHYS-LFVPSNSYINSAIKRVKKN  113
             +NNK +  K  +L T  L     N++ +  V L+ D  Y+  +V ++   N+ IK ++KN
Sbjct  1355  QNNKSRKPKEVLLDTQQLFNDDNNYL-KENVCLENDKEYNKCYVNTHKNKNTPIKDIRKN  1413

Query  114   YQVKKYTF  121
             + V  +TF
Sbjct  1414  HNVNSHTF  1421

>emb|CAX64281.1| MORN repeat protein, putative [Plasmodium falciparum 3D7]
Length=4313

 Score = 28.9 bits (63), Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query  233   YYSDYTVEIKLVKEKENNNVKLITSVKCNSKNGEEGNSFYINVVKNIAMLLTYDIFEGIN  292
             Y  DY+++  + K K    V++  ++  N+KN E+   F++NV+ NI +  TYD
Sbjct  1402  YTYDYSIDYDIKKNKYIK-VRIERNMLYNNKNKEQTMLFFLNVLNNINVGYTYDKIRLFK  1460

Query  293   NNIHV  297
             NNI++
Sbjct  1461  NNIYI  1465

>ref|XP_001349792.1| hypothetical protein [Plasmodium falciparum 3D7]
Length=4273

 Score = 28.9 bits (63), Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 1/65 (1%)

Query  233   YYSDYTVEIKLVKEKENNNVKLITSVKCNSKNGEEGNSFYINVVKNIAMLLTYDIFEGIN  292
             Y  DY+++  + K K    V++  ++  N+KN E+   F++NV+ NI +  TYD
Sbjct  1377  YTYDYSIDYDIKKNKYIK-VRIERNMLYNNKNKEQTMLFFLNVLNNINVGYTYDKIRLFK  1435

Query  293   NNIHV  297
             NNI++
Sbjct  1436  NNIYI  1440

>gb|AAO67398.1| erythrocyte membrane protein 1 [Plasmodium falciparum]
Length=175

 Score = 27.7 bits (60), Expect = 6.8, Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 45/95 (47%), Gaps = 16/95 (16%)

Query  176  NVENFFDYIKMVKHEGPILLFQGKLKKQYWIHLPLKYEILKPGDGSLCTLTFTPLHKYYS  235
            N+ NFF      K+ G     Q   +++ W +   KY      DG LC+L++    K
Sbjct  89   NINNFF------KYGGQTDSVQ---QREQWWNNNAKY----IWDGMLCSLSYNTNDKKM-  134

Query  236  DYTVEIKLVKEKENNNVKLITSVKCNSKNGEEGNS  270
            D  V  KL+ + +NNN    TSVK  SK+G   ++
Sbjct  135  DLVVRKKLIIDHQNNNN--YTSVKFPSKSGPSADA  167

>gb|AAB06961.1| erythrocyte membrane protein 1 [Plasmodium falciparum]
Length=2212

 Score = 27.3 bits (59), Expect = 7.7, Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 9/74 (12%)

Query  201   KKQYWIHLPLKYEILKPGDGSLCTLTFTPLHKYYSDYTVEIKLVKEKENNN--VKLITSV  258
             K+  W ++  ++     GD +    +F     +  D   +I    +K N+N  VKL+ SV
Sbjct  1915  KRTEWTNIKNRFNEQYNGDDTEMKSSF---RSFLVDLIRQIAATIDKGNHNGLVKLVKSV  1971

Query  259   KC----NSKNGEEG  268
             KC    NS+NG+EG
Sbjct  1972  KCNCGNNSQNGKEG  1985

>ref|XP_966219.1| ATP-dependent DEAD box helicase [Plasmodium falciparum 3D7]
 emb|CAG25049.1| ATP dependent DEAD-box helicase, putative [Plasmodium falciparum 3D7]
Length=1137

 Score = 26.9 bits (58), Expect = 9.9, Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 2/65 (3%)

Query  221  SLCTLTFTPLHKYYSDYTV-EIKLVKEKENNNVKLITSVKCNSKNGEE-GNSFYINVVKN  278
            ++CT+  TPL+K Y    V EI+++  +   +      +  NSK+    G+ + I+++KN
Sbjct  412  TVCTIEMTPLNKEYDCAIVDEIQMINNESRGHAWTNVLMNLNSKDIYLCGSEYIIDLIKN  471

Query  279  IAMLL  283
            +A +L
Sbjct  472  LADIL  476

  Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
    Posted date:  Mar 22, 2010 5:43 PM
  Number of letters in database: 7,740,688
  Number of sequences in database: 15,987

Lambda      K        H
   0.322    0.137    0.408
Gapped
Lambda      K        H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 15987
Number of Hits to DB: 121969
Number of extensions: 6353
Number of successful extensions: 20
Number of sequences better than 100: 2
Number of HSP's better than 100 without gapping: 0
Number of HSP's gapped: 20
Number of HSP's successfully gapped: 2
Length of query: 322
Length of database: 7740688
Length adjustment: 97
Effective length of query: 225
Effective length of database: 6189949
Effective search space: 1392738525
Effective search space used: 1392738525
T: 11
A: 40
X1: 16 (7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.4 bits)
S2: 50 (23.9 bits)