assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	1	441669-441752	1	CRISPRCasFinder	no		cas3,csa3,WYL	Orphan	ACAAGCTGCTTTCTTTCCTTTTCTTTT	27	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL	NA,NA|161aa|down_0|NC_010830.1_442156_442639_-,NA|85aa|down_1|NC_010830.1_443301_443556_-,NA|54aa|down_2|NC_010830.1_443637_443799_-,NA|383aa|down_7|NC_010830.1_451291_452440_-	NA|343aa|up_9|NC_010830.1_422156_423185_+	pfam02450, LCAT, Lecithin:cholesterol acyltransferase	NA|220aa|up_8|NC_010830.1_423566_424226_-	pfam09365, DUF2461, Conserved hypothetical protein (DUF2461)	NA|497aa|up_7|NC_010830.1_424353_425844_-	PRK04173, PRK04173, glycyl-tRNA synthetase; Provisional	NA|2172aa|up_6|NC_010830.1_427183_433699_-	sd00045, ANK, ankyrin repeats	NA|192aa|up_5|NC_010830.1_433940_434516_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|408aa|up_4|NC_010830.1_435429_436653_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|275aa|up_3|NC_010830.1_436854_437679_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|74aa|up_2|NC_010830.1_438409_438631_+	pfam11387, DUF2795, Protein of unknown function (DUF2795)	NA|428aa|up_1|NC_010830.1_438666_439950_-	cd04243, AAK_AK-HSDH-like, AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH)	NA|332aa|up_0|NC_010830.1_440045_441041_+	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|161aa|down_0|NC_010830.1_442156_442639_-	NA	NA|85aa|down_1|NC_010830.1_443301_443556_-	NA	NA|54aa|down_2|NC_010830.1_443637_443799_-	NA	NA|515aa|down_3|NC_010830.1_443910_445455_+	TIGR00763, Lon_protease, endopeptidase La	NA|227aa|down_4|NC_010830.1_446111_446791_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|936aa|down_5|NC_010830.1_446881_449689_-	cd16452, SP-RING_like, A group of variants of RING finger including SP-RING finger, SPL-RING finger, dRING finger, and RING-like Rtf2 domain	NA|358aa|down_6|NC_010830.1_450124_451198_-	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|383aa|down_7|NC_010830.1_451291_452440_-	NA	NA|176aa|down_8|NC_010830.1_452660_453188_-	cd01055, Nonheme_Ferritin, nonheme-containing ferritins	NA|264aa|down_9|NC_010830.1_453478_454271_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	2	705919-706071	2	CRISPRCasFinder	no		cas3,csa3,WYL	Orphan	GTTATCTTCTGCAGGTATAAAGCATGATACCCATAGCACTTCCCATACTAG	51	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL	NA|293aa|up_7|NC_010830.1_683618_684497_-,NA|88aa|down_0|NC_010830.1_709176_709440_+,NA|86aa|down_1|NC_010830.1_709453_709711_+,NA|131aa|down_2|NC_010830.1_709736_710129_+,NA|56aa|down_5|NC_010830.1_712998_713166_-,NA|248aa|down_6|NC_010830.1_713228_713972_-,NA|199aa|down_8|NC_010830.1_717921_718518_+	NA|791aa|up_9|NC_010830.1_679204_681577_+	PLN02437, PLN02437, ribonucleoside--diphosphate reductase large subunit	NA|276aa|up_8|NC_010830.1_682203_683030_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|293aa|up_7|NC_010830.1_683618_684497_-	NA	NA|73aa|up_6|NC_010830.1_684605_684824_-	pfam12937, F-box-like, F-box-like	NA|531aa|up_5|NC_010830.1_685783_687376_+	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|649aa|up_4|NC_010830.1_688010_689957_-	sd00010, SLR, Sel1-like repeat	NA|507aa|up_3|NC_010830.1_690877_692398_-	cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily	NA|1135aa|up_2|NC_010830.1_692468_695873_-	sd00010, SLR, Sel1-like repeat	NA|187aa|up_1|NC_010830.1_696180_696741_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|2301aa|up_0|NC_010830.1_696931_703834_-	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|88aa|down_0|NC_010830.1_709176_709440_+	NA	NA|86aa|down_1|NC_010830.1_709453_709711_+	NA	NA|131aa|down_2|NC_010830.1_709736_710129_+	NA	NA|273aa|down_3|NC_010830.1_710521_711340_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|315aa|down_4|NC_010830.1_711425_712370_-	pfam00665, rve, Integrase core domain	NA|56aa|down_5|NC_010830.1_712998_713166_-	NA	NA|248aa|down_6|NC_010830.1_713228_713972_-	NA	NA|1259aa|down_7|NC_010830.1_713976_717753_-	TIGR02243, hypothetical_protein_SCD8A	NA|199aa|down_8|NC_010830.1_717921_718518_+	NA	NA|645aa|down_9|NC_010830.1_718613_720548_+	PRK00413, thrS, threonyl-tRNA synthetase; Reviewed
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	3	764804-764927	3	CRISPRCasFinder	no		cas3,csa3,WYL	Orphan	TACTTGGAATATCAGTTACTGTAGCCGTTAGGCTAAC	37	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL	NA|777aa|up_0|NC_010830.1_760997_763328_-,NA|119aa|down_3|NC_010830.1_772902_773259_-,NA|946aa|down_4|NC_010830.1_773454_776292_+,NA|166aa|down_5|NC_010830.1_776323_776821_+	NA|312aa|up_9|NC_010830.1_747071_748007_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|459aa|up_8|NC_010830.1_748372_749749_-	PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated	NA|234aa|up_7|NC_010830.1_751032_751734_+	pfam13568, OMP_b-brl_2, Outer membrane protein beta-barrel domain	NA|243aa|up_6|NC_010830.1_752116_752845_+	cd11649, RsmI_like, uncharacterized subfamily of the tetrapyrrole methylase family similar to Ribosomal RNA small subunit methyltransferase I (RsmI)	NA|276aa|up_5|NC_010830.1_753612_754440_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|428aa|up_4|NC_010830.1_754778_756062_+	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|513aa|up_3|NC_010830.1_756073_757612_-	COG0606, COG0606, Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones]	NA|433aa|up_2|NC_010830.1_757853_759152_-	pfam11013, DUF2851, Protein of unknown function (DUF2851)	NA|353aa|up_1|NC_010830.1_759642_760701_+	cd05656, M42_Frv, M42 Peptidase, endoglucanases	NA|777aa|up_0|NC_010830.1_760997_763328_-	NA	NA|174aa|down_0|NC_010830.1_768707_769229_-	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional	NA|487aa|down_1|NC_010830.1_769323_770784_-	PRK00139, murE, UDP-N-acetylmuramoylalanyl-D-glutamate--2,6-diaminopimelate ligase; Provisional	NA|705aa|down_2|NC_010830.1_770787_772902_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|119aa|down_3|NC_010830.1_772902_773259_-	NA	NA|946aa|down_4|NC_010830.1_773454_776292_+	NA	NA|166aa|down_5|NC_010830.1_776323_776821_+	NA	NA|597aa|down_6|NC_010830.1_777218_779009_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|323aa|down_7|NC_010830.1_779081_780050_-	COG0545, FkpA, FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]	NA|337aa|down_8|NC_010830.1_780060_781071_-	COG0618, COG0618, Exopolyphosphatase-related proteins [General function prediction only]	NA|123aa|down_9|NC_010830.1_781272_781641_-	pfam03840, SecG, Preprotein translocase SecG subunit
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	4	872778-872930	4	CRISPRCasFinder	no		cas3,csa3,WYL	Orphan	GTTATCTTCTGCAGGTATAAAGCATGATACCCATAGCACTTCCCATACTAG	51	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL	NA|51aa|up_2|NC_010830.1_870308_870461_+,NA|325aa|up_0|NC_010830.1_871367_872342_+,NA|195aa|down_0|NC_010830.1_873925_874510_+,NA|157aa|down_2|NC_010830.1_876051_876522_+,NA|108aa|down_3|NC_010830.1_876543_876867_+,NA|192aa|down_4|NC_010830.1_877140_877716_+,NA|63aa|down_8|NC_010830.1_881138_881327_+	NA|59aa|up_9|NC_010830.1_864870_865047_+	pfam13427, DUF4111, Domain of unknown function (DUF4111)	NA|64aa|up_8|NC_010830.1_865409_865601_+	pfam03190, Thioredox_DsbH, Protein of unknown function, DUF255	NA|608aa|up_7|NC_010830.1_865623_867447_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|582aa|up_6|NC_010830.1_867531_869277_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|72aa|up_5|NC_010830.1_869394_869610_+	pfam13173, AAA_14, AAA domain	NA|65aa|up_4|NC_010830.1_869646_869841_+	pfam13173, AAA_14, AAA domain	NA|44aa|up_3|NC_010830.1_870107_870239_+	pfam13635, DUF4143, Domain of unknown function (DUF4143)	NA|51aa|up_2|NC_010830.1_870308_870461_+	NA	NA|139aa|up_1|NC_010830.1_870597_871014_+	COG1734, DksA, DnaK suppressor protein [Signal transduction mechanisms]	NA|325aa|up_0|NC_010830.1_871367_872342_+	NA	NA|195aa|down_0|NC_010830.1_873925_874510_+	NA	NA|206aa|down_1|NC_010830.1_875384_876002_+	cd00737, lyz_endolysin_autolysin, endolysin and autolysin	NA|157aa|down_2|NC_010830.1_876051_876522_+	NA	NA|108aa|down_3|NC_010830.1_876543_876867_+	NA	NA|192aa|down_4|NC_010830.1_877140_877716_+	NA	NA|227aa|down_5|NC_010830.1_877740_878420_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|298aa|down_6|NC_010830.1_879685_880579_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|162aa|down_7|NC_010830.1_880621_881107_+	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|63aa|down_8|NC_010830.1_881138_881327_+	NA	NA|135aa|down_9|NC_010830.1_881861_882266_-	pfam10116, Host_attach, Protein required for attachment to host cells
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	5	1072716-1072868	5	CRISPRCasFinder	no	WYL	cas3,csa3,WYL	Unclear	CTAGAATGGGAAGTGCTATGGGTATCATGCTTTATACCTGCAGAAGATAAC	51	0	0	NA	NA	NA	1	1	Orphan	cas3,csa3,WYL	NA|112aa|up_4|NC_010830.1_1068088_1068424_-,NA|97aa|up_3|NC_010830.1_1068639_1068930_-,NA|162aa|up_2|NC_010830.1_1069192_1069678_-,NA|145aa|up_1|NC_010830.1_1069665_1070100_-,NA|68aa|down_0|NC_010830.1_1074103_1074307_+,NA|62aa|down_6|NC_010830.1_1088462_1088648_+,NA|155aa|down_7|NC_010830.1_1088887_1089352_+	NA|138aa|up_9|NC_010830.1_1061571_1061985_-	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]	NA|139aa|up_8|NC_010830.1_1062016_1062433_-	pfam08570, DUF1761, Protein of unknown function (DUF1761)	WYL|230aa|up_7|NC_010830.1_1062508_1063198_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|1106aa|up_6|NC_010830.1_1064040_1067358_+	cd10322, SLC5sbd, Solute carrier 5 family, sodium/glucose transporters and related proteins; solute-binding domain	NA|199aa|up_5|NC_010830.1_1067364_1067961_+	cd07182, RNase_HII_bacteria_HII_like, Bacterial Ribonuclease HII-like	NA|112aa|up_4|NC_010830.1_1068088_1068424_-	NA	NA|97aa|up_3|NC_010830.1_1068639_1068930_-	NA	NA|162aa|up_2|NC_010830.1_1069192_1069678_-	NA	NA|145aa|up_1|NC_010830.1_1069665_1070100_-	NA	NA|540aa|up_0|NC_010830.1_1070100_1071720_-	PRK09510, tolA, cell envelope integrity inner membrane protein TolA; Provisional	NA|68aa|down_0|NC_010830.1_1074103_1074307_+	NA	NA|724aa|down_1|NC_010830.1_1075077_1077249_+	PHA02876, PHA02876, ankyrin repeat protein; Provisional	NA|962aa|down_2|NC_010830.1_1078069_1080955_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|779aa|down_3|NC_010830.1_1081030_1083367_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|685aa|down_4|NC_010830.1_1084708_1086763_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|405aa|down_5|NC_010830.1_1086792_1088007_+	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|62aa|down_6|NC_010830.1_1088462_1088648_+	NA	NA|155aa|down_7|NC_010830.1_1088887_1089352_+	NA	NA|325aa|down_8|NC_010830.1_1089605_1090580_+	COG2515, Acd, 1-aminocyclopropane-1-carboxylate deaminase [Amino acid transport and metabolism]	NA|599aa|down_9|NC_010830.1_1090738_1092535_-	PRK00558, uvrC, excinuclease ABC subunit UvrC
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	6	1306972-1307217	1	PILER-CR	no		cas3,csa3,WYL	Orphan	GGTATATATCAATAGCTGAAAAGTTAATAGATAAATTACCT	41	0	0	NA	NA	NA	2	2	Orphan	cas3,csa3,WYL	NA|142aa|up_9|NC_010830.1_1290705_1291131_+,NA|49aa|up_6|NC_010830.1_1293111_1293258_+,NA|115aa|up_5|NC_010830.1_1293431_1293776_+,NA|335aa|down_4|NC_010830.1_1314793_1315798_+,NA|621aa|down_9|NC_010830.1_1320385_1322248_-	NA|142aa|up_9|NC_010830.1_1290705_1291131_+	NA	NA|276aa|up_8|NC_010830.1_1291414_1292242_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|261aa|up_7|NC_010830.1_1292277_1293060_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|49aa|up_6|NC_010830.1_1293111_1293258_+	NA	NA|115aa|up_5|NC_010830.1_1293431_1293776_+	NA	NA|413aa|up_4|NC_010830.1_1294439_1295678_+	cd06453, SufS_like, Cysteine desulfurase (SufS)-like	NA|448aa|up_3|NC_010830.1_1295778_1297122_+	sd00045, ANK, ankyrin repeats	NA|141aa|up_2|NC_010830.1_1297164_1297587_+	pfam02657, SufE, Fe-S metabolism associated domain	NA|109aa|up_1|NC_010830.1_1297588_1297915_+	TIGR02945, SUF_assoc, FeS assembly SUF system protein	NA|2273aa|up_0|NC_010830.1_1298411_1305230_+	COG3899, COG3899, Predicted ATPase [General function prediction only]	NA|512aa|down_0|NC_010830.1_1307689_1309225_+	sd00045, ANK, ankyrin repeats	NA|139aa|down_1|NC_010830.1_1309377_1309794_+	pfam06491, Disulph_isomer, Disulphide isomerase	NA|152aa|down_2|NC_010830.1_1309840_1310296_+	COG1225, Bcp, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]	NA|1117aa|down_3|NC_010830.1_1311225_1314576_+	cd10322, SLC5sbd, Solute carrier 5 family, sodium/glucose transporters and related proteins; solute-binding domain	NA|335aa|down_4|NC_010830.1_1314793_1315798_+	NA	NA|171aa|down_5|NC_010830.1_1315936_1316449_+	PRK00802, PRK00802, DNA-3-methyladenine glycosylase	NA|302aa|down_6|NC_010830.1_1316579_1317485_+	PRK06080, PRK06080, 1,4-dihydroxy-2-naphthoate octaprenyltransferase; Validated	NA|56aa|down_7|NC_010830.1_1317576_1317744_+	cd03673, Ap6A_hydrolase, Diadenosine hexaphosphate (Ap6A) hydrolase is a member of the Nudix hydrolase superfamily	NA|720aa|down_8|NC_010830.1_1317880_1320040_-	pfam01764, Lipase_3, Lipase (class 3)	NA|621aa|down_9|NC_010830.1_1320385_1322248_-	NA
GCF_000020565.1_ASM2056v1	NC_010830	Candidatus Amoebophilus asiaticus 5a2, complete sequence	7	1554772-1555024	2	PILER-CR	no		cas3,csa3,WYL	Orphan	TTTAGCATTGACATCAGCTCCTGATTCTATTAACAGTTTAGCTACTTCTAGGTGC	55	1	1	1554827-1554870	NC_010830.1_1554728-1554771	NA	2	2	Orphan	cas3,csa3,WYL	NA|364aa|up_5|NC_010830.1_1548987_1550079_+,NA|83aa|up_3|NC_010830.1_1550565_1550814_-,NA|158aa|down_1|NC_010830.1_1558214_1558688_-,NA|435aa|down_4|NC_010830.1_1562263_1563568_-	NA|56aa|up_9|NC_010830.1_1545118_1545286_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|56aa|up_8|NC_010830.1_1545293_1545461_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|385aa|up_7|NC_010830.1_1545695_1546850_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	NA|305aa|up_6|NC_010830.1_1547515_1548430_-	sd00010, SLR, Sel1-like repeat	NA|364aa|up_5|NC_010830.1_1548987_1550079_+	NA	NA|126aa|up_4|NC_010830.1_1550191_1550569_-	cd18683, PIN_VapC-like, Uncharacterized subfamily of the VapC (virulence-associated protein C)-like family of the PIN domain superfamily	NA|83aa|up_3|NC_010830.1_1550565_1550814_-	NA	NA|57aa|up_2|NC_010830.1_1551168_1551339_-	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|166aa|up_1|NC_010830.1_1551265_1551763_-	pfam13102, Phage_int_SAM_5, Phage integrase SAM-like domain	NA|75aa|up_0|NC_010830.1_1551674_1551899_-	pfam17293, Arm-DNA-bind_5, Arm DNA-binding domain	NA|553aa|down_0|NC_010830.1_1556590_1558249_-	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|158aa|down_1|NC_010830.1_1558214_1558688_-	NA	NA|139aa|down_2|NC_010830.1_1559129_1559546_-	sd00045, ANK, ankyrin repeats	NA|473aa|down_3|NC_010830.1_1559613_1561032_-	PHA03100, PHA03100, ankyrin repeat protein; Provisional	NA|435aa|down_4|NC_010830.1_1562263_1563568_-	NA	NA|275aa|down_5|NC_010830.1_1564106_1564931_+	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|297aa|down_6|NC_010830.1_1565096_1565987_-	pfam12937, F-box-like, F-box-like	NA|431aa|down_7|NC_010830.1_1566174_1567467_-	cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily	NA|110aa|down_8|NC_010830.1_1568071_1568401_+	sd00045, ANK, ankyrin repeats	NA|1250aa|down_9|NC_010830.1_1568696_1572446_-	PHA03100, PHA03100, ankyrin repeat protein; Provisional
