assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002355735.1_ASM235573v1	NZ_AP014879	Sulfuricaulis limicola strain HA5	1	2408975-2409076	1	PILER-CR	no		cas3,cas4,csa3,RT,cas6,DEDDh,DinG	Orphan	GCGCGAGCTGCGGCAGGA	18	0	0	NA	NA	NA	2	2	Orphan	cas3,cas4,csa3,RT,cas6,DEDDh,DinG	NA,NA|266aa|down_0|NZ_AP014879.1_2409194_2409992_-	NA|140aa|up_9|NZ_AP014879.1_2399804_2400224_-	PRK10738, PRK10738, OsmC family protein	NA|276aa|up_8|NZ_AP014879.1_2400220_2401048_-	PRK00278, trpC, indole-3-glycerol phosphate synthase TrpC	NA|342aa|up_7|NZ_AP014879.1_2401044_2402070_-	PRK00188, trpD, anthranilate phosphoribosyltransferase; Provisional	NA|411aa|up_6|NZ_AP014879.1_2402130_2403363_-	cd17320, MFS_MdfA_MDR_like, Multidrug transporter MdfA and similar multidrug resistance (MDR) transporters of the Major Facilitator Superfamily	NA|193aa|up_5|NZ_AP014879.1_2403373_2403952_-	PRK05670, PRK05670, anthranilate synthase component II; Provisional	NA|499aa|up_4|NZ_AP014879.1_2403954_2405451_-	PRK13565, PRK13565, anthranilate synthase component I; Provisional	NA|236aa|up_3|NZ_AP014879.1_2405564_2406272_-	PRK13222, PRK13222, N-acetylmuramic acid 6-phosphate phosphatase MupP	NA|231aa|up_2|NZ_AP014879.1_2406271_2406964_-	PRK08883, PRK08883, ribulose-phosphate 3-epimerase; Provisional	NA|232aa|up_1|NZ_AP014879.1_2407250_2407946_-	PRK12274, PRK12274, serine/threonine protein kinase; Provisional	NA|159aa|up_0|NZ_AP014879.1_2408058_2408535_+	COG5615, COG5615, Predicted integral membrane protein [Function unknown]	NA|266aa|down_0|NZ_AP014879.1_2409194_2409992_-	NA	NA|121aa|down_1|NZ_AP014879.1_2410041_2410404_-	pfam13473, Cupredoxin_1, Cupredoxin-like domain	NA|762aa|down_2|NZ_AP014879.1_2410400_2412686_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|72aa|down_3|NZ_AP014879.1_2412675_2412891_-	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|124aa|down_4|NZ_AP014879.1_2413081_2413453_-	cd14797, DUF302, Uncharacterized domain family DUF302	NA|827aa|down_5|NZ_AP014879.1_2413713_2416194_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|154aa|down_6|NZ_AP014879.1_2416324_2416786_-	pfam00034, Cytochrom_C, Cytochrome c	NA|162aa|down_7|NZ_AP014879.1_2416872_2417358_-	pfam00034, Cytochrom_C, Cytochrome c	NA|552aa|down_8|NZ_AP014879.1_2417504_2419160_+	COG2509, COG2509, Uncharacterized FAD-dependent dehydrogenases [General function prediction only]	NA|297aa|down_9|NZ_AP014879.1_2419386_2420277_-	PRK13961, PRK13961, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional
GCF_002355735.1_ASM235573v1	NZ_AP014879	Sulfuricaulis limicola strain HA5	2	2810977-2811071	1	CRISPRCasFinder	no		cas3,cas4,csa3,RT,cas6,DEDDh,DinG	Orphan	CCTGCCGGTGATCGTGCCGCTGTTG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas4,csa3,RT,cas6,DEDDh,DinG	NA,NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA|309aa|up_9|NZ_AP014879.1_2795907_2796834_+	PRK06223, PRK06223, malate dehydrogenase; Reviewed	NA|308aa|up_8|NZ_AP014879.1_2796833_2797757_+	PRK11805, PRK11805, 50S ribosomal protein L3 N(5)-glutamine methyltransferase	NA|570aa|up_7|NZ_AP014879.1_2797753_2799463_-	cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA synthetases	NA|454aa|up_6|NZ_AP014879.1_2799491_2800853_-	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|678aa|up_5|NZ_AP014879.1_2800999_2803033_-	PRK11154, fadJ, fatty acid oxidation complex subunit alpha FadJ	NA|431aa|up_4|NZ_AP014879.1_2803026_2804319_-	PRK08170, PRK08170, acetyl-CoA C-acetyltransferase	NA|799aa|up_3|NZ_AP014879.1_2804413_2806810_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|57aa|up_2|NZ_AP014879.1_2806935_2807106_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|325aa|up_1|NZ_AP014879.1_2807193_2808168_+	pfam00762, Ferrochelatase, Ferrochelatase	NA|274aa|up_0|NZ_AP014879.1_2808164_2808986_+	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|560aa|down_0|NZ_AP014879.1_2813285_2814965_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|721aa|down_1|NZ_AP014879.1_2814969_2817132_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|271aa|down_2|NZ_AP014879.1_2817134_2817947_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|296aa|down_3|NZ_AP014879.1_2817954_2818842_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|210aa|down_4|NZ_AP014879.1_2818927_2819557_+	PRK00043, thiE, thiamine phosphate synthase	NA|427aa|down_5|NZ_AP014879.1_2819673_2820954_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA	NA|205aa|down_7|NZ_AP014879.1_2821639_2822254_-	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|921aa|down_8|NZ_AP014879.1_2822826_2825589_-	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|223aa|down_9|NZ_AP014879.1_2825818_2826487_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]
GCF_002355735.1_ASM235573v1	NZ_AP014879	Sulfuricaulis limicola strain HA5	3	2811388-2811482	2	CRISPRCasFinder	no		cas3,cas4,csa3,RT,cas6,DEDDh,DinG	Orphan	CCTGCCGGTGATCGTGCCGCTGTTG	25	0	0	NA	NA	NA	1	1	Orphan	cas3,cas4,csa3,RT,cas6,DEDDh,DinG	NA,NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA|309aa|up_9|NZ_AP014879.1_2795907_2796834_+	PRK06223, PRK06223, malate dehydrogenase; Reviewed	NA|308aa|up_8|NZ_AP014879.1_2796833_2797757_+	PRK11805, PRK11805, 50S ribosomal protein L3 N(5)-glutamine methyltransferase	NA|570aa|up_7|NZ_AP014879.1_2797753_2799463_-	cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA synthetases	NA|454aa|up_6|NZ_AP014879.1_2799491_2800853_-	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|678aa|up_5|NZ_AP014879.1_2800999_2803033_-	PRK11154, fadJ, fatty acid oxidation complex subunit alpha FadJ	NA|431aa|up_4|NZ_AP014879.1_2803026_2804319_-	PRK08170, PRK08170, acetyl-CoA C-acetyltransferase	NA|799aa|up_3|NZ_AP014879.1_2804413_2806810_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|57aa|up_2|NZ_AP014879.1_2806935_2807106_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|325aa|up_1|NZ_AP014879.1_2807193_2808168_+	pfam00762, Ferrochelatase, Ferrochelatase	NA|274aa|up_0|NZ_AP014879.1_2808164_2808986_+	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|560aa|down_0|NZ_AP014879.1_2813285_2814965_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|721aa|down_1|NZ_AP014879.1_2814969_2817132_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|271aa|down_2|NZ_AP014879.1_2817134_2817947_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|296aa|down_3|NZ_AP014879.1_2817954_2818842_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|210aa|down_4|NZ_AP014879.1_2818927_2819557_+	PRK00043, thiE, thiamine phosphate synthase	NA|427aa|down_5|NZ_AP014879.1_2819673_2820954_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA	NA|205aa|down_7|NZ_AP014879.1_2821639_2822254_-	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|921aa|down_8|NZ_AP014879.1_2822826_2825589_-	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|223aa|down_9|NZ_AP014879.1_2825818_2826487_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]
GCF_002355735.1_ASM235573v1	NZ_AP014879	Sulfuricaulis limicola strain HA5	4	2811637-2811811	3	CRISPRCasFinder	no		cas3,cas4,csa3,RT,cas6,DEDDh,DinG	Orphan	CCTGCCGGTGATCGTGCCGCTGTTG	25	0	0	NA	NA	NA	2	2	Orphan	cas3,cas4,csa3,RT,cas6,DEDDh,DinG	NA,NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA|309aa|up_9|NZ_AP014879.1_2795907_2796834_+	PRK06223, PRK06223, malate dehydrogenase; Reviewed	NA|308aa|up_8|NZ_AP014879.1_2796833_2797757_+	PRK11805, PRK11805, 50S ribosomal protein L3 N(5)-glutamine methyltransferase	NA|570aa|up_7|NZ_AP014879.1_2797753_2799463_-	cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and Bubblegum-like very long-chain fatty acid CoA synthetases	NA|454aa|up_6|NZ_AP014879.1_2799491_2800853_-	pfam03349, Toluene_X, Outer membrane protein transport protein (OMPP1/FadL/TodX)	NA|678aa|up_5|NZ_AP014879.1_2800999_2803033_-	PRK11154, fadJ, fatty acid oxidation complex subunit alpha FadJ	NA|431aa|up_4|NZ_AP014879.1_2803026_2804319_-	PRK08170, PRK08170, acetyl-CoA C-acetyltransferase	NA|799aa|up_3|NZ_AP014879.1_2804413_2806810_-	PRK09463, fadE, acyl-CoA dehydrogenase; Reviewed	NA|57aa|up_2|NZ_AP014879.1_2806935_2807106_-	COG1773, COG1773, Rubredoxin [Energy production and conversion]	NA|325aa|up_1|NZ_AP014879.1_2807193_2808168_+	pfam00762, Ferrochelatase, Ferrochelatase	NA|274aa|up_0|NZ_AP014879.1_2808164_2808986_+	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|560aa|down_0|NZ_AP014879.1_2813285_2814965_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|721aa|down_1|NZ_AP014879.1_2814969_2817132_+	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|271aa|down_2|NZ_AP014879.1_2817134_2817947_+	pfam00561, Abhydrolase_1, alpha/beta hydrolase fold	NA|296aa|down_3|NZ_AP014879.1_2817954_2818842_-	COG0384, COG0384, Predicted epimerase, PhzC/PhzF homolog [General function prediction only]	NA|210aa|down_4|NZ_AP014879.1_2818927_2819557_+	PRK00043, thiE, thiamine phosphate synthase	NA|427aa|down_5|NZ_AP014879.1_2819673_2820954_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|79aa|down_6|NZ_AP014879.1_2821128_2821365_-	NA	NA|205aa|down_7|NZ_AP014879.1_2821639_2822254_-	COG2863, COG2863, Cytochrome c553 [Energy production and conversion]	NA|921aa|down_8|NZ_AP014879.1_2822826_2825589_-	PRK05755, PRK05755, DNA polymerase I; Provisional	NA|223aa|down_9|NZ_AP014879.1_2825818_2826487_+	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]
