assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000021325.1_ASM2132v1	NC_011365	Gluconacetobacter diazotrophicus PA1 5, complete sequence	1	388303-388602	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas1,cas2	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	Unclear	AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC,AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC,AGCCTACCATCGGCAAATCGGTAGGGAAACCACGGC	36,36,36	0	0	NA	NA	NA:NA:NA	4,4,4	4	Unclear	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	NA,NA|213aa|down_3|NC_011365.1_391016_391655_-	NA|270aa|up_9|NC_011365.1_376709_377519_+	COG5375, COG5375, Uncharacterized protein conserved in bacteria [Function unknown]	NA|363aa|up_8|NC_011365.1_377515_378604_+	pfam03968, OstA, OstA-like protein	NA|264aa|up_7|NC_011365.1_378600_379392_+	COG1137, YhbG, ABC-type (unclassified) transport system, ATPase component [General function prediction only]	NA|459aa|up_6|NC_011365.1_379402_380779_+	PRK05932, PRK05932, RNA polymerase factor sigma-54; Reviewed	NA|196aa|up_5|NC_011365.1_380873_381461_+	cd00552, RaiA, RaiA ("ribosome-associated inhibitor A", also known as Protein Y (PY), YfiA, and SpotY,  is a stress-response protein that binds the ribosomal subunit interface and arrests translation by interfering with aminoacyl-tRNA binding to the ribosomal A site	NA|88aa|up_4|NC_011365.1_381531_381795_-	pfam06620, DUF1150, Protein of unknown function (DUF1150)	NA|163aa|up_3|NC_011365.1_381908_382397_-	cd06470, ACD_IbpA-B_like, Alpha-crystallin domain (ACD) found in Escherichia coli inclusion body-associated proteins IbpA and IbpB, and similar proteins	NA|353aa|up_2|NC_011365.1_385809_386867_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas1|298aa|up_1|NC_011365.1_387013_387907_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|110aa|up_0|NC_011365.1_387916_388246_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|180aa|down_0|NC_011365.1_388734_389274_+	PRK00150, def, peptide deformylase; Reviewed	NA|306aa|down_1|NC_011365.1_389319_390237_+	PRK00005, fmt, methionyl-tRNA formyltransferase; Reviewed	NA|259aa|down_2|NC_011365.1_390233_391010_+	PRK00021, truA, tRNA pseudouridine(38-40) synthase TruA	NA|213aa|down_3|NC_011365.1_391016_391655_-	NA	NA|386aa|down_4|NC_011365.1_391651_392809_-	PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase; Reviewed	NA|282aa|down_5|NC_011365.1_392805_393651_-	PRK11830, dapD, 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional	NA|294aa|down_6|NC_011365.1_393777_394659_-	PRK00942, PRK00942, acetylglutamate kinase; Provisional	NA|224aa|down_7|NC_011365.1_394759_395431_-	PRK00454, engB, GTP-binding protein YsxC; Reviewed	NA|590aa|down_8|NC_011365.1_395444_397214_-	PRK01318, PRK01318, membrane protein insertase; Provisional	NA|84aa|down_9|NC_011365.1_397235_397487_-	pfam01809, Haemolytic, Haemolytic domain
GCF_000021325.1_ASM2132v1	NC_011365	Gluconacetobacter diazotrophicus PA1 5, complete sequence	2	460172-461968	2,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	Type I-E	CGGTTCATCCCCGCACGTGCGGGGAACAC,CGGTTCATCCCCGCACGTGCGGGGAACAC,CGGTTCATCCCCGCACGTGCGGGGAACAC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	29,29,27	29	TypeI-E	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	NA|46aa|up_8|NC_011365.1_451132_451270_-,NA|134aa|down_9|NC_011365.1_472607_473009_-	NA|238aa|up_9|NC_011365.1_450404_451118_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|46aa|up_8|NC_011365.1_451132_451270_-	NA	NA|199aa|up_7|NC_011365.1_451478_452075_+	cd02219, cupin_YjlB-like, Bacillus subtilis YjlB and related proteins, cupin domain	NA|249aa|up_6|NC_011365.1_452082_452829_+	COG2085, COG2085, Predicted dinucleotide-binding enzymes [General function prediction only]	NA|184aa|up_5|NC_011365.1_452895_453447_-	COG3247, HdeD, Uncharacterized conserved protein [Function unknown]	NA|206aa|up_4|NC_011365.1_453780_454398_+	PRK05327, rpsD, 30S ribosomal protein S4; Validated	NA|528aa|up_3|NC_011365.1_454516_456100_+	COG4108, PrfC, Peptide chain release factor RF-3 [Translation, ribosomal structure and biogenesis]	NA|467aa|up_2|NC_011365.1_456071_457472_-	cd13131, MATE_NorM_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Vibrio cholerae NorM	NA|454aa|up_1|NC_011365.1_457596_458958_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|263aa|up_0|NC_011365.1_458942_459731_+	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	cas2|116aa|down_0|NC_011365.1_462036_462384_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|320aa|down_1|NC_011365.1_462364_463324_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|229aa|down_2|NC_011365.1_463335_464022_-	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas5|261aa|down_3|NC_011365.1_464018_464801_-	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas7|353aa|down_4|NC_011365.1_464809_465868_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|197aa|down_5|NC_011365.1_465864_466455_-	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas8e|547aa|down_6|NC_011365.1_466451_468092_-	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cas3|900aa|down_7|NC_011365.1_468473_471173_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|444aa|down_8|NC_011365.1_471211_472543_-	PRK12558, PRK12558, glutamyl-tRNA synthetase; Provisional	NA|134aa|down_9|NC_011365.1_472607_473009_-	NA
GCF_000021325.1_ASM2132v1	NC_011365	Gluconacetobacter diazotrophicus PA1 5, complete sequence	3	1814792-1817999	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	 Type I-U?,Type I-U,Type I-C	GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC,GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC,GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC	32,32,32	1	1	1816150-1816182	NC_011365.1_2742257-2742289	I-C:I-C:I-C	48,48,48	48	TypeI-U?,TypeI-U,TypeI-C	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	NA,NA	NA|127aa|up_9|NC_011365.1_1805379_1805760_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	cas3|783aa|up_8|NC_011365.1_1805790_1808139_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|137aa|up_7|NC_011365.1_1808299_1808710_-	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|75aa|up_6|NC_011365.1_1808709_1808934_-	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	cas5|224aa|up_5|NC_011365.1_1809187_1809859_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|608aa|up_4|NC_011365.1_1809855_1811679_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|313aa|up_3|NC_011365.1_1811671_1812610_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|219aa|up_2|NC_011365.1_1812618_1813275_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|347aa|up_1|NC_011365.1_1813274_1814315_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NC_011365.1_1814318_1814609_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|622aa|down_0|NC_011365.1_1820209_1822075_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|229aa|down_1|NC_011365.1_1822298_1822985_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|708aa|down_2|NC_011365.1_1822988_1825112_-	pfam03772, Competence, Competence protein	NA|467aa|down_3|NC_011365.1_1825294_1826695_+	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|322aa|down_4|NC_011365.1_1826691_1827657_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|431aa|down_5|NC_011365.1_1827653_1828946_+	PRK09357, pyrC, dihydroorotase; Validated	NA|213aa|down_6|NC_011365.1_1828942_1829581_+	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|396aa|down_7|NC_011365.1_1829577_1830765_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|925aa|down_8|NC_011365.1_1830786_1833561_+	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|782aa|down_9|NC_011365.1_1833563_1835909_+	TIGR02063, Ribonuclease_R, ribonuclease R
GCF_000021325.1_ASM2132v1	NC_011365	Gluconacetobacter diazotrophicus PA1 5, complete sequence	4	1819355-1820112	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	 Type I-U?,Type I-U,Type I-C	GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC,GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC,GTCGCTCCCTGTGCGGGAGCGTGGATTGAAAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	10,11,11	11	TypeI-U?,TypeI-U,TypeI-C	DinG,DEDDh,cas1,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,csa3,cas8c,cas4	NA,NA	NA|127aa|up_9|NC_011365.1_1805379_1805760_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	cas3|783aa|up_8|NC_011365.1_1805790_1808139_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|137aa|up_7|NC_011365.1_1808299_1808710_-	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|75aa|up_6|NC_011365.1_1808709_1808934_-	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	cas5|224aa|up_5|NC_011365.1_1809187_1809859_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|608aa|up_4|NC_011365.1_1809855_1811679_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|313aa|up_3|NC_011365.1_1811671_1812610_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|219aa|up_2|NC_011365.1_1812618_1813275_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|347aa|up_1|NC_011365.1_1813274_1814315_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NC_011365.1_1814318_1814609_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|622aa|down_0|NC_011365.1_1820209_1822075_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|229aa|down_1|NC_011365.1_1822298_1822985_+	COG0177, Nth, Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair]	NA|708aa|down_2|NC_011365.1_1822988_1825112_-	pfam03772, Competence, Competence protein	NA|467aa|down_3|NC_011365.1_1825294_1826695_+	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|322aa|down_4|NC_011365.1_1826691_1827657_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|431aa|down_5|NC_011365.1_1827653_1828946_+	PRK09357, pyrC, dihydroorotase; Validated	NA|213aa|down_6|NC_011365.1_1828942_1829581_+	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|396aa|down_7|NC_011365.1_1829577_1830765_+	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|925aa|down_8|NC_011365.1_1830786_1833561_+	PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated	NA|782aa|down_9|NC_011365.1_1833563_1835909_+	TIGR02063, Ribonuclease_R, ribonuclease R
