assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900209925.1_EH1	NZ_LT907978	Anaerobutyricum hallii isolate EH1 chromosome I	1	1783035-1783326	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	 Type I-U?,Type I-U,Type I-C	TTTCAATCCACGCCTCCGCGAAGGAGGCGAC,TTTCAATCCACGCCTCCGCGAAGGAGGCGAC,TTTCAATCCACGCCTCCGCGAAGGAGGCGAC	31,31,31	0	0	NA	NA	I-C:I-C:I-C	4,4,4	4	TypeI-U?,TypeI-U,TypeI-C	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	NA|50aa|up_3|NZ_LT907978.1_1777960_1778110_+,NA|75aa|up_1|NZ_LT907978.1_1779721_1779946_-,NA	NA|473aa|up_9|NZ_LT907978.1_1771460_1772879_+	pfam00665, rve, Integrase core domain	NA|273aa|up_8|NZ_LT907978.1_1772871_1773690_+	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|168aa|up_7|NZ_LT907978.1_1773755_1774259_+	pfam06541, ABC_trans_CmpB, Putative ABC-transporter type IV	NA|147aa|up_6|NZ_LT907978.1_1774429_1774870_+	pfam08239, SH3_3, Bacterial SH3 domain	NA|138aa|up_5|NZ_LT907978.1_1776207_1776621_+	pfam04688, Holin_SPP1, SPP1 phage holin	NA|438aa|up_4|NZ_LT907978.1_1776620_1777934_+	COG1705, FlgJ, Muramidase (flagellum-specific) [Cell motility and secretion / Intracellular trafficking and secretion]	NA|50aa|up_3|NZ_LT907978.1_1777960_1778110_+	NA	NA|400aa|up_2|NZ_LT907978.1_1778277_1779477_+	pfam00872, Transposase_mut, Transposase, Mutator family	NA|75aa|up_1|NZ_LT907978.1_1779721_1779946_-	NA	NA|876aa|up_0|NZ_LT907978.1_1780172_1782800_-	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	cas2|97aa|down_0|NZ_LT907978.1_1783645_1783936_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|NZ_LT907978.1_1783946_1784978_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|226aa|down_2|NZ_LT907978.1_1784974_1785652_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|287aa|down_3|NZ_LT907978.1_1785638_1786499_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|327aa|down_4|NZ_LT907978.1_1786499_1787480_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas8c|328aa|down_5|NZ_LT907978.1_1787419_1788403_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|243aa|down_6|NZ_LT907978.1_1788399_1789128_-	TIGR01876, cas_Cas5d, CRISPR-associated protein Cas5, subtype I-C/DVULG	NA|486aa|down_7|NZ_LT907978.1_1789290_1790748_+	pfam05598, DUF772, Transposase domain (DUF772)	cas3|425aa|down_8|NZ_LT907978.1_1790926_1792201_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas3HD|368aa|down_9|NZ_LT907978.1_1792204_1793308_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3
GCF_900209925.1_EH1	NZ_LT907978	Anaerobutyricum hallii isolate EH1 chromosome I	2	2069965-2072212	2,2,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	 Type I-U?,Type I-U,Type I-C	ATTTCTACTCACACATCCCGTGTGGGATGTGAC,ATTTCTACTCACACATCCCGTGTGGGATGTGAC,ATTTCTACTCACACATCCCGTGTGGGATGTGAC,ATTTCTACTCACACATCCCGTGTGGGATGTGAC	33,33,33,33	7	7	2071074-2071106|2071140-2071173|2071407-2071443|2071678-2071710|2071877-2071909|2072080-2072112|2072146-2072179	NZ_LT907978.1_2300132-2300100|NZ_LT907978.1_2295224-2295191|NZ_LT907978.1_2298623-2298587|NZ_LT907978.1_2291087-2291055|NZ_LT907978.1_2287446-2287414|NZ_LT907978.1_2284513-2284481|NZ_LT907978.1_2293813-2293780	NA:NA:NA:NA	31,33,33,31	33	TypeI-U?,TypeI-U,TypeI-C	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	NA|150aa|up_8|NZ_LT907978.1_2060436_2060886_-,NA	NA|384aa|up_9|NZ_LT907978.1_2059227_2060379_-	cd17292, RMtype1_S_LlaA17I_TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to the S subunit TRD2-CR2 regions of Lactococcus lactis subsp	NA|150aa|up_8|NZ_LT907978.1_2060436_2060886_-	NA	NA|162aa|up_7|NZ_LT907978.1_2060866_2061352_-	pfam13151, DUF3990, Protein of unknown function (DUF3990)	NA|411aa|up_6|NZ_LT907978.1_2061503_2062736_-	COG0732, HsdS, Restriction endonuclease S subunits [Defense mechanisms]	NA|533aa|up_5|NZ_LT907978.1_2062740_2064339_-	COG0286, HsdM, Type I restriction-modification system methyltransferase subunit [Defense mechanisms]	NA|248aa|up_4|NZ_LT907978.1_2064754_2065498_-	TIGR00186, Uncharacterized_tRNA/rRNA_methyltransferase_MG252, rRNA methylase, putative, group 3	NA|146aa|up_3|NZ_LT907978.1_2065499_2065937_-	COG1939, COG1939, Ribonuclease III family protein [Replication, recombination, and repair]	NA|465aa|up_2|NZ_LT907978.1_2065921_2067316_-	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|417aa|up_1|NZ_LT907978.1_2067486_2068737_-	pfam13527, Acetyltransf_9, Acetyltransferase (GNAT) domain	NA|187aa|up_0|NZ_LT907978.1_2068733_2069294_-	pfam02542, YgbB, YgbB family	cas2|97aa|down_0|NZ_LT907978.1_2072373_2072664_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|341aa|down_1|NZ_LT907978.1_2072692_2073715_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|225aa|down_2|NZ_LT907978.1_2073711_2074386_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7|287aa|down_3|NZ_LT907978.1_2074382_2075243_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|611aa|down_4|NZ_LT907978.1_2075244_2077077_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|243aa|down_5|NZ_LT907978.1_2077070_2077799_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|740aa|down_6|NZ_LT907978.1_2077818_2080038_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|394aa|down_7|NZ_LT907978.1_2080327_2081509_-	cd03886, M20_Acy1, M20 Peptidase Aminoacylase 1 family	NA|159aa|down_8|NZ_LT907978.1_2081537_2082014_-	pfam08006, DUF1700, Protein of unknown function (DUF1700)	NA|221aa|down_9|NZ_LT907978.1_2082168_2082831_-	COG0569, TrkA, K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism]
GCF_900209925.1_EH1	NZ_LT907978	Anaerobutyricum hallii isolate EH1 chromosome I	3	2110733-2110827	3	CRISPRCasFinder	no		PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	Orphan	GTTTTAGACTCCGGTTTTTCGGCCAG	26	0	0	NA	NA	NA	1	1	Orphan	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	NA|70aa|up_2|NZ_LT907978.1_2109302_2109512_-,NA|74aa|down_2|NZ_LT907978.1_2114223_2114445_-	NA|79aa|up_9|NZ_LT907978.1_2100853_2101090_+	pfam00381, PTS-HPr, PTS HPr component phosphorylation site	NA|395aa|up_8|NZ_LT907978.1_2101172_2102357_-	PRK01565, PRK01565, thiamine biosynthesis protein ThiI; Provisional	NA|383aa|up_7|NZ_LT907978.1_2102453_2103602_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|279aa|up_6|NZ_LT907978.1_2103957_2104794_-	COG5492, COG5492, Bacterial surface proteins containing Ig-like domains [Cell motility and secretion]	NA|420aa|up_5|NZ_LT907978.1_2107397_2108657_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|73aa|up_4|NZ_LT907978.1_2108741_2108960_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system	NA|65aa|up_3|NZ_LT907978.1_2109060_2109255_-	TIGR01764, Probable_excisionase, DNA binding domain, excisionase family	NA|70aa|up_2|NZ_LT907978.1_2109302_2109512_-	NA	NA|82aa|up_1|NZ_LT907978.1_2109680_2109926_-	pfam12645, HTH_16, Helix-turn-helix domain	NA|153aa|up_0|NZ_LT907978.1_2109894_2110353_-	TIGR02985, Sig70_bacteroi1, RNA polymerase sigma-70 factor, Bacteroides expansion family 1	NA|274aa|down_0|NZ_LT907978.1_2111232_2112054_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|114aa|down_1|NZ_LT907978.1_2113777_2114119_-	pfam12650, DUF3784, Domain of unknown function (DUF3784)	NA|74aa|down_2|NZ_LT907978.1_2114223_2114445_-	NA	NA|622aa|down_3|NZ_LT907978.1_2114612_2116478_-	COG1653, UgpB, ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism]	NA|200aa|down_4|NZ_LT907978.1_2116989_2117589_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|375aa|down_5|NZ_LT907978.1_2117585_2118710_-	pfam01955, CbiZ, Adenosylcobinamide amidohydrolase	NA|338aa|down_6|NZ_LT907978.1_2119088_2120102_-	pfam01261, AP_endonuc_2, Xylose isomerase-like TIM barrel	NA|486aa|down_7|NZ_LT907978.1_2120269_2121727_+	pfam05598, DUF772, Transposase domain (DUF772)	NA|486aa|down_8|NZ_LT907978.1_2122023_2123481_+	pfam05598, DUF772, Transposase domain (DUF772)	NA|284aa|down_9|NZ_LT907978.1_2123633_2124485_-	pfam07155, ECF-ribofla_trS, ECF-type riboflavin transporter, S component
GCF_900209925.1_EH1	NZ_LT907978	Anaerobutyricum hallii isolate EH1 chromosome I	4	2463065-2465456	4,3,4	CRISPRCasFinder,CRT,PILER-CR	no	WYL,csx1,csx20,cas2,cas1,csm3gr7,csx19,cas10,cas6	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	Type III-A,Type III-D,Type III-C,Type III-B	GTCTCAATCCCTCATAGGTAATGTAATCC,GTCTCAATCCCTCATAGGTAATGTAATCC,GTCTCAATCCCTCATAGGTAATGTAATCC	29,29,29	0	0	NA	NA	NA:NA:NA	35,35,33	35	TypeIII-A,TypeIII-D,TypeIII-C,TypeIII-B	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	NA|66aa|up_9|NZ_LT907978.1_2451454_2451652_-,NA|176aa|up_7|NZ_LT907978.1_2452798_2453326_+,NA|214aa|up_4|NZ_LT907978.1_2456416_2457058_-,csx20|121aa|up_2|NZ_LT907978.1_2459007_2459370_-,NA|47aa|down_6|NZ_LT907978.1_2474600_2474741_-,NA|366aa|down_9|NZ_LT907978.1_2476133_2477231_-	NA|66aa|up_9|NZ_LT907978.1_2451454_2451652_-	NA	WYL|254aa|up_8|NZ_LT907978.1_2451771_2452533_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|176aa|up_7|NZ_LT907978.1_2452798_2453326_+	NA	WYL|251aa|up_6|NZ_LT907978.1_2454653_2455406_+	pfam13280, WYL, WYL domain	WYL|348aa|up_5|NZ_LT907978.1_2455398_2456442_+	pfam13280, WYL, WYL domain	NA|214aa|up_4|NZ_LT907978.1_2456416_2457058_-	NA	csx1|515aa|up_3|NZ_LT907978.1_2457296_2458841_-	pfam09455, Cas_DxTHG, CRISPR-associated (Cas) DxTHG family	csx20|121aa|up_2|NZ_LT907978.1_2459007_2459370_-	NA	cas2|87aa|up_1|NZ_LT907978.1_2459366_2459627_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|637aa|up_0|NZ_LT907978.1_2459653_2461564_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	csm3gr7|641aa|down_0|NZ_LT907978.1_2465846_2467769_-	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csx19|168aa|down_1|NZ_LT907978.1_2467781_2468285_-	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|450aa|down_2|NZ_LT907978.1_2468284_2469634_-	TIGR02581, putative_CRISPR-associated_protein, CRISPR-associated RAMP protein, SSO1426 family	csm3gr7|788aa|down_3|NZ_LT907978.1_2469636_2472000_-	pfam03787, RAMPs, RAMP superfamily	cas10|567aa|down_4|NZ_LT907978.1_2471980_2473681_-	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	cas6|247aa|down_5|NZ_LT907978.1_2473801_2474542_+	COG1583, COG1583, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|47aa|down_6|NZ_LT907978.1_2474600_2474741_-	NA	NA|131aa|down_7|NZ_LT907978.1_2475035_2475428_-	pfam15970, HicB-like_2, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|67aa|down_8|NZ_LT907978.1_2475874_2476075_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|366aa|down_9|NZ_LT907978.1_2476133_2477231_-	NA
GCF_900209925.1_EH1	NZ_LT907978	Anaerobutyricum hallii isolate EH1 chromosome I	5	2970957-2971044	5	CRISPRCasFinder	no		PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	Orphan	AACAAAATCAATGAATATTACTTA	24	0	0	NA	NA	NA	1	1	Orphan	PrimPol,RT,c2c10_CAS-V-U3,csa3,cas2,cas1,cas4,cas7,cas8c,cas5,cas3,cas3HD,PD-DExK,DinG,WYL,csx1,csx20,csm3gr7,csx19,cas10,cas6,DEDDh,cas14j	NA,NA|676aa|down_7|NZ_LT907978.1_2980599_2982627_-	NA|288aa|up_9|NZ_LT907978.1_2959069_2959933_-	COG1092, COG1092, Predicted SAM-dependent methyltransferases [General function prediction only]	NA|790aa|up_8|NZ_LT907978.1_2960023_2962393_-	pfam01841, Transglut_core, Transglutaminase-like superfamily	NA|397aa|up_7|NZ_LT907978.1_2962382_2963573_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|326aa|up_6|NZ_LT907978.1_2963569_2964547_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|280aa|up_5|NZ_LT907978.1_2964630_2965470_-	COG0348, NapH, Polyferredoxin [Energy production and conversion]	NA|164aa|up_4|NZ_LT907978.1_2965469_2965961_-	COG3976, COG3976, Uncharacterized protein conserved in bacteria [Function unknown]	NA|32aa|up_3|NZ_LT907978.1_2966078_2966174_-	pfam04205, FMN_bind, FMN-binding domain	NA|146aa|up_2|NZ_LT907978.1_2966146_2966584_+	smart00060, FN3, Fibronectin type 3 domain	NA|302aa|up_1|NZ_LT907978.1_2967170_2968076_-	TIGR00762, DegV, EDD domain protein, DegV family	NA|618aa|up_0|NZ_LT907978.1_2968300_2970154_-	pfam08282, Hydrolase_3, haloacid dehalogenase-like hydrolase	NA|413aa|down_0|NZ_LT907978.1_2971107_2972346_+	pfam13786, DUF4179, Domain of unknown function (DUF4179)	NA|222aa|down_1|NZ_LT907978.1_2972937_2973603_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|276aa|down_2|NZ_LT907978.1_2973724_2974552_-	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|336aa|down_3|NZ_LT907978.1_2974551_2975559_-	PRK07331, PRK07331, cobalt transporter CbiM	NA|403aa|down_4|NZ_LT907978.1_2975752_2976961_+	pfam03690, UPF0160, Uncharacterized protein family (UPF0160)	NA|210aa|down_5|NZ_LT907978.1_2977725_2978355_+	PRK00129, upp, uracil phosphoribosyltransferase; Reviewed	NA|461aa|down_6|NZ_LT907978.1_2978449_2979832_+	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and metabolism]	NA|676aa|down_7|NZ_LT907978.1_2980599_2982627_-	NA	NA|300aa|down_8|NZ_LT907978.1_2982601_2983501_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|124aa|down_9|NZ_LT907978.1_2983497_2983869_-	COG1725, COG1725, Predicted transcriptional regulators [Transcription]
