assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003612855.1_ASM361285v1	CP025288	Ethanoligenens harbinense strain X-29 chromosome, complete genome	2	674784-693588	2,1,1,2,3	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-U,Type I-C, Type I-U?	ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,TCCATCATTTCAATCCACGCTCTCCGTGTGGAGAGCGACCATGT,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC	33,33,44,33,33	0	0	NA	NA	NA:NA:NA:NA:NA	279,279,268,268,268	279	TypeI-U,TypeI-C,TypeI-U?	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA|100aa|up_5|CP025288.1_666377_666677_-,NA|77aa|up_4|CP025288.1_666793_667024_+,NA	NA|156aa|up_9|CP025288.1_661198_661666_-	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|213aa|up_8|CP025288.1_662890_663529_-	PRK09289, PRK09289, riboflavin synthase	NA|368aa|up_7|CP025288.1_663510_664614_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|295aa|up_6|CP025288.1_665480_666365_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|100aa|up_5|CP025288.1_666377_666677_-	NA	NA|77aa|up_4|CP025288.1_666793_667024_+	NA	NA|68aa|up_3|CP025288.1_667324_667528_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|118aa|up_2|CP025288.1_667735_668089_+	PRK00215, PRK00215, transcriptional repressor LexA	NA|387aa|up_1|CP025288.1_668394_669555_-	pfam07907, YibE_F, YibE/F-like protein	NA|1532aa|up_0|CP025288.1_669698_674294_-	cd07399, MPP_YvnB, Bacillus subtilis YvnB and related proteins, metallophosphatase domain	cas2|97aa|down_0|CP025288.1_693761_694052_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|CP025288.1_694060_695092_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|222aa|down_2|CP025288.1_695088_695754_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas7|283aa|down_3|CP025288.1_695743_696592_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|656aa|down_4|CP025288.1_696591_698559_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|248aa|down_5|CP025288.1_698533_699277_-	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|815aa|down_6|CP025288.1_699329_701774_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|418aa|down_7|CP025288.1_701968_703222_-	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|376aa|down_8|CP025288.1_703488_704616_+	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|216aa|down_9|CP025288.1_704832_705480_+	PRK01362, PRK01362, fructose-6-phosphate aldolase
GCA_003612855.1_ASM361285v1	CP025288	Ethanoligenens harbinense strain X-29 chromosome, complete genome	3	3010571-3010671	3	CRISPRCasFinder	no	csa3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-A	AACGTGGGCTGTGCCCACTGCCTGCTTCA	29	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA,NA|441aa|down_5|CP025288.1_3019011_3020334_+,NA|244aa|down_7|CP025288.1_3022218_3022950_-	NA|115aa|up_9|CP025288.1_3001098_3001443_+	PTZ00397, PTZ00397, macrophage migration inhibition factor-like protein; Provisional	NA|400aa|up_8|CP025288.1_3001460_3002660_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|176aa|up_7|CP025288.1_3002784_3003312_+	TIGR04002, TIGR04002_family_protein, TIGR04002 family protein	NA|713aa|up_6|CP025288.1_3003499_3005638_-	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	csa3|126aa|up_5|CP025288.1_3005644_3006022_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|257aa|up_4|CP025288.1_3006322_3007093_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|238aa|up_3|CP025288.1_3007446_3008160_+	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|166aa|up_2|CP025288.1_3008105_3008603_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|152aa|up_1|CP025288.1_3008651_3009107_+	cd00384, ALAD_PBGS, Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll	NA|433aa|up_0|CP025288.1_3009106_3010405_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|205aa|down_0|CP025288.1_3010694_3011309_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|1181aa|down_1|CP025288.1_3011413_3014956_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|456aa|down_2|CP025288.1_3015386_3016754_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|257aa|down_3|CP025288.1_3016783_3017554_+	cd09087, Ape1-like_AP-endo, Human Ape1-like subfamily of the ExoIII family apurinic/apyrimidinic (AP) endonucleases	NA|349aa|down_4|CP025288.1_3017917_3018964_+	COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|441aa|down_5|CP025288.1_3019011_3020334_+	NA	NA|574aa|down_6|CP025288.1_3020408_3022130_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|244aa|down_7|CP025288.1_3022218_3022950_-	NA	NA|99aa|down_8|CP025288.1_3022946_3023243_-	pfam12637, TSCPD, TSCPD domain	NA|293aa|down_9|CP025288.1_3023295_3024174_-	cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase (HisPPase) AMP bound
