assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003020045.1_ASM302004v1	CP025286	Ethanoligenens harbinense YUAN-3 chromosome, complete genome	2	674796-693600	2,1,1,2,3	CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas7,cas8c,cas5,cas3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-U,Type I-C, Type I-U?	ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,TCCATCATTTCAATCCACGCTCTCCGTGTGGAGAGCGACCATGT,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC,ATTTCAATCCACGCTCTCCGTGTGGAGAGCGAC	33,33,44,33,33	0	0	NA	NA	NA:NA:NA:NA:NA	279,279,268,268,268	279	TypeI-U,TypeI-C,TypeI-U?	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA|100aa|up_5|CP025286.1_666389_666689_-,NA|77aa|up_4|CP025286.1_666805_667036_+,NA	NA|400aa|up_9|CP025286.1_661691_662891_-	PRK09311, PRK09311, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II	NA|213aa|up_8|CP025286.1_662902_663541_-	PRK09289, PRK09289, riboflavin synthase	NA|368aa|up_7|CP025286.1_663522_664626_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|295aa|up_6|CP025286.1_665492_666377_-	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|100aa|up_5|CP025286.1_666389_666689_-	NA	NA|77aa|up_4|CP025286.1_666805_667036_+	NA	NA|68aa|up_3|CP025286.1_667336_667540_+	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|118aa|up_2|CP025286.1_667747_668101_+	PRK00215, PRK00215, transcriptional repressor LexA	NA|387aa|up_1|CP025286.1_668406_669567_-	pfam07907, YibE_F, YibE/F-like protein	NA|1532aa|up_0|CP025286.1_669710_674306_-	cd07399, MPP_YvnB, Bacillus subtilis YvnB and related proteins, metallophosphatase domain	cas2|97aa|down_0|CP025286.1_693773_694064_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|344aa|down_1|CP025286.1_694072_695104_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|222aa|down_2|CP025286.1_695100_695766_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas7|283aa|down_3|CP025286.1_695755_696604_-	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8c|656aa|down_4|CP025286.1_696603_698571_-	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas5|248aa|down_5|CP025286.1_698545_699289_-	cd09651, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|815aa|down_6|CP025286.1_699341_701786_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	NA|418aa|down_7|CP025286.1_701980_703234_-	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|376aa|down_8|CP025286.1_703500_704628_+	COG0562, Glf, UDP-galactopyranose mutase [Cell envelope biogenesis, outer membrane]	NA|216aa|down_9|CP025286.1_704844_705492_+	PRK01362, PRK01362, fructose-6-phosphate aldolase
GCA_003020045.1_ASM302004v1	CP025286	Ethanoligenens harbinense YUAN-3 chromosome, complete genome	3	2919962-2920062	3	CRISPRCasFinder	no	csa3	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	Type I-A	AACGTGGGCTGTGCCCACTGCCTGCTTCA	29	0	0	NA	NA	NA	1	1	Orphan	Cas14u_CAS-V,cas3,cas2,cas1,cas4,cas7,cas8c,cas5,csa3,DinG,WYL,DEDDh	NA,NA|441aa|down_5|CP025286.1_2928402_2929725_+,NA|244aa|down_7|CP025286.1_2931609_2932341_-	NA|115aa|up_9|CP025286.1_2910489_2910834_+	PTZ00397, PTZ00397, macrophage migration inhibition factor-like protein; Provisional	NA|400aa|up_8|CP025286.1_2910851_2912051_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|176aa|up_7|CP025286.1_2912175_2912703_+	TIGR04002, TIGR04002_family_protein, TIGR04002 family protein	NA|713aa|up_6|CP025286.1_2912890_2915029_-	cd07548, P-type_ATPase-Cd_Zn_Co_like, P-type heavy metal-transporting ATPase, similar to Bacillus subtilis CadA which appears to transport cadmium, zinc and cobalt but not copper out of the cell	csa3|126aa|up_5|CP025286.1_2915035_2915413_-	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|257aa|up_4|CP025286.1_2915713_2916484_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|238aa|up_3|CP025286.1_2916837_2917551_+	cd00710, LbH_gamma_CA, Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site	NA|166aa|up_2|CP025286.1_2917496_2917994_+	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|152aa|up_1|CP025286.1_2918042_2918498_+	cd00384, ALAD_PBGS, Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll	NA|433aa|up_0|CP025286.1_2918497_2919796_+	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|205aa|down_0|CP025286.1_2920085_2920700_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|1181aa|down_1|CP025286.1_2920804_2924347_-	TIGR02176, pyruvate_flavodoxin/ferrodoxin_oxidoreductase, pyruvate:ferredoxin (flavodoxin) oxidoreductase, homodimeric	NA|456aa|down_2|CP025286.1_2924777_2926145_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]	NA|257aa|down_3|CP025286.1_2926174_2926945_+	cd09087, Ape1-like_AP-endo, Human Ape1-like subfamily of the ExoIII family apurinic/apyrimidinic (AP) endonucleases	NA|349aa|down_4|CP025286.1_2927308_2928355_+	COG4632, EpsL, Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase [Carbohydrate transport and metabolism]	NA|441aa|down_5|CP025286.1_2928402_2929725_+	NA	NA|574aa|down_6|CP025286.1_2929799_2931521_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|244aa|down_7|CP025286.1_2931609_2932341_-	NA	NA|99aa|down_8|CP025286.1_2932337_2932634_-	pfam12637, TSCPD, TSCPD domain	NA|296aa|down_9|CP025286.1_2932678_2933566_-	cd07438, PHP_HisPPase_AMP, Polymerase and Histidinol Phosphatase domain of Histidinol phosphate phosphatase (HisPPase) AMP bound
