assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002158885.1_ASM215888v1	NZ_CP021427	Lactobacillus gasseri strain 4M13, complete genome	1	1744290-1744919	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2,csn2	DEDDh,cas3,cas2,csa3,cas9,cas1,csn2,WYL,DinG	Type II-B,Type II-A,Type II-C	GTTTTAGATGGTTGTTAGATCAATAAGGCTTAGATC,GTTTTAGATGGTTGTTAGATCAATAAGGCTTAGATC,GTTTTAGATGGTTGTTAGATCAATAAGGCTTAGATC	36,36,36	0	0	NA	NA	NA:NA:NA	8,9,9	9	TypeII-B,TypeII-A,TypeII-C	DEDDh,cas3,cas2,csa3,cas9,cas1,csn2,WYL,DinG	NA,NA|135aa|down_6|NZ_CP021427.1_1750688_1751093_+	NA|56aa|up_9|NZ_CP021427.1_1732084_1732252_-	pfam13074, DUF3938, Protein of unknown function (DUF3938)	NA|392aa|up_8|NZ_CP021427.1_1732872_1734048_+	cd01106, HTH_TipAL-Mta, Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA	NA|346aa|up_7|NZ_CP021427.1_1734129_1735167_+	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|295aa|up_6|NZ_CP021427.1_1735173_1736058_+	TIGR01207, Glucose-1-phosphate_thymidylyltransferase_1, glucose-1-phosphate thymidylyltransferase, short form	NA|203aa|up_5|NZ_CP021427.1_1736090_1736699_+	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|329aa|up_4|NZ_CP021427.1_1736715_1737702_+	pfam04321, RmlD_sub_bind, RmlD substrate binding domain	cas9|1387aa|up_3|NZ_CP021427.1_1738034_1742195_+	pfam16592, Cas9_REC, REC lobe of CRISPR-associated endonuclease Cas9	cas1|303aa|up_2|NZ_CP021427.1_1742400_1743309_+	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas2|114aa|up_1|NZ_CP021427.1_1743286_1743628_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	csn2|223aa|up_0|NZ_CP021427.1_1743588_1744257_+	cd12218, Csn2, CRISPR/Cas system-associated protein Csn2	NA|142aa|down_0|NZ_CP021427.1_1745568_1745994_+	smart00910, HIRAN, The HIRAN protein (HIP116, Rad5p N-terminal) is found in the N-terminal regions of the SWI2/SNF2 proteins typified by HIP116 and Rad5p	NA|350aa|down_1|NZ_CP021427.1_1746060_1747110_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|306aa|down_2|NZ_CP021427.1_1747284_1748202_+	cd07340, M48B_Htpx_like, Peptidase M48 subfamily B HtpX-like membrane-bound metallopeptidase	NA|280aa|down_3|NZ_CP021427.1_1748406_1749246_-	TIGR04211, hypothetical_protein, SH3 domain protein	NA|147aa|down_4|NZ_CP021427.1_1749369_1749810_+	cd03427, MTH1, MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily	NA|220aa|down_5|NZ_CP021427.1_1749839_1750499_-	pfam09726, Macoilin, Macoilin family	NA|135aa|down_6|NZ_CP021427.1_1750688_1751093_+	NA	NA|128aa|down_7|NZ_CP021427.1_1751169_1751553_+	pfam08861, DUF1828, Domain of unknown function DUF1828	NA|268aa|down_8|NZ_CP021427.1_1751769_1752573_+	pfam08428, Rib, Rib/alpha-like repeat	NA|451aa|down_9|NZ_CP021427.1_1752641_1753994_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]
GCF_002158885.1_ASM215888v1	NZ_CP021427	Lactobacillus gasseri strain 4M13, complete genome	2	2091681-2091783	2	CRISPRCasFinder	no		DEDDh,cas3,cas2,csa3,cas9,cas1,csn2,WYL,DinG	Orphan	CTTTCGATTTTCGAAATTCCGCTCGTAGCAAGGGTTT	37	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,cas2,csa3,cas9,cas1,csn2,WYL,DinG	NA|151aa|up_8|NZ_CP021427.1_2085458_2085911_+,NA|42aa|up_7|NZ_CP021427.1_2086091_2086217_-,NA|60aa|up_6|NZ_CP021427.1_2086363_2086543_+,NA|70aa|up_4|NZ_CP021427.1_2089216_2089426_+,NA|93aa|up_3|NZ_CP021427.1_2089448_2089727_+,NA|62aa|up_1|NZ_CP021427.1_2090369_2090555_-,NA|72aa|down_1|NZ_CP021427.1_2093631_2093847_+,NA|131aa|down_7|NZ_CP021427.1_2101628_2102021_-	NA|127aa|up_9|NZ_CP021427.1_2085085_2085466_+	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|151aa|up_8|NZ_CP021427.1_2085458_2085911_+	NA	NA|42aa|up_7|NZ_CP021427.1_2086091_2086217_-	NA	NA|60aa|up_6|NZ_CP021427.1_2086363_2086543_+	NA	NA|687aa|up_5|NZ_CP021427.1_2086886_2088947_-	pfam03389, MobA_MobL, MobA/MobL family	NA|70aa|up_4|NZ_CP021427.1_2089216_2089426_+	NA	NA|93aa|up_3|NZ_CP021427.1_2089448_2089727_+	NA	NA|219aa|up_2|NZ_CP021427.1_2089716_2090373_-	COG4185, COG4185, Uncharacterized protein conserved in bacteria [Function unknown]	NA|62aa|up_1|NZ_CP021427.1_2090369_2090555_-	NA	NA|94aa|up_0|NZ_CP021427.1_2090601_2090883_-	TIGR02384, Putative_antitoxin_RelB, addiction module antitoxin, RelB/DinJ family	NA|262aa|down_0|NZ_CP021427.1_2092839_2093625_+	pfam13614, AAA_31, AAA domain	NA|72aa|down_1|NZ_CP021427.1_2093631_2093847_+	NA	NA|265aa|down_2|NZ_CP021427.1_2095198_2095993_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|524aa|down_3|NZ_CP021427.1_2096547_2098119_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|529aa|down_4|NZ_CP021427.1_2098125_2099712_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|281aa|down_5|NZ_CP021427.1_2099722_2100565_-	COG1944, COG1944, Uncharacterized conserved protein [Function unknown]	NA|339aa|down_6|NZ_CP021427.1_2100597_2101614_-	pfam00899, ThiF, ThiF family	NA|131aa|down_7|NZ_CP021427.1_2101628_2102021_-	NA	NA|350aa|down_8|NZ_CP021427.1_2102432_2103482_-	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|211aa|down_9|NZ_CP021427.1_2103847_2104480_-	pfam00881, Nitroreductase, Nitroreductase family
