assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900097105.1_WK001	NZ_LT629973	Akkermansia glycaniphila isolate APytT chromosome I	1	1786706-1787812	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	Orphan	GTCGCACTCCTCACGGAGTGCGTGAATTGAAAC,GTCGCACTCCTCACGGAGTGCGTGGATTGAAAC,GTCGCACTCCTCACGGAGTGCGTG	33,33,24	0	0	NA	NA	NA:NA:NA	15,16,16	16	Orphan	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	NA|314aa|up_8|NZ_LT629973.1_1776432_1777374_+,NA|108aa|up_7|NZ_LT629973.1_1777444_1777768_-,NA|76aa|up_1|NZ_LT629973.1_1785372_1785600_+,NA|257aa|up_0|NZ_LT629973.1_1785604_1786375_+,NA	NA|539aa|up_9|NZ_LT629973.1_1774431_1776048_+	PRK05290, PRK05290, hybrid cluster protein; Provisional	NA|314aa|up_8|NZ_LT629973.1_1776432_1777374_+	NA	NA|108aa|up_7|NZ_LT629973.1_1777444_1777768_-	NA	NA|150aa|up_6|NZ_LT629973.1_1777764_1778214_-	COG2510, COG2510, Predicted membrane protein [Function unknown]	NA|369aa|up_5|NZ_LT629973.1_1778524_1779631_-	COG4335, COG4335, DNA alkylation repair enzyme [DNA replication, recombination, and repair]	NA|439aa|up_4|NZ_LT629973.1_1779759_1781076_-	PRK07232, PRK07232, bifunctional malic enzyme oxidoreductase/phosphotransacetylase; Reviewed	NA|161aa|up_3|NZ_LT629973.1_1781212_1781695_-	pfam05656, DUF805, Protein of unknown function (DUF805)	NA|1066aa|up_2|NZ_LT629973.1_1781899_1785097_+	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|76aa|up_1|NZ_LT629973.1_1785372_1785600_+	NA	NA|257aa|up_0|NZ_LT629973.1_1785604_1786375_+	NA	NA|457aa|down_0|NZ_LT629973.1_1787819_1789190_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|342aa|down_1|NZ_LT629973.1_1789247_1790273_-	cd02801, DUS_like_FMN, Dihydrouridine synthase-like (DUS-like) FMN-binding domain	NA|403aa|down_2|NZ_LT629973.1_1790352_1791561_+	PRK13371, PRK13371, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional	NA|358aa|down_3|NZ_LT629973.1_1791603_1792677_+	pfam01757, Acyl_transf_3, Acyltransferase family	NA|484aa|down_4|NZ_LT629973.1_1792789_1794241_+	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|508aa|down_5|NZ_LT629973.1_1794254_1795778_+	PRK00074, guaA, GMP synthase; Reviewed	NA|201aa|down_6|NZ_LT629973.1_1795799_1796402_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|226aa|down_7|NZ_LT629973.1_1796398_1797076_+	pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase	NA|170aa|down_8|NZ_LT629973.1_1797072_1797582_+	COG3467, COG3467, Predicted flavin-nucleotide-binding protein [General function prediction only]	NA|324aa|down_9|NZ_LT629973.1_1797578_1798550_+	COG0679, COG0679, Predicted permeases [General function prediction only]
GCF_900097105.1_WK001	NZ_LT629973	Akkermansia glycaniphila isolate APytT chromosome I	2	1801379-1801750	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no		DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	Orphan	GTCGCACTCCTCACGGAGTGCGTGGATTGAAAC,GTCGCACTCCTCACGGAGTGCGTGGATT,GTCGCACTCCTCACGGAGTGCGTGGATTGAAAC	33,28,33	0	0	NA	NA	NA:NA:NA	4,5,5	5	Orphan	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	NA|139aa|up_3|NZ_LT629973.1_1798646_1799063_+,NA|93aa|up_2|NZ_LT629973.1_1799072_1799351_+,NA|179aa|up_1|NZ_LT629973.1_1799747_1800284_+,NA|100aa|down_0|NZ_LT629973.1_1801931_1802231_-,NA|125aa|down_1|NZ_LT629973.1_1802235_1802610_-,NA|1112aa|down_3|NZ_LT629973.1_1803492_1806828_-,NA|149aa|down_4|NZ_LT629973.1_1806830_1807277_-,NA|215aa|down_7|NZ_LT629973.1_1810790_1811435_+,NA|194aa|down_9|NZ_LT629973.1_1813146_1813728_-	NA|484aa|up_9|NZ_LT629973.1_1792789_1794241_+	pfam00478, IMPDH, IMP dehydrogenase / GMP reductase domain	NA|508aa|up_8|NZ_LT629973.1_1794254_1795778_+	PRK00074, guaA, GMP synthase; Reviewed	NA|201aa|up_7|NZ_LT629973.1_1795799_1796402_+	cd03357, LbH_MAT_GAT, Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates	NA|226aa|up_6|NZ_LT629973.1_1796398_1797076_+	pfam13419, HAD_2, Haloacid dehalogenase-like hydrolase	NA|170aa|up_5|NZ_LT629973.1_1797072_1797582_+	COG3467, COG3467, Predicted flavin-nucleotide-binding protein [General function prediction only]	NA|324aa|up_4|NZ_LT629973.1_1797578_1798550_+	COG0679, COG0679, Predicted permeases [General function prediction only]	NA|139aa|up_3|NZ_LT629973.1_1798646_1799063_+	NA	NA|93aa|up_2|NZ_LT629973.1_1799072_1799351_+	NA	NA|179aa|up_1|NZ_LT629973.1_1799747_1800284_+	NA	NA|269aa|up_0|NZ_LT629973.1_1800307_1801114_+	pfam01430, HSP33, Hsp33 protein	NA|100aa|down_0|NZ_LT629973.1_1801931_1802231_-	NA	NA|125aa|down_1|NZ_LT629973.1_1802235_1802610_-	NA	NA|276aa|down_2|NZ_LT629973.1_1802660_1803488_-	pfam00852, Glyco_transf_10, Glycosyltransferase family 10 (fucosyltransferase) C-term	NA|1112aa|down_3|NZ_LT629973.1_1803492_1806828_-	NA	NA|149aa|down_4|NZ_LT629973.1_1806830_1807277_-	NA	NA|139aa|down_5|NZ_LT629973.1_1807416_1807833_+	pfam14237, DUF4339, Domain of unknown function (DUF4339)	NA|952aa|down_6|NZ_LT629973.1_1807829_1810685_-	PTZ00121, PTZ00121, MAEBL; Provisional	NA|215aa|down_7|NZ_LT629973.1_1810790_1811435_+	NA	NA|357aa|down_8|NZ_LT629973.1_1811542_1812613_+	pfam13614, AAA_31, AAA domain	NA|194aa|down_9|NZ_LT629973.1_1813146_1813728_-	NA
GCF_900097105.1_WK001	NZ_LT629973	Akkermansia glycaniphila isolate APytT chromosome I	3	1836746-1837309	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	Type II-A,Type II-C, Type II-B,Type II-B, or Type II-C?	ACTGTACCATGCCTTACTTTGGATTCAAGGCAAAAC,ACTGTACCATGCCTTACTTTGGATTCAAGGCAAAAC,ACTGTACCATGCCTTACTTTGGATTCAAGGCAAAAC	36,36,36	1	1	1836848-1836877	NZ_LT629973.1_1318994-1318965	NA:NA:NA	7,8,8	8	TypeII-A,TypeII-C,TypeII-B,TypeII-B,orTypeII-C?	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	NA,NA|389aa|down_7|NZ_LT629973.1_1845909_1847076_-,NA|397aa|down_8|NZ_LT629973.1_1847138_1848329_-,NA|158aa|down_9|NZ_LT629973.1_1848616_1849090_-	NA|833aa|up_9|NZ_LT629973.1_1822383_1824882_+	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|358aa|up_8|NZ_LT629973.1_1825516_1826590_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|383aa|up_7|NZ_LT629973.1_1826614_1827763_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|331aa|up_6|NZ_LT629973.1_1827875_1828868_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|327aa|up_5|NZ_LT629973.1_1829346_1830327_+	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|264aa|up_4|NZ_LT629973.1_1830434_1831226_+	COG1119, ModF, ABC-type molybdenum transport system, ATPase component/photorepair protein PhrA [Inorganic ion transport and metabolism]	NA|222aa|up_3|NZ_LT629973.1_1831262_1831928_+	COG2186, FadR, Transcriptional regulators [Transcription]	cas9|1120aa|up_2|NZ_LT629973.1_1832092_1835452_+	pfam13395, HNH_4, HNH endonuclease	cas1|312aa|up_1|NZ_LT629973.1_1835455_1836391_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|111aa|up_0|NZ_LT629973.1_1836354_1836687_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|175aa|down_0|NZ_LT629973.1_1837329_1837854_-	cd16343, LMWPTP, Low molecular weight protein tyrosine phosphatase	NA|161aa|down_1|NZ_LT629973.1_1838083_1838566_-	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|266aa|down_2|NZ_LT629973.1_1838795_1839593_-	pfam03808, Glyco_tran_WecB, Glycosyl transferase WecB/TagA/CpsF family	NA|696aa|down_3|NZ_LT629973.1_1839638_1841726_-	PRK05443, PRK05443, polyphosphate kinase; Provisional	NA|529aa|down_4|NZ_LT629973.1_1841741_1843328_-	COG0248, GppA, Exopolyphosphatase [Nucleotide transport and metabolism / Inorganic ion transport and metabolism]	NA|514aa|down_5|NZ_LT629973.1_1843491_1845033_-	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|159aa|down_6|NZ_LT629973.1_1845259_1845736_-	PRK11465, PRK11465, putative mechanosensitive channel protein; Provisional	NA|389aa|down_7|NZ_LT629973.1_1845909_1847076_-	NA	NA|397aa|down_8|NZ_LT629973.1_1847138_1848329_-	NA	NA|158aa|down_9|NZ_LT629973.1_1848616_1849090_-	NA
GCF_900097105.1_WK001	NZ_LT629973	Akkermansia glycaniphila isolate APytT chromosome I	4	2800145-2801170	4,4,4	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	Type I-U,Type I-C, Type I-U?	GTCGCACCCCTCGCGGGTGCGTGGATTGAAAC,GTCGCACCCCTCGCGGGTGCGTGGATTGAAAC,GTCGCACCCCTCGCGGGTGCGTGGATTGAAAC	32,32,32	0	0	NA	NA	I-C:I-C:I-C	14,14,15	15	TypeI-U,TypeI-C,TypeI-U?	DEDDh,csa3,DinG,cas9,cas1,cas2,cas3,cas5,cas8c,cas7,cas4	NA|269aa|up_8|NZ_LT629973.1_2790645_2791452_+,NA|148aa|down_2|NZ_LT629973.1_2803523_2803967_-	NA|248aa|up_9|NZ_LT629973.1_2789902_2790646_+	PRK00235, cobS, cobalamin synthase; Reviewed	NA|269aa|up_8|NZ_LT629973.1_2790645_2791452_+	NA	NA|195aa|up_7|NZ_LT629973.1_2791423_2792008_+	pfam02283, CobU, Cobinamide kinase / cobinamide phosphate guanyltransferase	cas3|788aa|up_6|NZ_LT629973.1_2792093_2794457_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|238aa|up_5|NZ_LT629973.1_2794460_2795174_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|607aa|up_4|NZ_LT629973.1_2795177_2796998_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|287aa|up_3|NZ_LT629973.1_2797043_2797904_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|223aa|up_2|NZ_LT629973.1_2797954_2798623_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|341aa|up_1|NZ_LT629973.1_2798619_2799642_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_LT629973.1_2799643_2799934_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|188aa|down_0|NZ_LT629973.1_2801290_2801854_-	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|513aa|down_1|NZ_LT629973.1_2801861_2803400_-	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|148aa|down_2|NZ_LT629973.1_2803523_2803967_-	NA	NA|260aa|down_3|NZ_LT629973.1_2803963_2804743_-	pfam09578, Spore_YabQ, Spore cortex protein YabQ (Spore_YabQ)	NA|891aa|down_4|NZ_LT629973.1_2804765_2807438_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|91aa|down_5|NZ_LT629973.1_2807583_2807856_+	TIGR01411, Sec-independent_protein_translocase_protein_TatA, twin arginine-targeting protein translocase, TatA/E family	NA|390aa|down_6|NZ_LT629973.1_2807998_2809168_-	COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown]	NA|396aa|down_7|NZ_LT629973.1_2809248_2810436_-	COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown]	NA|830aa|down_8|NZ_LT629973.1_2810453_2812943_-	COG1289, COG1289, Predicted membrane protein [Function unknown]	NA|808aa|down_9|NZ_LT629973.1_2812939_2815363_-	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau
