assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002288045.1_ASM228804v1	NZ_CP016772	Candidatus Planktophila dulcis isolate MMS-IA-53 chromosome, complete genome	1	216822-216934	1	CRISPRCasFinder	no		cas3,DinG,WYL,cas4,DEDDh	Orphan	AAGAAGTCTGTAGCGAAGAAGAAGAC	26	0	0	NA	NA	NA	2	2	Orphan	cas3,DinG,WYL,cas4,DEDDh	NA|102aa|up_9|NZ_CP016772.1_209838_210144_+,NA|282aa|down_1|NZ_CP016772.1_219405_220251_-,NA|74aa|down_5|NZ_CP016772.1_223265_223487_+	NA|102aa|up_9|NZ_CP016772.1_209838_210144_+	NA	NA|590aa|up_8|NZ_CP016772.1_210136_211906_+	cd02874, GH18_CFLE_spore_hydrolase, Cortical fragment-lytic enzyme (CFLE) is a peptidoglycan hydrolase involved in  bacterial endospore germination	NA|98aa|up_7|NZ_CP016772.1_211905_212199_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|197aa|up_6|NZ_CP016772.1_212195_212786_+	pfam09438, DUF2017, Domain of unknown function (DUF2017)	NA|139aa|up_5|NZ_CP016772.1_212810_213227_+	cd08070, MPN_like, Mpr1p, Pad1p N-terminal (MPN) domains with catalytic isopeptidase activity (metal-binding)	NA|91aa|up_4|NZ_CP016772.1_213272_213545_+	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|317aa|up_3|NZ_CP016772.1_213547_214498_+	TIGR01136, Cysteine_synthase, cysteine synthase	NA|266aa|up_2|NZ_CP016772.1_214524_215322_+	PRK00865, PRK00865, glutamate racemase; Provisional	NA|242aa|up_1|NZ_CP016772.1_215352_216078_+	PRK00173, rph, ribonuclease PH; Reviewed	NA|206aa|up_0|NZ_CP016772.1_216078_216696_+	PRK00120, PRK00120, dITP/XTP pyrophosphatase; Reviewed	NA|582aa|down_0|NZ_CP016772.1_217654_219400_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|282aa|down_1|NZ_CP016772.1_219405_220251_-	NA	NA|216aa|down_2|NZ_CP016772.1_220315_220963_-	TIGR01065, Hypothetical_UPF0073_protein_yqfA	NA|404aa|down_3|NZ_CP016772.1_220972_222184_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|254aa|down_4|NZ_CP016772.1_222410_223172_+	cd15242, 7tm_Proteorhodopsin, green- and blue-light absorbing proteorhodopsins, member of the seven-transmembrane GPCR superfamily	NA|74aa|down_5|NZ_CP016772.1_223265_223487_+	NA	NA|438aa|down_6|NZ_CP016772.1_223483_224797_-	PRK07812, PRK07812, O-acetylhomoserine aminocarboxypropyltransferase; Validated	NA|109aa|down_7|NZ_CP016772.1_225033_225360_-	PRK07758, PRK07758, hypothetical protein; Provisional	NA|408aa|down_8|NZ_CP016772.1_225524_226748_-	COG4214, XylH, ABC-type xylose transport system, permease component [Carbohydrate transport and metabolism]	NA|250aa|down_9|NZ_CP016772.1_226747_227497_-	COG1129, MglA, ABC-type sugar transport system, ATPase component [Carbohydrate transport and metabolism]
GCF_002288045.1_ASM228804v1	NZ_CP016772	Candidatus Planktophila dulcis isolate MMS-IA-53 chromosome, complete genome	2	321039-321178	2	CRISPRCasFinder	no		cas3,DinG,WYL,cas4,DEDDh	Orphan	GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC	50	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,WYL,cas4,DEDDh	NA|204aa|up_5|NZ_CP016772.1_317380_317992_+,NA|167aa|up_3|NZ_CP016772.1_318448_318949_+,NA|160aa|up_2|NZ_CP016772.1_319164_319644_+,NA|223aa|up_1|NZ_CP016772.1_319823_320492_+,NA|150aa|up_0|NZ_CP016772.1_320500_320950_+,NA	NA|543aa|up_9|NZ_CP016772.1_311101_312730_+	cd01297, D-aminoacylase, D-aminoacylases (N-acyl-D-Amino acid amidohydrolases) catalyze the hydrolysis of N-acyl-D-amino acids to produce the corresponding D-amino acids, which are used as intermediates in the synthesis of pesticides, bioactive peptides, and antibiotics	NA|253aa|up_8|NZ_CP016772.1_312719_313478_+	cd05233, SDR_c, classical (c) SDRs	NA|896aa|up_7|NZ_CP016772.1_313525_316213_+	PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional	NA|181aa|up_6|NZ_CP016772.1_316826_317369_+	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|204aa|up_5|NZ_CP016772.1_317380_317992_+	NA	NA|138aa|up_4|NZ_CP016772.1_318001_318415_+	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]	NA|167aa|up_3|NZ_CP016772.1_318448_318949_+	NA	NA|160aa|up_2|NZ_CP016772.1_319164_319644_+	NA	NA|223aa|up_1|NZ_CP016772.1_319823_320492_+	NA	NA|150aa|up_0|NZ_CP016772.1_320500_320950_+	NA	NA|163aa|down_0|NZ_CP016772.1_321465_321954_+	cd07825, SRPBCC_7, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|108aa|down_1|NZ_CP016772.1_322139_322463_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|down_2|NZ_CP016772.1_322601_323552_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|336aa|down_3|NZ_CP016772.1_323505_324513_-	TIGR04380, hypothetical_protein_HOLDEFILI_04020, inositol 2-dehydrogenase	NA|341aa|down_4|NZ_CP016772.1_324549_325572_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|359aa|down_5|NZ_CP016772.1_325581_326658_-	pfam00923, TAL_FSA, Transaldolase/Fructose-6-phosphate aldolase	NA|305aa|down_6|NZ_CP016772.1_326662_327577_-	TIGR04379, myo-inositol_catabolism_protein, myo-inosose-2 dehydratase	NA|638aa|down_7|NZ_CP016772.1_327586_329500_-	TIGR04377, 3D-35/4-trihydroxycyclohexane-12-dione_hydrolase, 3,5/4-trihydroxycyclohexa-1,2-dione hydrolase	NA|307aa|down_8|NZ_CP016772.1_329501_330422_-	pfam04962, KduI, KduI/IolB family	NA|497aa|down_9|NZ_CP016772.1_330430_331921_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2
GCF_002288045.1_ASM228804v1	NZ_CP016772	Candidatus Planktophila dulcis isolate MMS-IA-53 chromosome, complete genome	3	935901-936000	3	CRISPRCasFinder	no		cas3,DinG,WYL,cas4,DEDDh	Orphan	TGCAGCTTTCTTTGCGCGCTTAGGCGCA	28	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,WYL,cas4,DEDDh	NA,NA	NA|588aa|up_9|NZ_CP016772.1_922287_924051_-	PRK07789, PRK07789, acetolactate synthase 1 catalytic subunit; Validated	NA|558aa|up_8|NZ_CP016772.1_924262_925936_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|502aa|up_7|NZ_CP016772.1_925945_927451_-	PRK05477, gatB, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatB	NA|499aa|up_6|NZ_CP016772.1_927450_928947_-	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|100aa|up_5|NZ_CP016772.1_928943_929243_-	PRK00034, gatC, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatC	NA|423aa|up_4|NZ_CP016772.1_929268_930537_-	PRK00197, proA, gamma-glutamyl phosphate reductase; Provisional	NA|372aa|up_3|NZ_CP016772.1_930533_931649_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|505aa|up_2|NZ_CP016772.1_931645_933160_-	PRK12296, obgE, GTPase CgtA; Reviewed	NA|85aa|up_1|NZ_CP016772.1_933255_933510_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|105aa|up_0|NZ_CP016772.1_933529_933844_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|387aa|down_0|NZ_CP016772.1_936302_937463_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|719aa|down_1|NZ_CP016772.1_937459_939616_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|174aa|down_2|NZ_CP016772.1_939612_940134_-	TIGR03426, shape_MreD, rod shape-determining protein MreD	NA|315aa|down_3|NZ_CP016772.1_940136_941081_-	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|340aa|down_4|NZ_CP016772.1_941091_942111_-	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|141aa|down_5|NZ_CP016772.1_942123_942546_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|112aa|down_6|NZ_CP016772.1_942558_942894_-	pfam14017, DUF4233, Protein of unknown function (DUF4233)	NA|453aa|down_7|NZ_CP016772.1_942893_944252_-	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|860aa|down_8|NZ_CP016772.1_944252_946832_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|433aa|down_9|NZ_CP016772.1_946854_948153_-	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX
GCF_002288045.1_ASM228804v1	NZ_CP016772	Candidatus Planktophila dulcis isolate MMS-IA-53 chromosome, complete genome	4	1013663-1013759	4	CRISPRCasFinder	no	cas3	cas3,DinG,WYL,cas4,DEDDh	Unclear	TGCAGATGTTCTTGAAGAGATGGAT	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DinG,WYL,cas4,DEDDh	NA|104aa|up_8|NZ_CP016772.1_1005959_1006271_-,NA	NA|528aa|up_9|NZ_CP016772.1_1004374_1005958_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|104aa|up_8|NZ_CP016772.1_1005959_1006271_-	NA	NA|290aa|up_7|NZ_CP016772.1_1006270_1007140_-	TIGR03445, mycothiol_MshB, N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha-D-glucopyranoside deacetylase	NA|394aa|up_6|NZ_CP016772.1_1007139_1008321_-	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated	NA|212aa|up_5|NZ_CP016772.1_1008376_1009012_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|84aa|up_4|NZ_CP016772.1_1009028_1009280_+	pfam11305, DUF3107, Protein of unknown function (DUF3107)	cas3|454aa|up_3|NZ_CP016772.1_1009280_1010642_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|204aa|up_2|NZ_CP016772.1_1010642_1011254_-	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion]	NA|282aa|up_1|NZ_CP016772.1_1011250_1012096_-	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|304aa|up_0|NZ_CP016772.1_1012092_1013004_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|165aa|down_0|NZ_CP016772.1_1014349_1014844_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|373aa|down_1|NZ_CP016772.1_1014814_1015933_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|103aa|down_2|NZ_CP016772.1_1015929_1016238_-	PRK01371, PRK01371, Sec-independent protein translocase protein TatB	NA|373aa|down_3|NZ_CP016772.1_1016257_1017376_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|209aa|down_4|NZ_CP016772.1_1017388_1018015_-	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|494aa|down_5|NZ_CP016772.1_1018011_1019493_-	cd00433, Peptidase_M17, Cytosol aminopeptidase family, N-terminal and catalytic domains	NA|60aa|down_6|NZ_CP016772.1_1019508_1019688_-	pfam11314, DUF3117, Protein of unknown function (DUF3117)	NA|67aa|down_7|NZ_CP016772.1_1019800_1020001_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|150aa|down_8|NZ_CP016772.1_1020053_1020503_-	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|180aa|down_9|NZ_CP016772.1_1020505_1021045_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein
