assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002287965.1_ASM228796v1	NZ_CP016770	Candidatus Planktophila dulcis isolate MMS-21-155 chromosome, complete genome	1	301955-302094	1	CRISPRCasFinder	no		cas3,DinG,WYL,cas4,DEDDh	Orphan	GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC	50	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,WYL,cas4,DEDDh	NA|204aa|up_5|NZ_CP016770.1_297978_298590_+,NA|180aa|up_3|NZ_CP016770.1_299046_299586_+,NA|223aa|up_1|NZ_CP016770.1_300738_301407_+,NA|156aa|up_0|NZ_CP016770.1_301415_301883_+,NA	NA|335aa|up_9|NZ_CP016770.1_294315_295320_+	cd05285, sorbitol_DH, Sorbitol dehydrogenase	NA|187aa|up_8|NZ_CP016770.1_295873_296434_-	pfam01541, GIY-YIG, GIY-YIG catalytic domain	NA|165aa|up_7|NZ_CP016770.1_296337_296832_+	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|181aa|up_6|NZ_CP016770.1_297424_297967_+	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|204aa|up_5|NZ_CP016770.1_297978_298590_+	NA	NA|138aa|up_4|NZ_CP016770.1_298599_299013_+	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]	NA|180aa|up_3|NZ_CP016770.1_299046_299586_+	NA	NA|111aa|up_2|NZ_CP016770.1_300306_300639_+	pfam07739, TipAS, TipAS antibiotic-recognition domain	NA|223aa|up_1|NZ_CP016770.1_300738_301407_+	NA	NA|156aa|up_0|NZ_CP016770.1_301415_301883_+	NA	NA|161aa|down_0|NZ_CP016770.1_302381_302864_+	cd07825, SRPBCC_7, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|108aa|down_1|NZ_CP016770.1_302873_303197_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|329aa|down_2|NZ_CP016770.1_303335_304322_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|336aa|down_3|NZ_CP016770.1_304239_305247_-	TIGR04380, hypothetical_protein_HOLDEFILI_04020, inositol 2-dehydrogenase	NA|341aa|down_4|NZ_CP016770.1_305283_306306_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|359aa|down_5|NZ_CP016770.1_306315_307392_-	pfam00923, TAL_FSA, Transaldolase/Fructose-6-phosphate aldolase	NA|305aa|down_6|NZ_CP016770.1_307396_308311_-	TIGR04379, myo-inositol_catabolism_protein, myo-inosose-2 dehydratase	NA|638aa|down_7|NZ_CP016770.1_308320_310234_-	TIGR04377, 3D-35/4-trihydroxycyclohexane-12-dione_hydrolase, 3,5/4-trihydroxycyclohexa-1,2-dione hydrolase	NA|307aa|down_8|NZ_CP016770.1_310235_311156_-	pfam04962, KduI, KduI/IolB family	NA|497aa|down_9|NZ_CP016770.1_311164_312655_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2
GCF_002287965.1_ASM228796v1	NZ_CP016770	Candidatus Planktophila dulcis isolate MMS-21-155 chromosome, complete genome	2	1006759-1006855	2	CRISPRCasFinder	no	cas3	cas3,DinG,WYL,cas4,DEDDh	Unclear	TGCAGATGTTCTTGAAGAGATGGAT	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DinG,WYL,cas4,DEDDh	NA|104aa|up_8|NZ_CP016770.1_999055_999367_-,NA	NA|528aa|up_9|NZ_CP016770.1_997470_999054_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|104aa|up_8|NZ_CP016770.1_999055_999367_-	NA	NA|290aa|up_7|NZ_CP016770.1_999366_1000236_-	TIGR03445, mycothiol_MshB, N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha-D-glucopyranoside deacetylase	NA|394aa|up_6|NZ_CP016770.1_1000235_1001417_-	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated	NA|212aa|up_5|NZ_CP016770.1_1001472_1002108_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|84aa|up_4|NZ_CP016770.1_1002124_1002376_+	pfam11305, DUF3107, Protein of unknown function (DUF3107)	cas3|454aa|up_3|NZ_CP016770.1_1002376_1003738_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|204aa|up_2|NZ_CP016770.1_1003738_1004350_-	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion]	NA|282aa|up_1|NZ_CP016770.1_1004346_1005192_-	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|304aa|up_0|NZ_CP016770.1_1005188_1006100_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|165aa|down_0|NZ_CP016770.1_1007445_1007940_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|373aa|down_1|NZ_CP016770.1_1007910_1009029_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|103aa|down_2|NZ_CP016770.1_1009025_1009334_-	PRK01371, PRK01371, Sec-independent protein translocase protein TatB	NA|373aa|down_3|NZ_CP016770.1_1009353_1010472_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|209aa|down_4|NZ_CP016770.1_1010484_1011111_-	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|494aa|down_5|NZ_CP016770.1_1011107_1012589_-	cd00433, Peptidase_M17, Cytosol aminopeptidase family, N-terminal and catalytic domains	NA|60aa|down_6|NZ_CP016770.1_1012604_1012784_-	pfam11314, DUF3117, Protein of unknown function (DUF3117)	NA|67aa|down_7|NZ_CP016770.1_1012896_1013097_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|150aa|down_8|NZ_CP016770.1_1013149_1013599_-	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|180aa|down_9|NZ_CP016770.1_1013601_1014141_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein
