assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002288225.1_ASM228822v1	NZ_CP016777	Candidatus Planktophila dulcis isolate MMS-IIA-65 chromosome, complete genome	1	293189-293328	1	CRISPRCasFinder	no		cas3,DinG,WYL,cas4,DEDDh	Orphan	GTTTGGTAGTAATCGGGACTTGGTTAATAACCAGGTCCCTTTTGCTTTCC	50	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,WYL,cas4,DEDDh	NA|204aa|up_7|NZ_CP016777.1_288306_288918_+,NA|180aa|up_5|NZ_CP016777.1_289374_289914_+,NA|160aa|up_2|NZ_CP016777.1_291314_291794_+,NA|223aa|up_1|NZ_CP016777.1_291973_292642_+,NA|150aa|up_0|NZ_CP016777.1_292650_293100_+,NA	NA|68aa|up_9|NZ_CP016777.1_286893_287097_+	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|181aa|up_8|NZ_CP016777.1_287752_288295_+	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|204aa|up_7|NZ_CP016777.1_288306_288918_+	NA	NA|138aa|up_6|NZ_CP016777.1_288927_289341_+	COG0723, QcrA, Rieske Fe-S protein [Energy production and conversion]	NA|180aa|up_5|NZ_CP016777.1_289374_289914_+	NA	NA|154aa|up_4|NZ_CP016777.1_290312_290774_+	pfam12680, SnoaL_2, SnoaL-like domain	NA|111aa|up_3|NZ_CP016777.1_290822_291155_+	pfam07739, TipAS, TipAS antibiotic-recognition domain	NA|160aa|up_2|NZ_CP016777.1_291314_291794_+	NA	NA|223aa|up_1|NZ_CP016777.1_291973_292642_+	NA	NA|150aa|up_0|NZ_CP016777.1_292650_293100_+	NA	NA|161aa|down_0|NZ_CP016777.1_293615_294098_+	cd07825, SRPBCC_7, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|108aa|down_1|NZ_CP016777.1_294107_294431_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|317aa|down_2|NZ_CP016777.1_294569_295520_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|336aa|down_3|NZ_CP016777.1_295473_296481_-	TIGR04380, hypothetical_protein_HOLDEFILI_04020, inositol 2-dehydrogenase	NA|341aa|down_4|NZ_CP016777.1_296517_297540_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|359aa|down_5|NZ_CP016777.1_297549_298626_-	pfam00923, TAL_FSA, Transaldolase/Fructose-6-phosphate aldolase	NA|305aa|down_6|NZ_CP016777.1_298630_299545_-	TIGR04379, myo-inositol_catabolism_protein, myo-inosose-2 dehydratase	NA|638aa|down_7|NZ_CP016777.1_299554_301468_-	TIGR04377, 3D-35/4-trihydroxycyclohexane-12-dione_hydrolase, 3,5/4-trihydroxycyclohexa-1,2-dione hydrolase	NA|307aa|down_8|NZ_CP016777.1_301469_302390_-	pfam04962, KduI, KduI/IolB family	NA|497aa|down_9|NZ_CP016777.1_302398_303889_-	cd07085, ALDH_F6_MMSDH, Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2
GCF_002288225.1_ASM228822v1	NZ_CP016777	Candidatus Planktophila dulcis isolate MMS-IIA-65 chromosome, complete genome	2	993027-993123	2	CRISPRCasFinder	no	cas3	cas3,DinG,WYL,cas4,DEDDh	Unclear	TGCAGATGTTCTTGAAGAGATGGAT	25	0	0	NA	NA	NA	1	1	Unclear	cas3,DinG,WYL,cas4,DEDDh	NA|104aa|up_8|NZ_CP016777.1_985323_985635_-,NA	NA|528aa|up_9|NZ_CP016777.1_983738_985322_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|104aa|up_8|NZ_CP016777.1_985323_985635_-	NA	NA|290aa|up_7|NZ_CP016777.1_985634_986504_-	TIGR03445, mycothiol_MshB, N-acetyl-1-D-myo-inositol-2-amino-2-deoxy-alpha-D-glucopyranoside deacetylase	NA|394aa|up_6|NZ_CP016777.1_986503_987685_-	PRK07878, PRK07878, molybdopterin biosynthesis-like protein MoeZ; Validated	NA|212aa|up_5|NZ_CP016777.1_987740_988376_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|84aa|up_4|NZ_CP016777.1_988392_988644_+	pfam11305, DUF3107, Protein of unknown function (DUF3107)	cas3|454aa|up_3|NZ_CP016777.1_988644_990006_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|204aa|up_2|NZ_CP016777.1_990006_990618_-	COG2095, MarC, Multiple antibiotic transporter [Intracellular trafficking and secretion]	NA|282aa|up_1|NZ_CP016777.1_990614_991460_-	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|304aa|up_0|NZ_CP016777.1_991456_992368_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|165aa|down_0|NZ_CP016777.1_993713_994208_+	pfam06210, DUF1003, Protein of unknown function (DUF1003)	NA|373aa|down_1|NZ_CP016777.1_994178_995297_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|103aa|down_2|NZ_CP016777.1_995293_995602_-	PRK01371, PRK01371, Sec-independent protein translocase protein TatB	NA|373aa|down_3|NZ_CP016777.1_995621_996740_-	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|209aa|down_4|NZ_CP016777.1_996752_997379_-	COG4122, COG4122, Predicted O-methyltransferase [General function prediction only]	NA|494aa|down_5|NZ_CP016777.1_997375_998857_-	cd00433, Peptidase_M17, Cytosol aminopeptidase family, N-terminal and catalytic domains	NA|60aa|down_6|NZ_CP016777.1_998872_999052_-	pfam11314, DUF3117, Protein of unknown function (DUF3117)	NA|67aa|down_7|NZ_CP016777.1_999164_999365_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|150aa|down_8|NZ_CP016777.1_999417_999867_-	cd07812, SRPBCC, START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC (SRPBCC) ligand-binding domain superfamily	NA|180aa|down_9|NZ_CP016777.1_999869_1000409_-	TIGR00730, LOG_family_protein_YJL055W, TIGR00730 family protein
