assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007747995.1_ASM774799v1	NZ_CP036276	Planctomycetes bacterium Mal52 chromosome, complete genome	1	70105-70205	1	CRISPRCasFinder	no		WYL,RT,DinG,DEDDh,cas3,csa3	Orphan	TGACGGTTGACCGCAGCAGTCGC	23	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,DinG,DEDDh,cas3,csa3	NA|150aa|up_8|NZ_CP036276.1_52568_53018_+,NA|145aa|up_6|NZ_CP036276.1_54331_54766_-,NA	NA|146aa|up_9|NZ_CP036276.1_51788_52226_+	cd07246, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|150aa|up_8|NZ_CP036276.1_52568_53018_+	NA	NA|315aa|up_7|NZ_CP036276.1_53289_54234_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|145aa|up_6|NZ_CP036276.1_54331_54766_-	NA	NA|398aa|up_5|NZ_CP036276.1_54920_56114_-	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|1010aa|up_4|NZ_CP036276.1_56319_59349_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|1052aa|up_3|NZ_CP036276.1_59468_62624_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|478aa|up_2|NZ_CP036276.1_62620_64054_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|353aa|up_1|NZ_CP036276.1_64248_65307_+	cd01902, Ntn_CGH, Choloylglycine hydrolase (CGH) is a bile salt-modifying enzyme that hydrolyzes non-peptide carbon-nitrogen bonds in choloylglycine and choloyltaurine, both of which are present in bile	NA|355aa|up_0|NZ_CP036276.1_65652_66717_+	smart00089, PKD, Repeats in polycystic kidney disease 1 (PKD1) and other proteins	NA|995aa|down_0|NZ_CP036276.1_71732_74717_-	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|258aa|down_1|NZ_CP036276.1_74814_75588_-	COG3836, HpcH, 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase [Carbohydrate transport and metabolism]	NA|447aa|down_2|NZ_CP036276.1_75852_77193_+	pfam07632, DUF1593, Protein of unknown function (DUF1593)	NA|397aa|down_3|NZ_CP036276.1_77218_78409_-	pfam09492, Pec_lyase, Pectic acid lyase	NA|201aa|down_4|NZ_CP036276.1_78884_79487_+	PRK14054, PRK14054, peptide-methionine (S)-S-oxide reductase	NA|422aa|down_5|NZ_CP036276.1_79630_80896_+	pfam05448, AXE1, Acetyl xylan esterase (AXE1)	NA|187aa|down_6|NZ_CP036276.1_80912_81473_+	pfam09346, SMI1_KNR4, SMI1 / KNR4 family (SUKH-1)	NA|423aa|down_7|NZ_CP036276.1_81873_83142_+	cd06114, EcCS_like, Escherichia coli (Ec) citrate synthase (CS) GltA_like	NA|322aa|down_8|NZ_CP036276.1_83252_84218_+	cd12821, EcCorA_ZntB-like, Escherichia coli CorA-Salmonella typhimurium ZntB_like family	NA|357aa|down_9|NZ_CP036276.1_84393_85464_-	PRK00772, PRK00772, 3-isopropylmalate dehydrogenase; Provisional
GCF_007747995.1_ASM774799v1	NZ_CP036276	Planctomycetes bacterium Mal52 chromosome, complete genome	2	2520005-2520104	2	CRISPRCasFinder	no		WYL,RT,DinG,DEDDh,cas3,csa3	Orphan	CAAATTAGTGATGCGGGGCTTGAGCAT	27	0	0	NA	NA	NA	1	1	Orphan	WYL,RT,DinG,DEDDh,cas3,csa3	NA|99aa|up_9|NZ_CP036276.1_2509590_2509887_-,NA|90aa|up_7|NZ_CP036276.1_2511765_2512035_-,NA|389aa|down_4|NZ_CP036276.1_2526251_2527418_-,NA|274aa|down_6|NZ_CP036276.1_2531281_2532103_-	NA|99aa|up_9|NZ_CP036276.1_2509590_2509887_-	NA	NA|410aa|up_8|NZ_CP036276.1_2510367_2511597_-	sd00006, TPR, Tetratricopeptide repeat	NA|90aa|up_7|NZ_CP036276.1_2511765_2512035_-	NA	NA|413aa|up_6|NZ_CP036276.1_2512329_2513568_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|261aa|up_5|NZ_CP036276.1_2513711_2514494_-	cd05233, SDR_c, classical (c) SDRs	NA|499aa|up_4|NZ_CP036276.1_2514815_2516312_+	cd07771, FGGY_RhuK, L-rhamnulose kinases; a subfamily of the FGGY family of carbohydrate kinases	NA|239aa|up_3|NZ_CP036276.1_2516351_2517068_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|377aa|up_2|NZ_CP036276.1_2517085_2518216_+	NF033189, internalin_A, class 1 internalin InlA	NA|281aa|up_1|NZ_CP036276.1_2518248_2519091_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|185aa|up_0|NZ_CP036276.1_2519123_2519678_+	sd00031, LRR_1, leucine-rich repeats	NA|257aa|down_0|NZ_CP036276.1_2520690_2521461_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|325aa|down_1|NZ_CP036276.1_2521784_2522759_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|277aa|down_2|NZ_CP036276.1_2523109_2523940_+	pfam04592, SelP_N, Selenoprotein P, N terminal region	NA|495aa|down_3|NZ_CP036276.1_2524184_2525669_+	cd16030, iduronate-2-sulfatase, iduronate-2-sulfatase	NA|389aa|down_4|NZ_CP036276.1_2526251_2527418_-	NA	NA|1014aa|down_5|NZ_CP036276.1_2527734_2530776_+	COG1524, COG1524, Uncharacterized proteins of the AP superfamily [General function prediction only]	NA|274aa|down_6|NZ_CP036276.1_2531281_2532103_-	NA	NA|385aa|down_7|NZ_CP036276.1_2532480_2533635_-	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|293aa|down_8|NZ_CP036276.1_2533923_2534802_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|472aa|down_9|NZ_CP036276.1_2534950_2536366_-	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
GCF_007747995.1_ASM774799v1	NZ_CP036276	Planctomycetes bacterium Mal52 chromosome, complete genome	3	2520221-2520535	3	CRISPRCasFinder	no		WYL,RT,DinG,DEDDh,cas3,csa3	Orphan	CAAATTAGTGATGCGGGGCTTGAGCAT	27	0	0	NA	NA	NA	4	4	Orphan	WYL,RT,DinG,DEDDh,cas3,csa3	NA|99aa|up_9|NZ_CP036276.1_2509590_2509887_-,NA|90aa|up_7|NZ_CP036276.1_2511765_2512035_-,NA|389aa|down_4|NZ_CP036276.1_2526251_2527418_-,NA|274aa|down_6|NZ_CP036276.1_2531281_2532103_-	NA|99aa|up_9|NZ_CP036276.1_2509590_2509887_-	NA	NA|410aa|up_8|NZ_CP036276.1_2510367_2511597_-	sd00006, TPR, Tetratricopeptide repeat	NA|90aa|up_7|NZ_CP036276.1_2511765_2512035_-	NA	NA|413aa|up_6|NZ_CP036276.1_2512329_2513568_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|261aa|up_5|NZ_CP036276.1_2513711_2514494_-	cd05233, SDR_c, classical (c) SDRs	NA|499aa|up_4|NZ_CP036276.1_2514815_2516312_+	cd07771, FGGY_RhuK, L-rhamnulose kinases; a subfamily of the FGGY family of carbohydrate kinases	NA|239aa|up_3|NZ_CP036276.1_2516351_2517068_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|377aa|up_2|NZ_CP036276.1_2517085_2518216_+	NF033189, internalin_A, class 1 internalin InlA	NA|281aa|up_1|NZ_CP036276.1_2518248_2519091_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|185aa|up_0|NZ_CP036276.1_2519123_2519678_+	sd00031, LRR_1, leucine-rich repeats	NA|257aa|down_0|NZ_CP036276.1_2520690_2521461_+	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|325aa|down_1|NZ_CP036276.1_2521784_2522759_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|277aa|down_2|NZ_CP036276.1_2523109_2523940_+	pfam04592, SelP_N, Selenoprotein P, N terminal region	NA|495aa|down_3|NZ_CP036276.1_2524184_2525669_+	cd16030, iduronate-2-sulfatase, iduronate-2-sulfatase	NA|389aa|down_4|NZ_CP036276.1_2526251_2527418_-	NA	NA|1014aa|down_5|NZ_CP036276.1_2527734_2530776_+	COG1524, COG1524, Uncharacterized proteins of the AP superfamily [General function prediction only]	NA|274aa|down_6|NZ_CP036276.1_2531281_2532103_-	NA	NA|385aa|down_7|NZ_CP036276.1_2532480_2533635_-	COG1194, MutY, A/G-specific DNA glycosylase [DNA replication, recombination, and repair]	NA|293aa|down_8|NZ_CP036276.1_2533923_2534802_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|472aa|down_9|NZ_CP036276.1_2534950_2536366_-	COG3206, GumC, Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
