assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000522985.1_ASM52298v1	NZ_CP007128	Gemmatirosa kalamazoonesis strain KBS708 chromosome, complete genome	1	1463089-1463156	1	CRISPRCasFinder	no		csa3,cas3,DinG,DEDDh,Cas9_archaeal	Orphan	ACGGCGGGACGTCTACGGCGGGACG	25	1	9	1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131|1463114-1463131	NZ_CP007128.1_1463085-1463102|NZ_CP007128.1_1463448-1463431|NZ_CP007128.1_1463477-1463460|NZ_CP007128.1_1104390-1104373|NZ_CP007128.1_1909118-1909101|NZ_CP007129.1_861949-861966|NZ_CP007129.1_120370-120353|NZ_CP007129.1_221967-221950|NZ_CP007129.1_460393-460376	NA	1	1	Orphan	csa3,cas3,DinG,DEDDh,Cas9_archaeal	NA|100aa|up_5|NZ_CP007128.1_1452741_1453041_+,NA|484aa|up_3|NZ_CP007128.1_1459137_1460589_+,NA|145aa|up_0|NZ_CP007128.1_1462605_1463040_+,NA|462aa|down_6|NZ_CP007128.1_1473125_1474511_+,NA|237aa|down_7|NZ_CP007128.1_1474507_1475218_+	NA|308aa|up_9|NZ_CP007128.1_1444922_1445846_+	PRK02308, uvsE, putative UV damage endonuclease; Provisional	NA|612aa|up_8|NZ_CP007128.1_1445997_1447833_+	cd05387, BY-kinase, bacterial tyrosine-kinase	NA|584aa|up_7|NZ_CP007128.1_1447943_1449695_-	PRK09247, PRK09247, ATP-dependent DNA ligase; Validated	NA|949aa|up_6|NZ_CP007128.1_1449691_1452538_-	COG3629, DnrI, DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]	NA|100aa|up_5|NZ_CP007128.1_1452741_1453041_+	NA	NA|1977aa|up_4|NZ_CP007128.1_1453055_1458986_+	pfam16640, Big_3_5, Bacterial Ig-like domain (group 3)	NA|484aa|up_3|NZ_CP007128.1_1459137_1460589_+	NA	NA|217aa|up_2|NZ_CP007128.1_1460639_1461290_-	pfam00052, Laminin_B, Laminin B (Domain IV)	NA|345aa|up_1|NZ_CP007128.1_1461464_1462499_-	TIGR04122, hypothetical_protein, putative exonuclease, DNA ligase-associated	NA|145aa|up_0|NZ_CP007128.1_1462605_1463040_+	NA	NA|162aa|down_0|NZ_CP007128.1_1463519_1464005_+	COG1956, COG1956, GAF domain-containing protein [Signal transduction mechanisms]	NA|209aa|down_1|NZ_CP007128.1_1464147_1464774_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|1038aa|down_2|NZ_CP007128.1_1465068_1468182_+	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|470aa|down_3|NZ_CP007128.1_1468181_1469591_+	cd08977, SusD, starch binding outer membrane protein SusD	NA|810aa|down_4|NZ_CP007128.1_1469766_1472196_+	TIGR03434, ADOP, Acidobacterial duplicated orphan permease	NA|200aa|down_5|NZ_CP007128.1_1472392_1472992_+	cd07366, 3MGA_Dioxygenase, Subunit B of the Class III Extradiol ring-cleavage dioxygenase, 3-O-Methylgallate Dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 3-O-Methylgallate	NA|462aa|down_6|NZ_CP007128.1_1473125_1474511_+	NA	NA|237aa|down_7|NZ_CP007128.1_1474507_1475218_+	NA	NA|213aa|down_8|NZ_CP007128.1_1475502_1476141_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|261aa|down_9|NZ_CP007128.1_1476137_1476920_+	PTZ00121, PTZ00121, MAEBL; Provisional
GCF_000522985.1_ASM52298v1	NZ_CP007128	Gemmatirosa kalamazoonesis strain KBS708 chromosome, complete genome	2	5262581-5262678	2	CRISPRCasFinder	no		csa3,cas3,DinG,DEDDh,Cas9_archaeal	Orphan	GCGCGTCGCGCTGCGCGCCGGCA	23	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DinG,DEDDh,Cas9_archaeal	NA,NA	NA|363aa|up_9|NZ_CP007128.1_5241755_5242844_-	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|831aa|up_8|NZ_CP007128.1_5242840_5245333_-	cd02850, E_set_Cellulase_N, N-terminal Early set domain associated with the catalytic domain of cellulase	NA|995aa|up_7|NZ_CP007128.1_5245368_5248353_-	pfam15979, Glyco_hydro_115, Glycosyl hydrolase family 115	NA|374aa|up_6|NZ_CP007128.1_5248382_5249504_-	pfam00331, Glyco_hydro_10, Glycosyl hydrolase family 10	NA|385aa|up_5|NZ_CP007128.1_5249526_5250681_-	pfam00331, Glyco_hydro_10, Glycosyl hydrolase family 10	NA|504aa|up_4|NZ_CP007128.1_5250688_5252200_-	COG2211, MelB, Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism]	NA|780aa|up_3|NZ_CP007128.1_5252338_5254678_-	PRK15098, PRK15098, beta-glucosidase BglX	NA|665aa|up_2|NZ_CP007128.1_5254787_5256782_-	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|581aa|up_1|NZ_CP007128.1_5257114_5258857_+	cd18617, GH43_XynB-like, Glycosyl hydrolase family 43, such as Bacteroides ovatus alpha-L-arabinofuranosidase (BoGH43, XynB)	NA|645aa|up_0|NZ_CP007128.1_5258856_5260791_+	pfam07944, Glyco_hydro_127, Beta-L-arabinofuranosidase, GH127	NA|691aa|down_0|NZ_CP007128.1_5262859_5264932_+	COG3534, AbfA, Alpha-L-arabinofuranosidase [Carbohydrate transport and metabolism]	NA|358aa|down_1|NZ_CP007128.1_5264944_5266018_+	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|902aa|down_2|NZ_CP007128.1_5266020_5268726_-	COG3250, LacZ, Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]	NA|156aa|down_3|NZ_CP007128.1_5268743_5269211_-	pfam07715, Plug, TonB-dependent Receptor Plug Domain	NA|546aa|down_4|NZ_CP007128.1_5269329_5270967_-	pfam07980, SusD_RagB, SusD family	NA|1018aa|down_5|NZ_CP007128.1_5270999_5274053_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|341aa|down_6|NZ_CP007128.1_5274455_5275478_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|743aa|down_7|NZ_CP007128.1_5275563_5277792_+	cd01820, PAF_acetylesterase_like, PAF_acetylhydrolase (PAF-AH)_like subfamily of SGNH-hydrolases	NA|383aa|down_8|NZ_CP007128.1_5277971_5279120_-	pfam06439, DUF1080, Domain of Unknown Function (DUF1080)	NA|542aa|down_9|NZ_CP007128.1_5279238_5280864_-	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]
