assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009707485.1_ASM970748v1	NZ_CP046121	Tetrasphaera sp. HKS02 chromosome, complete genome	1	362333-362485	1	CRISPRCasFinder	no		csa3,DinG,cas4,WYL,DEDDh	Orphan	GCGCCGTGAGTTCAACCGTGACGA	24	0	0	NA	NA	NA	2	2	Orphan	csa3,DinG,cas4,WYL,DEDDh	NA|56aa|up_8|NZ_CP046121.1_353925_354093_-,NA|100aa|up_3|NZ_CP046121.1_358151_358451_+,NA|101aa|up_0|NZ_CP046121.1_361585_361888_+,NA|136aa|down_3|NZ_CP046121.1_367094_367502_+,NA|290aa|down_9|NZ_CP046121.1_371666_372536_-	NA|225aa|up_9|NZ_CP046121.1_353254_353929_-	pfam13631, Cytochrom_B_N_2, Cytochrome b(N-terminal)/b6/petB	NA|56aa|up_8|NZ_CP046121.1_353925_354093_-	NA	NA|146aa|up_7|NZ_CP046121.1_354276_354714_+	pfam12900, Pyridox_ox_2, Pyridoxamine 5'-phosphate oxidase	NA|148aa|up_6|NZ_CP046121.1_354727_355171_-	cd00293, USP_Like, Usp: Universal stress protein family	NA|581aa|up_5|NZ_CP046121.1_355303_357046_+	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|234aa|up_4|NZ_CP046121.1_357146_357848_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|100aa|up_3|NZ_CP046121.1_358151_358451_+	NA	NA|130aa|up_2|NZ_CP046121.1_358447_358837_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|332aa|up_1|NZ_CP046121.1_360429_361425_-	TIGR03558, oxido_grp_1, luciferase family oxidoreductase, group 1	NA|101aa|up_0|NZ_CP046121.1_361585_361888_+	NA	NA|185aa|down_0|NZ_CP046121.1_364268_364823_+	PRK00228, PRK00228, YqgE/AlgH family protein	NA|105aa|down_1|NZ_CP046121.1_364856_365171_+	pfam11238, DUF3039, Protein of unknown function (DUF3039)	NA|584aa|down_2|NZ_CP046121.1_365288_367040_+	COG1061, SSL2, DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair]	NA|136aa|down_3|NZ_CP046121.1_367094_367502_+	NA	NA|70aa|down_4|NZ_CP046121.1_367557_367767_+	pfam07311, Dodecin, Dodecin	NA|172aa|down_5|NZ_CP046121.1_367732_368248_-	COG1522, Lrp, Transcriptional regulators [Transcription]	NA|411aa|down_6|NZ_CP046121.1_368355_369588_+	TIGR01263, 4-hydroxyphenylpyruvate_dioxygenase, 4-hydroxyphenylpyruvate dioxygenase	NA|259aa|down_7|NZ_CP046121.1_369673_370450_+	pfam06271, RDD, RDD family	NA|404aa|down_8|NZ_CP046121.1_370453_371665_-	PLN02856, PLN02856, fumarylacetoacetase	NA|290aa|down_9|NZ_CP046121.1_371666_372536_-	NA
GCF_009707485.1_ASM970748v1	NZ_CP046121	Tetrasphaera sp. HKS02 chromosome, complete genome	2	1606519-1606625	2	CRISPRCasFinder	no		csa3,DinG,cas4,WYL,DEDDh	Orphan	GGGGCGAACGGGTTGTTGCCCGGGCGCGG	29	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas4,WYL,DEDDh	NA,NA|287aa|down_6|NZ_CP046121.1_1615606_1616467_-,NA|69aa|down_9|NZ_CP046121.1_1620761_1620968_+	NA|323aa|up_9|NZ_CP046121.1_1592633_1593602_-	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin	NA|667aa|up_8|NZ_CP046121.1_1593733_1595734_-	TIGR03960, radical_SAM_domain_protein, radical SAM family uncharacterized protein	NA|295aa|up_7|NZ_CP046121.1_1595871_1596756_+	cd00229, SGNH_hydrolase, SGNH_hydrolase, or GDSL_hydrolase, is a diverse family of lipases and esterases	NA|377aa|up_6|NZ_CP046121.1_1596833_1597964_-	COG0627, COG0627, Predicted esterase [General function prediction only]	NA|202aa|up_5|NZ_CP046121.1_1598096_1598702_+	PRK00312, pcm, protein-L-isoaspartate(D-aspartate) O-methyltransferase	NA|627aa|up_4|NZ_CP046121.1_1598698_1600579_-	COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid metabolism]	NA|365aa|up_3|NZ_CP046121.1_1600662_1601757_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|320aa|up_2|NZ_CP046121.1_1601819_1602779_-	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|305aa|up_1|NZ_CP046121.1_1602789_1603704_-	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|167aa|up_0|NZ_CP046121.1_1603690_1604191_-	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|102aa|down_0|NZ_CP046121.1_1607328_1607634_-	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|357aa|down_1|NZ_CP046121.1_1607729_1608800_-	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|187aa|down_2|NZ_CP046121.1_1608801_1609362_-	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|343aa|down_3|NZ_CP046121.1_1609472_1610501_+	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|306aa|down_4|NZ_CP046121.1_1610524_1611442_+	pfam04655, APH_6_hur, Aminoglycoside/hydroxyurea antibiotic resistance kinase	NA|231aa|down_5|NZ_CP046121.1_1611484_1612177_-	COG4565, CitB, Response regulator of citrate/malate metabolism [Transcription / Signal transduction mechanisms]	NA|287aa|down_6|NZ_CP046121.1_1615606_1616467_-	NA	NA|331aa|down_7|NZ_CP046121.1_1616550_1617543_-	pfam03816, LytR_cpsA_psr, Cell envelope-related transcriptional attenuator domain	NA|1030aa|down_8|NZ_CP046121.1_1617548_1620638_-	pfam13191, AAA_16, AAA ATPase domain	NA|69aa|down_9|NZ_CP046121.1_1620761_1620968_+	NA
GCF_009707485.1_ASM970748v1	NZ_CP046121	Tetrasphaera sp. HKS02 chromosome, complete genome	3	1753044-1753162	3	CRISPRCasFinder	no		csa3,DinG,cas4,WYL,DEDDh	Orphan	GGCGCTCGCTGCGATGCTCACTCGCCCTCCTCCTCG	36	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,cas4,WYL,DEDDh	NA,NA|67aa|down_3|NZ_CP046121.1_1759712_1759913_-	NA|259aa|up_9|NZ_CP046121.1_1740636_1741413_-	PRK06057, PRK06057, short chain dehydrogenase; Provisional	NA|462aa|up_8|NZ_CP046121.1_1741425_1742811_-	pfam00171, Aldedh, Aldehyde dehydrogenase family	NA|456aa|up_7|NZ_CP046121.1_1742918_1744286_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|260aa|up_6|NZ_CP046121.1_1744302_1745082_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|526aa|up_5|NZ_CP046121.1_1745096_1746674_-	TIGR00907, Choline_transport_protein, amino acid permease (GABA permease)	NA|371aa|up_4|NZ_CP046121.1_1746797_1747910_-	PRK13357, PRK13357, branched-chain amino acid aminotransferase; Provisional	NA|354aa|up_3|NZ_CP046121.1_1747999_1749061_-	PRK03437, PRK03437, 3-isopropylmalate dehydrogenase; Provisional	NA|529aa|up_2|NZ_CP046121.1_1749621_1751208_-	PRK13581, PRK13581, D-3-phosphoglycerate dehydrogenase; Provisional	NA|343aa|up_1|NZ_CP046121.1_1751378_1752407_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|171aa|up_0|NZ_CP046121.1_1752507_1753020_-	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|572aa|down_0|NZ_CP046121.1_1755217_1756933_-	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|175aa|down_1|NZ_CP046121.1_1757060_1757585_-	COG2062, SixA, Phosphohistidine phosphatase SixA [Signal transduction mechanisms]	NA|688aa|down_2|NZ_CP046121.1_1757652_1759716_-	COG1511, COG1511, Predicted membrane protein [Function unknown]	NA|67aa|down_3|NZ_CP046121.1_1759712_1759913_-	NA	NA|223aa|down_4|NZ_CP046121.1_1760414_1761083_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|370aa|down_5|NZ_CP046121.1_1761163_1762273_+	COG0031, CysK, Cysteine synthase [Amino acid transport and metabolism]	NA|95aa|down_6|NZ_CP046121.1_1762282_1762567_-	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|362aa|down_7|NZ_CP046121.1_1762563_1763649_-	cd15482, Sialidase_non-viral, Non-viral sialidases	NA|306aa|down_8|NZ_CP046121.1_1763750_1764668_-	cd12166, 2-Hacid_dh_7, Putative D-isomer specific 2-hydroxyacid dehydrogenases	NA|401aa|down_9|NZ_CP046121.1_1764699_1765902_+	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase
