assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	1	100539-100616	1	CRISPRCasFinder	no	Cas9_archaeal	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Type II-A, or Type II-C?, Type II-B	AAGCGCCTGCACCCATCCTACAA	23	0	0	NA	NA	NA	1	1	TypeII-A,orTypeII-C?,TypeII-B	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|75aa|up_4|NZ_AP018052.1_96147_96372_+,NA|138aa|down_6|NZ_AP018052.1_109953_110367_+	NA|359aa|up_9|NZ_AP018052.1_87445_88522_+	pfam02253, PLA1, Phospholipase A1	NA|735aa|up_8|NZ_AP018052.1_88993_91198_+	PRK05580, PRK05580, primosome assembly protein PriA; Validated	NA|586aa|up_7|NZ_AP018052.1_91397_93155_+	COG0018, ArgS, Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]	NA|191aa|up_6|NZ_AP018052.1_93354_93927_+	COG3087, FtsN, Cell division protein [Cell division and chromosome partitioning]	NA|689aa|up_5|NZ_AP018052.1_93995_96062_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|75aa|up_4|NZ_AP018052.1_96147_96372_+	NA	NA|91aa|up_3|NZ_AP018052.1_96383_96656_-	cd02227, cupin_TM1112-like, Thermotoga maritima TM1112 and related proteins, cupin domain	NA|281aa|up_2|NZ_AP018052.1_96658_97501_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|625aa|up_1|NZ_AP018052.1_97560_99435_-	PRK05354, PRK05354, biosynthetic arginine decarboxylase	NA|286aa|up_0|NZ_AP018052.1_99587_100445_+	PRK00811, PRK00811, polyamine aminopropyltransferase	NA|271aa|down_0|NZ_AP018052.1_100623_101436_-	pfam03649, UPF0014, Uncharacterized protein family (UPF0014)	NA|201aa|down_1|NZ_AP018052.1_101432_102035_-	COG4619, COG4619, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|368aa|down_2|NZ_AP018052.1_102234_103338_-	cd13604, PBP2_TRAP_ketoacid_lactate_like, Substrate-binding domain of an alpha-keto acid binding Tripartite ATP-independent Periplasmic transporter and related proteins; the type 2 periplasmic-binding protein fold	NA|959aa|down_3|NZ_AP018052.1_103561_106438_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|202aa|down_4|NZ_AP018052.1_106729_107335_-	TIGR04565, hypothetical_protein_N47_G32130, outer membrane beta-barrel protein	NA|735aa|down_5|NZ_AP018052.1_107442_109647_-	PRK11773, uvrD, DNA-dependent helicase II; Provisional	NA|138aa|down_6|NZ_AP018052.1_109953_110367_+	NA	NA|137aa|down_7|NZ_AP018052.1_110641_111052_+	COG3153, COG3153, Predicted acetyltransferase [General function prediction only]	NA|373aa|down_8|NZ_AP018052.1_111150_112269_-	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|916aa|down_9|NZ_AP018052.1_112271_115019_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	2	323270-323363	2	CRISPRCasFinder	no		Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	GATGGGTTGCGCTGCGCTCCACCCATCCT	29	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|30aa|up_0|NZ_AP018052.1_323153_323243_+,NA|243aa|down_3|NZ_AP018052.1_327646_328375_+	NA|277aa|up_9|NZ_AP018052.1_315395_316226_-	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|357aa|up_8|NZ_AP018052.1_316397_317468_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|418aa|up_7|NZ_AP018052.1_317611_318865_-	cd06828, PLPDE_III_DapDC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Diaminopimelate Decarboxylase	NA|45aa|up_6|NZ_AP018052.1_318872_319007_-	pfam13627, LPAM_2, Prokaryotic lipoprotein-attachment site	NA|221aa|up_5|NZ_AP018052.1_319100_319763_+	pfam02230, Abhydrolase_2, Phospholipase/Carboxylesterase	NA|350aa|up_4|NZ_AP018052.1_319853_320903_+	cd14797, DUF302, Uncharacterized domain family DUF302	NA|154aa|up_3|NZ_AP018052.1_320899_321361_+	cd14797, DUF302, Uncharacterized domain family DUF302	NA|178aa|up_2|NZ_AP018052.1_321518_322052_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|82aa|up_1|NZ_AP018052.1_322093_322339_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|30aa|up_0|NZ_AP018052.1_323153_323243_+	NA	NA|954aa|down_0|NZ_AP018052.1_323416_326278_-	PRK09450, cyaA, class I adenylate cyclase	NA|249aa|down_1|NZ_AP018052.1_326317_327064_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|122aa|down_2|NZ_AP018052.1_327184_327550_-	cd11537, NTP-PPase_RS21-C6_like, Nucleoside Triphosphate Pyrophosphohydrolase (EC 3	NA|243aa|down_3|NZ_AP018052.1_327646_328375_+	NA	NA|907aa|down_4|NZ_AP018052.1_328421_331142_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|465aa|down_5|NZ_AP018052.1_331131_332526_-	PRK00855, PRK00855, argininosuccinate lyase; Provisional	NA|357aa|down_6|NZ_AP018052.1_332654_333725_+	COG2972, COG2972, Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms]	NA|244aa|down_7|NZ_AP018052.1_333721_334453_+	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|163aa|down_8|NZ_AP018052.1_334716_335205_-	cd02951, SoxW, SoxW family; SoxW is a bacterial periplasmic TRX, containing a redox active CXXC motif, encoded by a genetic locus (sox operon) involved in thiosulfate oxidation	NA|312aa|down_9|NZ_AP018052.1_335283_336219_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	3	791940-792062	3	CRISPRCasFinder	no		Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	AAGAAGAAGGTCGCGAAGAAGAA	23	1	1	792011-792038	NZ_AP018052.1_1862507-1862480	NA	2	2	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|230aa|up_7|NZ_AP018052.1_783380_784070_-,NA|165aa|down_1|NZ_AP018052.1_793851_794346_+	NA|779aa|up_9|NZ_AP018052.1_779679_782016_+	pfam07793, DUF1631, Protein of unknown function (DUF1631)	NA|341aa|up_8|NZ_AP018052.1_782165_783188_-	pfam04338, DUF481, Protein of unknown function, DUF481	NA|230aa|up_7|NZ_AP018052.1_783380_784070_-	NA	NA|128aa|up_6|NZ_AP018052.1_784269_784653_+	pfam01627, Hpt, Hpt domain	NA|159aa|up_5|NZ_AP018052.1_784694_785171_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|78aa|up_4|NZ_AP018052.1_785404_785638_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|147aa|up_3|NZ_AP018052.1_785639_786080_+	cd00756, MoaE, MoaE family	NA|183aa|up_2|NZ_AP018052.1_786283_786832_-	PRK01250, PRK01250, inorganic diphosphatase	NA|673aa|up_1|NZ_AP018052.1_786983_789002_-	PRK00733, hppA, membrane-bound proton-translocating pyrophosphatase; Validated	NA|425aa|up_0|NZ_AP018052.1_789361_790636_-	PRK14072, PRK14072, diphosphate--fructose-6-phosphate 1-phosphotransferase	NA|371aa|down_0|NZ_AP018052.1_792526_793639_-	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|165aa|down_1|NZ_AP018052.1_793851_794346_+	NA	NA|462aa|down_2|NZ_AP018052.1_794533_795919_+	TIGR01081, mpl, UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase	NA|210aa|down_3|NZ_AP018052.1_796113_796743_+	PRK05920, PRK05920, aromatic acid decarboxylase; Validated	NA|193aa|down_4|NZ_AP018052.1_796894_797473_+	COG2802, COG2802, Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only]	NA|669aa|down_5|NZ_AP018052.1_797593_799600_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|427aa|down_6|NZ_AP018052.1_799609_800890_-	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|209aa|down_7|NZ_AP018052.1_801014_801641_-	PRK00043, thiE, thiamine phosphate synthase	NA|267aa|down_8|NZ_AP018052.1_801768_802569_-	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|185aa|down_9|NZ_AP018052.1_803144_803699_-	pfam00596, Aldolase_II, Class II Aldolase and Adducin N-terminal domain
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	4	792126-792460	4	CRISPRCasFinder	no		Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	AAGAAGAAGGTCGCGAAGAAGAA	23	1	2	792416-792437|792416-792437	NZ_AP018052.1_792446-792467|NZ_AP018052.1_792461-792482	NA	6	6	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|230aa|up_7|NZ_AP018052.1_783380_784070_-,NA|165aa|down_1|NZ_AP018052.1_793851_794346_+	NA|779aa|up_9|NZ_AP018052.1_779679_782016_+	pfam07793, DUF1631, Protein of unknown function (DUF1631)	NA|341aa|up_8|NZ_AP018052.1_782165_783188_-	pfam04338, DUF481, Protein of unknown function, DUF481	NA|230aa|up_7|NZ_AP018052.1_783380_784070_-	NA	NA|128aa|up_6|NZ_AP018052.1_784269_784653_+	pfam01627, Hpt, Hpt domain	NA|159aa|up_5|NZ_AP018052.1_784694_785171_-	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|78aa|up_4|NZ_AP018052.1_785404_785638_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|147aa|up_3|NZ_AP018052.1_785639_786080_+	cd00756, MoaE, MoaE family	NA|183aa|up_2|NZ_AP018052.1_786283_786832_-	PRK01250, PRK01250, inorganic diphosphatase	NA|673aa|up_1|NZ_AP018052.1_786983_789002_-	PRK00733, hppA, membrane-bound proton-translocating pyrophosphatase; Validated	NA|425aa|up_0|NZ_AP018052.1_789361_790636_-	PRK14072, PRK14072, diphosphate--fructose-6-phosphate 1-phosphotransferase	NA|371aa|down_0|NZ_AP018052.1_792526_793639_-	PRK01642, cls, cardiolipin synthetase; Reviewed	NA|165aa|down_1|NZ_AP018052.1_793851_794346_+	NA	NA|462aa|down_2|NZ_AP018052.1_794533_795919_+	TIGR01081, mpl, UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase	NA|210aa|down_3|NZ_AP018052.1_796113_796743_+	PRK05920, PRK05920, aromatic acid decarboxylase; Validated	NA|193aa|down_4|NZ_AP018052.1_796894_797473_+	COG2802, COG2802, Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only]	NA|669aa|down_5|NZ_AP018052.1_797593_799600_-	COG0586, DedA, Uncharacterized membrane-associated protein [Function unknown]	NA|427aa|down_6|NZ_AP018052.1_799609_800890_-	PRK00062, PRK00062, glutamate-1-semialdehyde 2,1-aminomutase	NA|209aa|down_7|NZ_AP018052.1_801014_801641_-	PRK00043, thiE, thiamine phosphate synthase	NA|267aa|down_8|NZ_AP018052.1_801768_802569_-	cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate kinase (HMPP-kinase) catalyzes two consecutive phosphorylation steps in the thiamine phosphate biosynthesis pathway, leading to the synthesis of vitamin B1	NA|185aa|down_9|NZ_AP018052.1_803144_803699_-	pfam00596, Aldolase_II, Class II Aldolase and Adducin N-terminal domain
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	5	1407314-1413105	5,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Type I-F	GTTCGCCGCCGCGTAGGCGGCTTAGAAA,GTTCGCCGCCGCGTAGGCGGCTTAGAAA,GTTCGCCGCCGCGTAGGCGGCTTAGAAA	28,28,28	0	0	NA	NA	I-F:I-F:I-F	96,96,78	96	TypeI-F	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|220aa|up_7|NZ_AP018052.1_1396432_1397092_+,cas8f|474aa|up_4|NZ_AP018052.1_1402102_1403524_+,NA|142aa|down_4|NZ_AP018052.1_1420157_1420583_-,NA|308aa|down_5|NZ_AP018052.1_1420560_1421484_-,NA|130aa|down_6|NZ_AP018052.1_1421608_1421998_-,NA|274aa|down_9|NZ_AP018052.1_1425174_1425996_+	NA|1036aa|up_9|NZ_AP018052.1_1392896_1396004_+	COG3696, COG3696, Putative silver efflux pump [Inorganic ion transport and metabolism]	NA|120aa|up_8|NZ_AP018052.1_1396056_1396416_+	PRK09272, PRK09272, hypothetical protein; Provisional	NA|220aa|up_7|NZ_AP018052.1_1396432_1397092_+	NA	cas3f|1059aa|up_6|NZ_AP018052.1_1397942_1401119_-	cd09673, Cas3_Cas2_I-F, CRISPR/Cas system-associated protein Cas3/Cas2	NA|278aa|up_5|NZ_AP018052.1_1401123_1401957_-	pfam13280, WYL, WYL domain	cas8f|474aa|up_4|NZ_AP018052.1_1402102_1403524_+	NA	cas5f|332aa|up_3|NZ_AP018052.1_1403523_1404519_+	cd09736, Csy2_I-F, CRISPR/Cas system-associated RAMP superfamily protein Csy2	cas7f|379aa|up_2|NZ_AP018052.1_1404548_1405685_+	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas6f|199aa|up_1|NZ_AP018052.1_1405684_1406281_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas1|313aa|up_0|NZ_AP018052.1_1406277_1407216_+	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|225aa|down_0|NZ_AP018052.1_1413504_1414179_+	pfam02586, SRAP, SOS response associated peptidase (SRAP)	NA|402aa|down_1|NZ_AP018052.1_1414735_1415941_+	pfam14088, DUF4268, Domain of unknown function (DUF4268)	NA|330aa|down_2|NZ_AP018052.1_1415995_1416985_-	cd00143, PP2Cc, Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity	NA|709aa|down_3|NZ_AP018052.1_1417926_1420053_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]	NA|142aa|down_4|NZ_AP018052.1_1420157_1420583_-	NA	NA|308aa|down_5|NZ_AP018052.1_1420560_1421484_-	NA	NA|130aa|down_6|NZ_AP018052.1_1421608_1421998_-	NA	WYL|339aa|down_7|NZ_AP018052.1_1422102_1423119_+	pfam13280, WYL, WYL domain	NA|649aa|down_8|NZ_AP018052.1_1423240_1425187_+	pfam11074, DUF2779, Domain of unknown function(DUF2779)	NA|274aa|down_9|NZ_AP018052.1_1425174_1425996_+	NA
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	6	1456331-1456450	6	CRISPRCasFinder	no	DEDDh	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Unclear	CCACTTCGGGCCCGAAGTGGGCCAATACGGGCCACTTT	38	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|113aa|up_2|NZ_AP018052.1_1453821_1454160_-,NA|90aa|up_0|NZ_AP018052.1_1455594_1455864_+,NA|121aa|down_1|NZ_AP018052.1_1459320_1459683_+,NA|75aa|down_2|NZ_AP018052.1_1459922_1460147_+,NA|110aa|down_8|NZ_AP018052.1_1463537_1463867_-	NA|189aa|up_9|NZ_AP018052.1_1445825_1446392_+	pfam02586, SRAP, SOS response associated peptidase (SRAP)	NA|224aa|up_8|NZ_AP018052.1_1446962_1447634_+	COG2356, EndA, Endonuclease I [DNA replication, recombination, and repair]	NA|200aa|up_7|NZ_AP018052.1_1447982_1448582_+	pfam08937, DUF1863, MTH538 TIR-like domain (DUF1863)	NA|300aa|up_6|NZ_AP018052.1_1448578_1449478_+	pfam18184, SLATT_3, SMODS and SLOG-associating 2TM effector domain 3	NA|131aa|up_5|NZ_AP018052.1_1449609_1450002_+	pfam08937, DUF1863, MTH538 TIR-like domain (DUF1863)	NA|440aa|up_4|NZ_AP018052.1_1450014_1451334_+	pfam13281, DUF4071, Domain of unknown function (DUF4071)	NA|474aa|up_3|NZ_AP018052.1_1451430_1452852_+	pfam18862, ApeA_NTD1, ApeA N-terminal domain 1	NA|113aa|up_2|NZ_AP018052.1_1453821_1454160_-	NA	NA|225aa|up_1|NZ_AP018052.1_1454340_1455015_+	pfam02586, SRAP, SOS response associated peptidase (SRAP)	NA|90aa|up_0|NZ_AP018052.1_1455594_1455864_+	NA	NA|433aa|down_0|NZ_AP018052.1_1456756_1458055_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|121aa|down_1|NZ_AP018052.1_1459320_1459683_+	NA	NA|75aa|down_2|NZ_AP018052.1_1459922_1460147_+	NA	NA|68aa|down_3|NZ_AP018052.1_1460270_1460474_+	TIGR02293, conserved_protein_of_unknown_function, putative toxin-antitoxin system antitoxin component, TIGR02293 family	NA|270aa|down_4|NZ_AP018052.1_1460482_1461292_+	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|128aa|down_5|NZ_AP018052.1_1461298_1461682_+	COG3654, Doc, Prophage maintenance system killer protein [General function prediction only]	NA|134aa|down_6|NZ_AP018052.1_1461940_1462342_+	pfam09722, DUF2384, Protein of unknown function (DUF2384)	NA|355aa|down_7|NZ_AP018052.1_1462391_1463456_-	smart00974, T5orf172, This entry represents the putative helicase A859L	NA|110aa|down_8|NZ_AP018052.1_1463537_1463867_-	NA	NA|118aa|down_9|NZ_AP018052.1_1463866_1464220_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	7	2152849-2152931	7	CRISPRCasFinder	no		Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Orphan	CGGCGGAATCCTGTCGGGTTACGCCCT	27	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|185aa|up_6|NZ_AP018052.1_2144249_2144804_+,NA|122aa|up_5|NZ_AP018052.1_2144933_2145299_+,NA	NA|337aa|up_9|NZ_AP018052.1_2141198_2142209_-	PRK00984, truD, tRNA pseudouridine synthase D; Reviewed	NA|247aa|up_8|NZ_AP018052.1_2142380_2143121_+	pfam01904, DUF72, Protein of unknown function DUF72	NA|292aa|up_7|NZ_AP018052.1_2143124_2144000_-	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|185aa|up_6|NZ_AP018052.1_2144249_2144804_+	NA	NA|122aa|up_5|NZ_AP018052.1_2144933_2145299_+	NA	NA|207aa|up_4|NZ_AP018052.1_2145318_2145939_-	COG4764, COG4764, Uncharacterized protein conserved in bacteria [Function unknown]	NA|797aa|up_3|NZ_AP018052.1_2146155_2148546_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|179aa|up_2|NZ_AP018052.1_2148792_2149329_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|419aa|up_1|NZ_AP018052.1_2149494_2150751_-	cd17260, RMtype1_S_EcoEI-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|494aa|up_0|NZ_AP018052.1_2150747_2152229_-	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|158aa|down_0|NZ_AP018052.1_2153016_2153490_-	pfam02542, YgbB, YgbB family	NA|236aa|down_1|NZ_AP018052.1_2153486_2154194_-	PRK00155, ispD, D-ribitol-5-phosphate cytidylyltransferase	NA|98aa|down_2|NZ_AP018052.1_2154298_2154592_-	PRK00888, ftsB, cell division protein FtsB; Reviewed	NA|428aa|down_3|NZ_AP018052.1_2154607_2155891_-	PRK00077, eno, enolase; Provisional	NA|279aa|down_4|NZ_AP018052.1_2156109_2156946_-	PRK05198, PRK05198, 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional	NA|554aa|down_5|NZ_AP018052.1_2156979_2158641_-	PRK05380, pyrG, CTP synthetase; Validated	NA|321aa|down_6|NZ_AP018052.1_2160229_2161192_-	PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated	NA|1168aa|down_7|NZ_AP018052.1_2161452_2164956_-	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|401aa|down_8|NZ_AP018052.1_2165239_2166442_-	PRK10535, PRK10535, macrolide ABC transporter ATP-binding protein/permease MacB	NA|237aa|down_9|NZ_AP018052.1_2166441_2167152_-	COG1136, SalX, ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms]
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	8	2533732-2533836	8	CRISPRCasFinder	no	DEDDh	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Unclear	CTGAAGGCGTAGGATGGGTGGAGGCGC	27	0	0	NA	NA	NA	1	1	Orphan	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA,NA	NA|197aa|up_9|NZ_AP018052.1_2521921_2522512_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|108aa|up_8|NZ_AP018052.1_2522529_2522853_-	PRK00153, PRK00153, YbaB/EbfC family nucleoid-associated protein	NA|539aa|up_7|NZ_AP018052.1_2522910_2524527_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|130aa|up_6|NZ_AP018052.1_2524939_2525329_-	pfam13698, DUF4156, Domain of unknown function (DUF4156)	NA|325aa|up_5|NZ_AP018052.1_2525737_2526712_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|283aa|up_4|NZ_AP018052.1_2526708_2527557_+	PLN02244, PLN02244, tocopherol O-methyltransferase	NA|392aa|up_3|NZ_AP018052.1_2527667_2528843_+	PRK05250, PRK05250, S-adenosylmethionine synthetase; Validated	NA|646aa|up_2|NZ_AP018052.1_2528998_2530936_+	COG3211, PhoX, Predicted phosphatase [General function prediction only]	NA|286aa|up_1|NZ_AP018052.1_2531027_2531885_-	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|209aa|up_0|NZ_AP018052.1_2532067_2532694_-	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|748aa|down_0|NZ_AP018052.1_2534135_2536379_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|431aa|down_1|NZ_AP018052.1_2536527_2537820_+	cd03817, GT4_UGDG-like, UDP-Glc:1,2-diacylglycerol 3-a-glucosyltransferase and similar proteins	NA|104aa|down_2|NZ_AP018052.1_2538032_2538344_+	pfam18692, DUF5640, Family of unknown function (DUF5640)	NA|837aa|down_3|NZ_AP018052.1_2538431_2540942_-	pfam10070, DUF2309, Uncharacterized protein conserved in bacteria (DUF2309)	NA|548aa|down_4|NZ_AP018052.1_2540987_2542631_-	PRK08168, PRK08168, NADH-quinone oxidoreductase subunit L	NA|260aa|down_5|NZ_AP018052.1_2542944_2543724_-	COG1397, DraG, ADP-ribosylglycohydrolase [Posttranslational modification, protein turnover, chaperones]	NA|369aa|down_6|NZ_AP018052.1_2543863_2544970_+	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|174aa|down_7|NZ_AP018052.1_2544998_2545520_-	pfam03843, Slp, Outer membrane lipoprotein Slp family	NA|392aa|down_8|NZ_AP018052.1_2545675_2546851_+	pfam03486, HI0933_like, HI0933-like protein	NA|386aa|down_9|NZ_AP018052.1_2547304_2548462_+	COG3437, COG3437, Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms]
GCF_002356355.1_ASM235635v1	NZ_AP018052	Thiohalobacter thiocyanaticus strain FOKN1	9	2966522-2971622	2,9,2,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3,WYL	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	Type I-E	CGGTTCATCCCCGCAGGCGCGGGGAACAC,CGGTTCATCCCCGCAGGCGCGGGGAACAC,CGGTTCATCCCCGCAGGCGCGGGGAACAC,GCACGGTTCATCCCCGCAGGCGCGGGGAACACGACT,CGGTTCATCCCCGCAGGCGCGGGGAACAC	29,29,29,36,29	1	1	2970158-2970189	NZ_AP018052.1_307350-307319	I-E:I-E:I-E:I-E:I-E	34,83,83,34,34	83	TypeI-E	Cas9_archaeal,csa3,cas3f,cas8f,cas5f,cas7f,cas6f,cas1,WYL,DEDDh,DinG,cas6,RT,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,cas3	NA|230aa|up_5|NZ_AP018052.1_2960809_2961499_+,NA|204aa|up_4|NZ_AP018052.1_2961502_2962114_-,NA	NA|722aa|up_9|NZ_AP018052.1_2954300_2956466_-	COG1444, COG1444, Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only]	NA|198aa|up_8|NZ_AP018052.1_2956699_2957293_+	cd10033, UDG_like, uncharacterized family of the uracil-DNA glycosylase superfamily	NA|766aa|up_7|NZ_AP018052.1_2957313_2959611_-	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|208aa|up_6|NZ_AP018052.1_2960144_2960768_+	TIGR03370, VPLPA-CTERM, VPLPA-CTERM protein sorting domain	NA|230aa|up_5|NZ_AP018052.1_2960809_2961499_+	NA	NA|204aa|up_4|NZ_AP018052.1_2961502_2962114_-	NA	NA|202aa|up_3|NZ_AP018052.1_2962119_2962725_-	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|488aa|up_2|NZ_AP018052.1_2962721_2964185_-	cd00919, Heme_Cu_Oxidase_I, Heme-copper oxidase subunit I	NA|184aa|up_1|NZ_AP018052.1_2964177_2964729_-	cd13916, CuRO_HCO_II_like_1, Uncharacterized subfamily with similarity to Heme-copper oxidase subunit II cupredoxin domain	NA|455aa|up_0|NZ_AP018052.1_2964925_2966290_+	COG3829, RocR, Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains [Transcription / Signal transduction mechanisms]	cas2|103aa|down_0|NZ_AP018052.1_2971714_2972023_-	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	cas1|304aa|down_1|NZ_AP018052.1_2972003_2972915_-	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas6e|226aa|down_2|NZ_AP018052.1_2972950_2973628_-	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas5|246aa|down_3|NZ_AP018052.1_2973627_2974365_-	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas7|354aa|down_4|NZ_AP018052.1_2974370_2975432_-	pfam09344, Cas_CT1975, CT1975-like protein	cse2gr11|170aa|down_5|NZ_AP018052.1_2975468_2975978_-	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas8e|543aa|down_6|NZ_AP018052.1_2975967_2977596_-	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cas3|887aa|down_7|NZ_AP018052.1_2977640_2980301_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|72aa|down_8|NZ_AP018052.1_2980592_2980808_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|150aa|down_9|NZ_AP018052.1_2980819_2981269_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]
