assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000025545.1_ASM2554v1	NC_013930	Thioalkalivibrio sp. K90mix plasmid pTK9001, complete sequence	1	66903-67447	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas14j,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5	cas14j,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5	Type IV-A	GTGGAAAGCACGTCCCTGTGGGGGCGTGGTTGGAAC,GTGGAAAGCACGTCCCTGTGGGGGCGTGGTTGGAAC,GTGGAAAGCACGTCCCTGTGGGGGCGTGGTTGGAAC	36,36,36	0	0	NA	NA	NA:NA:NA	6,7,7	7	TypeIV-A,TypeV	DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,cas14j,csf5gr6,csf1gr8,csf2gr7,csf3gr5	NA|104aa|up_9|NC_013930.1_57442_57754_+,NA|142aa|up_8|NC_013930.1_57786_58212_-,NA|70aa|up_6|NC_013930.1_60228_60438_-,NA|102aa|up_5|NC_013930.1_60440_60746_-,csf5gr6|234aa|up_3|NC_013930.1_63525_64227_+,NA|82aa|down_0|NC_013930.1_67892_68138_-,NA|51aa|down_1|NC_013930.1_68201_68354_-,NA|161aa|down_3|NC_013930.1_70641_71124_-,NA|85aa|down_4|NC_013930.1_71214_71469_-,NA|244aa|down_5|NC_013930.1_71455_72187_-,NA|165aa|down_6|NC_013930.1_72272_72767_-,NA|104aa|down_7|NC_013930.1_72780_73092_-,NA|162aa|down_8|NC_013930.1_73227_73713_-	NA|104aa|up_9|NC_013930.1_57442_57754_+	NA	NA|142aa|up_8|NC_013930.1_57786_58212_-	NA	NA|653aa|up_7|NC_013930.1_58213_60172_-	COG0272, Lig, NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair]	NA|70aa|up_6|NC_013930.1_60228_60438_-	NA	NA|102aa|up_5|NC_013930.1_60440_60746_-	NA	DinG|767aa|up_4|NC_013930.1_61134_63435_-	TIGR03117, cas_csf4, CRISPR type AFERR-associated DEAD/DEAH-box helicase Csf4	csf5gr6|234aa|up_3|NC_013930.1_63525_64227_+	NA	csf1gr8|244aa|up_2|NC_013930.1_64250_64982_+	cd09705, Csf1_U, CRISPR/Cas system-associated protein Csf1	csf2gr7|350aa|up_1|NC_013930.1_65006_66056_+	TIGR03115, cas7_csf2, CRISPR type IV/AFERR-associated protein Csf2	csf3gr5|219aa|up_0|NC_013930.1_66055_66712_+	cd09707, Csf3_U, CRISPR/Cas system-associated RAMP superfamily protein Csf3	NA|82aa|down_0|NC_013930.1_67892_68138_-	NA	NA|51aa|down_1|NC_013930.1_68201_68354_-	NA	NA|705aa|down_2|NC_013930.1_68442_70557_-	COG0210, UvrD, Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair]	NA|161aa|down_3|NC_013930.1_70641_71124_-	NA	NA|85aa|down_4|NC_013930.1_71214_71469_-	NA	NA|244aa|down_5|NC_013930.1_71455_72187_-	NA	NA|165aa|down_6|NC_013930.1_72272_72767_-	NA	NA|104aa|down_7|NC_013930.1_72780_73092_-	NA	NA|162aa|down_8|NC_013930.1_73227_73713_-	NA	NA|804aa|down_9|NC_013930.1_73800_76212_-	TIGR03183, DNA_S_dndC, putative sulfurtransferase DndC
GCF_000025545.1_ASM2554v1	NC_013930	Thioalkalivibrio sp. K90mix plasmid pTK9001, complete sequence	2	235299-235454	2	PILER-CR	no		cas14j,DinG,csf5gr6,csf1gr8,csf2gr7,csf3gr5	Orphan	TTCGTTTCGCGGGCGCGAAACA	22	0	0	NA	NA	NA	2	2	Orphan	DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,cas14j,csf5gr6,csf1gr8,csf2gr7,csf3gr5	NA|191aa|up_9|NC_013930.1_224414_224987_+,NA|222aa|up_7|NC_013930.1_226033_226699_+,NA|64aa|down_1|NC_013930.1_236291_236483_+,NA|146aa|down_2|NC_013930.1_236479_236917_+,NA|139aa|down_3|NC_013930.1_236966_237383_+	NA|191aa|up_9|NC_013930.1_224414_224987_+	NA	NA|315aa|up_8|NC_013930.1_225085_226030_+	TIGR01650, PD_CobS, cobaltochelatase, CobS subunit	NA|222aa|up_7|NC_013930.1_226033_226699_+	NA	NA|332aa|up_6|NC_013930.1_226781_227777_+	pfam11348, DUF3150, Protein of unknown function (DUF3150)	NA|616aa|up_5|NC_013930.1_227855_229703_+	cd01454, vWA_norD_type, norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases	NA|402aa|up_4|NC_013930.1_229778_230984_+	pfam13362, Toprim_3, Toprim domain	NA|381aa|up_3|NC_013930.1_231118_232261_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|260aa|up_2|NC_013930.1_232499_233279_+	pfam08536, Whirly, Whirly transcription factor	NA|265aa|up_1|NC_013930.1_233317_234112_+	COG1192, Soj, ATPases involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|369aa|up_0|NC_013930.1_234182_235289_+	cd16393, SPO0J_N, Thermus thermophilus stage 0 sporulation protein J-like N-terminal domain, ParB family member	NA|157aa|down_0|NC_013930.1_235824_236295_+	COG0614, FepB, ABC-type Fe3+-hydroxamate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|64aa|down_1|NC_013930.1_236291_236483_+	NA	NA|146aa|down_2|NC_013930.1_236479_236917_+	NA	NA|139aa|down_3|NC_013930.1_236966_237383_+	NA	NA|253aa|down_4|NC_013930.1_237573_238332_+	cd04496, SSB_OBF, SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB)	NA|385aa|down_5|NC_013930.1_238425_239579_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|157aa|down_6|NC_013930.1_239674_240145_+	COG3133, SlyB, Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane]	NA|NA	NA	NA|NA	NA	NA|NA	NA
GCF_000025545.1_ASM2554v1	NC_013889	Thioalkalivibrio sp. K90mix, complete sequence	1	1929222-1929309	1	CRISPRCasFinder	no		DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f	Orphan	GCGAGGTCTGACCCCGGCGACCGC	24	1	1	1929246-1929285	NC_013889.1_1929332-1929371	NA	1	1	Orphan	DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,cas14j,csf5gr6,csf1gr8,csf2gr7,csf3gr5	NA|159aa|up_6|NC_013889.1_1923173_1923650_+,NA|161aa|up_5|NC_013889.1_1923746_1924229_+,NA|134aa|down_2|NC_013889.1_1932424_1932826_-,NA|96aa|down_8|NC_013889.1_1938163_1938451_-	NA|423aa|up_9|NC_013889.1_1920885_1922154_+	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|157aa|up_8|NC_013889.1_1922316_1922787_+	COG2153, ElaA, Predicted acyltransferase [General function prediction only]	NA|101aa|up_7|NC_013889.1_1922880_1923183_+	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|159aa|up_6|NC_013889.1_1923173_1923650_+	NA	NA|161aa|up_5|NC_013889.1_1923746_1924229_+	NA	NA|279aa|up_4|NC_013889.1_1924194_1925031_+	pfam03958, Secretin_N, Bacterial type II/III secretion system short domain	NA|425aa|up_3|NC_013889.1_1925448_1926723_-	pfam00872, Transposase_mut, Transposase, Mutator family	NA|139aa|up_2|NC_013889.1_1926924_1927341_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|112aa|up_1|NC_013889.1_1927494_1927830_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|278aa|up_0|NC_013889.1_1927983_1928817_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|355aa|down_0|NC_013889.1_1929570_1930635_-	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane	NA|84aa|down_1|NC_013889.1_1931702_1931954_-	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|134aa|down_2|NC_013889.1_1932424_1932826_-	NA	NA|102aa|down_3|NC_013889.1_1932900_1933206_-	cd05403, NT_KNTase_like, Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins	NA|399aa|down_4|NC_013889.1_1933543_1934740_-	TIGR01988, Ubiquinone_biosynthesis_monooxygenase_COQ6, Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family	NA|420aa|down_5|NC_013889.1_1934736_1935996_-	TIGR01988, Ubiquinone_biosynthesis_monooxygenase_COQ6, Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family	NA|455aa|down_6|NC_013889.1_1936200_1937565_-	PRK10879, PRK10879, proline aminopeptidase P II; Provisional	NA|201aa|down_7|NC_013889.1_1937561_1938164_-	PRK01736, PRK01736, hypothetical protein; Reviewed	NA|96aa|down_8|NC_013889.1_1938163_1938451_-	NA	NA|75aa|down_9|NC_013889.1_1938698_1938923_+	TIGR02449, conserved_hypothetical_protein, TIGR02449 family protein
GCF_000025545.1_ASM2554v1	NC_013889	Thioalkalivibrio sp. K90mix, complete sequence	2	2575298-2579946	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f	DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f	Type I-F	GTTAGCTGCCGCACAGGCAGCTCAGAAA,GTTAGCTGCCGCACAGGCAGCTCAGAAA,GTTAGCTGCCGCACAGGCAGCTCAGAAA	28,28,28	0	0	NA	NA	I-F:I-F:I-F	76,77,77	77	TypeI-F	DEDDh,csa3,WYL,DinG,RT,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,cas14j,csf5gr6,csf1gr8,csf2gr7,csf3gr5	NA|151aa|up_9|NC_013889.1_2563489_2563942_+,NA|49aa|down_6|NC_013889.1_2586577_2586724_+,NA|284aa|down_7|NC_013889.1_2586745_2587597_+	NA|151aa|up_9|NC_013889.1_2563489_2563942_+	NA	WYL|320aa|up_8|NC_013889.1_2563946_2564906_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|376aa|up_7|NC_013889.1_2565139_2566267_+	pfam14397, ATPgrasp_ST, Sugar-transfer associated ATP-grasp	NA|136aa|up_6|NC_013889.1_2566357_2566765_+	smart00910, HIRAN, The HIRAN protein (HIP116, Rad5p N-terminal) is found in the N-terminal regions of the SWI2/SNF2 proteins typified by HIP116 and Rad5p	cas1|326aa|up_5|NC_013889.1_2566881_2567859_+	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	cas3-cas2|1123aa|up_4|NC_013889.1_2567855_2571224_+	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas8f|425aa|up_3|NC_013889.1_2571310_2572585_+	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas5f|323aa|up_2|NC_013889.1_2572577_2573546_+	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas7f|347aa|up_1|NC_013889.1_2573562_2574603_+	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas6f|187aa|up_0|NC_013889.1_2574606_2575167_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	NA|456aa|down_0|NC_013889.1_2580047_2581415_-	cd07100, ALDH_SSADH1_GabD1, Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like	NA|321aa|down_1|NC_013889.1_2581520_2582483_+	cd05271, NDUFA9_like_SDR_a, NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, subunit 9, 39 kDa, (NDUFA9) -like, atypical (a) SDRs	NA|411aa|down_2|NC_013889.1_2582499_2583732_+	PRK10885, cca, multifunctional CCA addition/repair protein	NA|219aa|down_3|NC_013889.1_2583867_2584524_+	COG2518, Pcm, Protein-L-isoaspartate carboxylmethyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|450aa|down_4|NC_013889.1_2584643_2585993_+	PRK09465, tolC, outer membrane channel protein; Reviewed	NA|188aa|down_5|NC_013889.1_2585989_2586553_+	cd01012, YcaC_related, YcaC related amidohydrolases; E	NA|49aa|down_6|NC_013889.1_2586577_2586724_+	NA	NA|284aa|down_7|NC_013889.1_2586745_2587597_+	NA	NA|357aa|down_8|NC_013889.1_2587596_2588667_+	PRK05368, PRK05368, homoserine O-succinyltransferase; Provisional	NA|447aa|down_9|NC_013889.1_2588823_2590164_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters
