assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000147695.2_ASM14769v3	NC_015958	Thermoanaerobacter wiegelii Rt8.B1, complete sequence	1	606921-607029	1	CRISPRCasFinder	no		RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	Orphan	TAAAAGAAGCGGGTTTCCCACTTCTTTTAG	30	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	NA|53aa|up_8|NC_015958.1_589084_589243_+,NA|62aa|up_2|NC_015958.1_603833_604019_+,NA|214aa|down_5|NC_015958.1_613180_613822_+,NA|167aa|down_9|NC_015958.1_616482_616983_+	NA|267aa|up_9|NC_015958.1_587448_588249_+	PRK12804, PRK12804, flagellin; Provisional	NA|53aa|up_8|NC_015958.1_589084_589243_+	NA	NA|495aa|up_7|NC_015958.1_589302_590787_-	pfam13751, DDE_Tnp_1_6, Transposase DDE domain	NA|267aa|up_6|NC_015958.1_592288_593089_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|263aa|up_5|NC_015958.1_596439_597228_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|1520aa|up_4|NC_015958.1_597245_601805_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|614aa|up_3|NC_015958.1_601852_603694_-	TIGR01536, Asparagine_synthetase_1, asparagine synthase (glutamine-hydrolyzing)	NA|62aa|up_2|NC_015958.1_603833_604019_+	NA	NA|114aa|up_1|NC_015958.1_604161_604503_+	pfam03646, FlaG, FlaG protein	NA|510aa|up_0|NC_015958.1_604518_606048_+	PRK07737, fliD, flagellar hook-associated protein 2	NA|66aa|down_0|NC_015958.1_609364_609562_+	pfam13451, zf-trcl, Probable zinc-ribbon domain	NA|262aa|down_1|NC_015958.1_609901_610687_+	cd13624, PBP2_Arg_Lys_His, Substrate binding domain of the arginine-, lysine-, histidine-binding protein ArtJ; the type 2 periplasmic binding protein fold	NA|221aa|down_2|NC_015958.1_610752_611415_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|241aa|down_3|NC_015958.1_611401_612124_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|182aa|down_4|NC_015958.1_612505_613051_+	pfam16321, Ribosom_S30AE_C, Sigma 54 modulation/S30EA ribosomal protein C-terminus	NA|214aa|down_5|NC_015958.1_613180_613822_+	NA	NA|425aa|down_6|NC_015958.1_614043_615318_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|192aa|down_7|NC_015958.1_615307_615883_+	pfam17248, DUF5317, Family of unknown function (DUF5317)	NA|171aa|down_8|NC_015958.1_615857_616370_+	COG1827, COG1827, Predicted small molecule binding protein (contains 3H domain) [General function prediction only]	NA|167aa|down_9|NC_015958.1_616482_616983_+	NA
GCF_000147695.2_ASM14769v3	NC_015958	Thermoanaerobacter wiegelii Rt8.B1, complete sequence	2	1497859-1497954	2	CRISPRCasFinder	no	RT	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	Unclear	ACTATTTTCAGGATAGGTAGGCTAAAAAC	29	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	NA,NA	NA|308aa|up_9|NC_015958.1_1483603_1484527_-	TIGR02127, Orotidine_5'-phosphate_decarboxylase, orotidine 5'-phosphate decarboxylase, subfamily 2	NA|432aa|up_8|NC_015958.1_1484586_1485882_-	PRK09357, pyrC, dihydroorotase; Validated	NA|305aa|up_7|NC_015958.1_1485883_1486798_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|179aa|up_6|NC_015958.1_1487003_1487540_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|332aa|up_5|NC_015958.1_1487760_1488756_+	cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function	NA|306aa|up_4|NC_015958.1_1489118_1490036_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|145aa|up_3|NC_015958.1_1490040_1490475_-	PRK14791, PRK14791, lipoprotein signal peptidase; Provisional	NA|193aa|up_2|NC_015958.1_1490669_1491248_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|827aa|up_1|NC_015958.1_1491547_1494028_-	pfam09823, DUF2357, Domain of unknown function (DUF2357)	NA|584aa|up_0|NC_015958.1_1494002_1495754_-	pfam12102, DUF3578, Domain of unknown function (DUF3578)	NA|131aa|down_0|NC_015958.1_1498007_1498400_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|392aa|down_1|NC_015958.1_1500037_1501213_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|407aa|down_2|NC_015958.1_1501715_1502936_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|123aa|down_3|NC_015958.1_1505962_1506331_-	pfam01741, MscL, Large-conductance mechanosensitive channel, MscL	NA|252aa|down_4|NC_015958.1_1506468_1507224_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|207aa|down_5|NC_015958.1_1507223_1507844_-	cd02137, MhqN-like, nitroreductase family protein similar to the NAD(P)H nitroreductase MhqN	NA|155aa|down_6|NC_015958.1_1508104_1508569_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|320aa|down_7|NC_015958.1_1508569_1509529_+	pfam13800, Sigma_reg_N, Sigma factor regulator N-terminal	NA|424aa|down_8|NC_015958.1_1509572_1510844_-	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|62aa|down_9|NC_015958.1_1511223_1511409_-	pfam04024, PspC, PspC domain
GCF_000147695.2_ASM14769v3	NC_015958	Thermoanaerobacter wiegelii Rt8.B1, complete sequence	3	1503022-1505765	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	RT	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	Unclear	CTTTCAATTCCTTATAGGTAGGCTAAAAAC,CTTTCAATTCCTTATAGGTAGGCTAAAAAC,CTTTCAATTCCTTATAGGTAGGCTAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	41,41,40	41	Orphan	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	NA,NA|104aa|down_9|NC_015958.1_1512665_1512977_-	NA|179aa|up_9|NC_015958.1_1487003_1487540_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|332aa|up_8|NC_015958.1_1487760_1488756_+	cd06061, PurM-like1, AIR synthase (PurM) related protein, subgroup 1 of unknown function	NA|306aa|up_7|NC_015958.1_1489118_1490036_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|145aa|up_6|NC_015958.1_1490040_1490475_-	PRK14791, PRK14791, lipoprotein signal peptidase; Provisional	NA|193aa|up_5|NC_015958.1_1490669_1491248_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|827aa|up_4|NC_015958.1_1491547_1494028_-	pfam09823, DUF2357, Domain of unknown function (DUF2357)	NA|584aa|up_3|NC_015958.1_1494002_1495754_-	pfam12102, DUF3578, Domain of unknown function (DUF3578)	NA|131aa|up_2|NC_015958.1_1498007_1498400_-	PRK09183, PRK09183, transposase/IS protein; Provisional	NA|392aa|up_1|NC_015958.1_1500037_1501213_-	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|407aa|up_0|NC_015958.1_1501715_1502936_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|123aa|down_0|NC_015958.1_1505962_1506331_-	pfam01741, MscL, Large-conductance mechanosensitive channel, MscL	NA|252aa|down_1|NC_015958.1_1506468_1507224_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|207aa|down_2|NC_015958.1_1507223_1507844_-	cd02137, MhqN-like, nitroreductase family protein similar to the NAD(P)H nitroreductase MhqN	NA|155aa|down_3|NC_015958.1_1508104_1508569_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|320aa|down_4|NC_015958.1_1508569_1509529_+	pfam13800, Sigma_reg_N, Sigma factor regulator N-terminal	NA|424aa|down_5|NC_015958.1_1509572_1510844_-	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|62aa|down_6|NC_015958.1_1511223_1511409_-	pfam04024, PspC, PspC domain	NA|139aa|down_7|NC_015958.1_1511589_1512006_+	COG0432, COG0432, Uncharacterized conserved protein [Function unknown]	NA|207aa|down_8|NC_015958.1_1512029_1512650_-	TIGR02890, conserved_hypothetical_protein, regulatory protein, yteA family	NA|104aa|down_9|NC_015958.1_1512665_1512977_-	NA
GCF_000147695.2_ASM14769v3	NC_015958	Thermoanaerobacter wiegelii Rt8.B1, complete sequence	4	2673398-2679131	4,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas3,cas5,cas7b,cas6	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	Unclear	GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC,GTTTCAATTCCTTATAGGTAGGCTAAAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	86,86,85	86	Unclear	RT,csa3,DEDDh,cas3,cas4,cas2,cas1,cas5,cas7b,cas6,cas14k	NA|194aa|up_5|NC_015958.1_2667451_2668033_-,NA|63aa|down_3|NC_015958.1_2685421_2685610_+	NA|473aa|up_9|NC_015958.1_2659583_2661002_+	TIGR00909, putative_amino_acid_transporter, amino acid transporter	NA|260aa|up_8|NC_015958.1_2661038_2661818_-	cd07733, YycJ-like_MBL-fold, uncharacterized subgroup which includes Bacillus subtilis YycJ and related proteins; MBL-fold metallo hydrolase domain	NA|642aa|up_7|NC_015958.1_2662078_2664004_-	TIGR03997, NADH:flavin_oxidoreductase, mycofactocin system FadH/OYE family oxidoreductase 2	NA|438aa|up_6|NC_015958.1_2666001_2667315_-	pfam00665, rve, Integrase core domain	NA|194aa|up_5|NC_015958.1_2667451_2668033_-	NA	cas2|88aa|up_4|NC_015958.1_2668933_2669197_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|331aa|up_3|NC_015958.1_2669213_2670206_-	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas4|166aa|up_2|NC_015958.1_2670202_2670700_-	pfam01930, Cas_Cas4, Domain of unknown function DUF83	NA|243aa|up_1|NC_015958.1_2671028_2671757_-	cd03146, GAT1_Peptidase_E, Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E	NA|407aa|up_0|NC_015958.1_2672091_2673312_+	pfam01548, DEDD_Tnp_IS110, Transposase	cas3|779aa|down_0|NC_015958.1_2679311_2681648_-	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|237aa|down_1|NC_015958.1_2681662_2682373_-	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas7b|294aa|down_2|NC_015958.1_2682387_2683269_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	NA|63aa|down_3|NC_015958.1_2685421_2685610_+	NA	NA|292aa|down_4|NC_015958.1_2685822_2686698_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	cas6|252aa|down_5|NC_015958.1_2686946_2687702_-	TIGR01877, CRISPR-associated_endoribonuclease_Cas6_1, CRISPR-associated endoribonuclease Cas6	NA|424aa|down_6|NC_015958.1_2687954_2689226_+	pfam07833, Cu_amine_oxidN1, Copper amine oxidase N-terminal domain	NA|138aa|down_7|NC_015958.1_2689253_2689667_-	TIGR01994, Iron-sulfur_cluster_assembly_scaffold_protein_IscU, SUF system FeS assembly protein, NifU family	NA|410aa|down_8|NC_015958.1_2689663_2690893_-	TIGR01979, Probable_cysteine_desulfurase, cysteine desulfurases, SufSfamily	NA|350aa|down_9|NC_015958.1_2690889_2691939_-	TIGR01981, UPF0051_protein_Rv1462/MT1509, FeS assembly protein SufD
