assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000217795.1_ASM21779v1	NC_015681	Thermodesulfatator indicus DSM 15286, complete sequence	1	238340-242248	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	Type III-C,Type I-B,Type III-A,Type III-D,Type III-B	GTTCACAGCCTAACTAAAAGGAATGGAAAC,GTTCACAGCCTAACTAAAAGGAATGGAAAC,GTTCACAGCCTAACTAAAAGGAATGGAAAC	30,30,30	0	0	NA	NA	NA:NA:NA	57,57,58	58	TypeIII-C,TypeI-B,TypeIII-A,TypeIII-D,TypeIII-B	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	cas8b1|484aa|up_7|NC_015681.1_230423_231875_+,NA|102aa|down_3|NC_015681.1_246727_247033_-,NA|201aa|down_6|NC_015681.1_249007_249610_-,NA|125aa|down_9|NC_015681.1_252598_252973_-	cmr3gr5|318aa|up_9|NC_015681.1_227965_228919_+	pfam03787, RAMPs, RAMP superfamily	NA|183aa|up_8|NC_015681.1_229441_229990_+	COG4871, COG4871, Uncharacterized protein conserved in archaea [Function unknown]	cas8b1|484aa|up_7|NC_015681.1_230423_231875_+	NA	cas7|317aa|up_6|NC_015681.1_231876_232827_+	TIGR02585, conserved_protein, CRISPR-associated protein Cas7/Cst2/DevR, subtype I-B/TNEAP	cas5|242aa|up_5|NC_015681.1_232814_233540_+	cd09693, Cas5_I, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|727aa|up_4|NC_015681.1_233524_235705_+	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	NA|78aa|up_3|NC_015681.1_235801_236035_+	smart00966, SpoVT_AbrB, SpoVT / AbrB like domain	cas4|166aa|up_2|NC_015681.1_236034_236532_+	pfam01930, Cas_Cas4, Domain of unknown function DUF83	cas1|331aa|up_1|NC_015681.1_236947_237940_+	TIGR03641, cas1_HMARI, CRISPR-associated endonuclease Cas1, subtype I-B/HMARI/TNEAP	cas2|89aa|up_0|NC_015681.1_237939_238206_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|418aa|down_0|NC_015681.1_242647_243901_-	pfam01548, DEDD_Tnp_IS110, Transposase	NA|196aa|down_1|NC_015681.1_244928_245516_+	COG2191, COG2191, Formylmethanofuran dehydrogenase subunit E [Energy production and conversion]	NA|251aa|down_2|NC_015681.1_245599_246352_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|102aa|down_3|NC_015681.1_246727_247033_-	NA	NA|282aa|down_4|NC_015681.1_247120_247966_-	pfam13435, Cytochrome_C554, Cytochrome c554 and c-prime	NA|274aa|down_5|NC_015681.1_248183_249005_-	cd06225, HAMP, Histidine kinase, Adenylyl cyclase, Methyl-accepting protein, and Phosphatase (HAMP) domain	NA|201aa|down_6|NC_015681.1_249007_249610_-	NA	NA|263aa|down_7|NC_015681.1_249606_250395_-	pfam14332, DUF4388, Domain of unknown function (DUF4388)	NA|716aa|down_8|NC_015681.1_250451_252599_-	TIGR02063, Ribonuclease_R, ribonuclease R	NA|125aa|down_9|NC_015681.1_252598_252973_-	NA
GCF_000217795.1_ASM21779v1	NC_015681	Thermodesulfatator indicus DSM 15286, complete sequence	2	479511-479610	2	CRISPRCasFinder	no		cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	Orphan	TGAGACTGCTTCGCTGCGCTCGCAGTGACAGG	32	0	0	NA	NA	NA	1	1	Orphan	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	NA|92aa|up_7|NC_015681.1_468753_469029_+,NA|112aa|down_0|NC_015681.1_479697_480033_-	NA|154aa|up_9|NC_015681.1_465810_466272_-	PRK00326, PRK00326, transcriptional regulator MraZ	NA|514aa|up_8|NC_015681.1_466869_468411_+	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|92aa|up_7|NC_015681.1_468753_469029_+	NA	NA|333aa|up_6|NC_015681.1_469022_470021_+	TIGR03589, PseB, UDP-N-acetylglucosamine 4,6-dehydratase (inverting)	NA|386aa|up_5|NC_015681.1_470016_471174_+	TIGR03588, PseC, UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase	NA|279aa|up_4|NC_015681.1_471170_472007_+	cd02518, GT2_SpsF, SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat	NA|490aa|up_3|NC_015681.1_471994_473464_+	TIGR03590, PseG, UDP-2,4-diacetamido-2,4,6-trideoxy-beta-L-altropyranose hydrolase	NA|351aa|up_2|NC_015681.1_473450_474503_+	TIGR03586, PseI, pseudaminic acid synthase	NA|665aa|up_1|NC_015681.1_474579_476574_-	pfam01973, MAF_flag10, Protein of unknown function DUF115	NA|840aa|up_0|NC_015681.1_476885_479405_-	PRK13588, PRK13588, flagellin B; Provisional	NA|112aa|down_0|NC_015681.1_479697_480033_-	NA	NA|145aa|down_1|NC_015681.1_480299_480734_-	pfam02561, FliS, Flagellar protein FliS	NA|560aa|down_2|NC_015681.1_481102_482782_-	COG1345, FliD, Flagellar capping protein [Cell motility and secretion]	NA|162aa|down_3|NC_015681.1_482827_483313_-	PRK13285, PRK13285, flagellar assembly protein FliW; Provisional	NA|77aa|down_4|NC_015681.1_483500_483731_-	PRK01712, PRK01712, carbon storage regulator CsrA	NA|317aa|down_5|NC_015681.1_483742_484693_-	TIGR02550, Flagellar_hook-associated_protein_3, flagellar hook-associated protein 3	NA|1329aa|down_6|NC_015681.1_484708_488695_-	COG1256, FlgK, Flagellar hook-associated protein [Cell motility and secretion]	NA|111aa|down_7|NC_015681.1_488696_489029_-	pfam05130, FlgN, FlgN protein	NA|103aa|down_8|NC_015681.1_489193_489502_-	COG3951, COG3951, Rod binding protein [Cell envelope biogenesis, outer membrane / Cell motility and secretion / Posttranslational modification, protein turnover, chaperones]	NA|359aa|down_9|NC_015681.1_489508_490585_-	pfam02119, FlgI, Flagellar P-ring protein
GCF_000217795.1_ASM21779v1	NC_015681	Thermodesulfatator indicus DSM 15286, complete sequence	3	609953-610103	3	CRISPRCasFinder	no		cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	Orphan	CTCGCAGTGACAAATGTGATCAGGG	25	1	1	609978-610028	NC_015681.1_596456-596406	NA	2	2	Orphan	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	NA|416aa|up_6|NC_015681.1_596574_597822_-,NA|68aa|down_1|NC_015681.1_611430_611634_-	NA|126aa|up_9|NC_015681.1_593426_593804_-	COG0239, CrcB, Integral membrane protein possibly involved in chromosome condensation [Cell division and chromosome partitioning]	NA|495aa|up_8|NC_015681.1_593800_595285_-	pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein mannosyltransferase	NA|199aa|up_7|NC_015681.1_595281_595878_-	cd03395, PAP2_like_4, PAP2_like_4 proteins	NA|416aa|up_6|NC_015681.1_596574_597822_-	NA	NA|290aa|up_5|NC_015681.1_598583_599453_-	cd07025, Peptidase_S66, LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein	NA|256aa|up_4|NC_015681.1_599445_600213_-	PRK13321, PRK13321, type III pantothenate kinase	NA|269aa|up_3|NC_015681.1_600214_601021_-	PRK00517, prmA, 50S ribosomal protein L11 methyltransferase	NA|593aa|up_2|NC_015681.1_601001_602780_-	cd07345, M48A_Ste24p-like, Peptidase M48 subfamily A-like, putative CaaX prenyl protease	NA|737aa|up_1|NC_015681.1_603078_605289_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|1243aa|up_0|NC_015681.1_606184_609913_+	pfam02514, CobN-Mg_chel, CobN/Magnesium Chelatase	NA|370aa|down_0|NC_015681.1_610279_611389_-	TIGR00326, eubact_ribD, riboflavin biosynthesis protein RibD	NA|68aa|down_1|NC_015681.1_611430_611634_-	NA	NA|728aa|down_2|NC_015681.1_611887_614071_+	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|66aa|down_3|NC_015681.1_614067_614265_-	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|459aa|down_4|NC_015681.1_614348_615725_-	TIGR00665, DnaB, replicative DNA helicase	NA|82aa|down_5|NC_015681.1_615999_616245_-	TIGR01622, transcription_coactivator_CAPER, splicing factor, CC1-like family	NA|158aa|down_6|NC_015681.1_616348_616822_+	pfam02542, YgbB, YgbB family	NA|604aa|down_7|NC_015681.1_616814_618626_+	PRK00558, uvrC, excinuclease ABC subunit UvrC	NA|283aa|down_8|NC_015681.1_618689_619538_+	pfam01797, Y1_Tnp, Transposase IS200 like	NA|214aa|down_9|NC_015681.1_619605_620247_+	COG2191, COG2191, Formylmethanofuran dehydrogenase subunit E [Energy production and conversion]
GCF_000217795.1_ASM21779v1	NC_015681	Thermodesulfatator indicus DSM 15286, complete sequence	4	918034-919769	2,4,2,3	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	Unclear	GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGACAT	37,37,37,39	0	0	NA	NA	NA:NA:NA:NA	18,22,22,18	22	Unclear	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	NA|315aa|up_9|NC_015681.1_908069_909014_+,NA	NA|315aa|up_9|NC_015681.1_908069_909014_+	NA	NA|123aa|up_8|NC_015681.1_909003_909372_+	pfam04126, Cyclophil_like, Cyclophilin-like	NA|76aa|up_7|NC_015681.1_909385_909613_+	pfam09723, Zn-ribbon_8, Zinc ribbon domain	NA|297aa|up_6|NC_015681.1_909616_910507_+	PRK05687, fliH, flagellar assembly protein FliH	NA|310aa|up_5|NC_015681.1_910560_911490_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|503aa|up_4|NC_015681.1_911489_912998_+	PRK06136, PRK06136, uroporphyrinogen-III C-methyltransferase	NA|179aa|up_3|NC_015681.1_913177_913714_-	pfam03773, ArsP_1, Predicted permease	NA|161aa|up_2|NC_015681.1_913710_914193_-	pfam03773, ArsP_1, Predicted permease	NA|375aa|up_1|NC_015681.1_915040_916165_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|410aa|up_0|NC_015681.1_916174_917404_+	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|206aa|down_0|NC_015681.1_919956_920574_-	COG1611, COG1611, Predicted Rossmann fold nucleotide-binding protein [General function prediction only]	NA|260aa|down_1|NC_015681.1_920645_921425_-	PRK00085, recO, DNA repair protein RecO; Reviewed	NA|172aa|down_2|NC_015681.1_921428_921944_-	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|306aa|down_3|NC_015681.1_921943_922861_-	PRK00059, prsA, peptidylprolyl isomerase; Provisional	cas3|1168aa|down_4|NC_015681.1_922857_926361_-	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|501aa|down_5|NC_015681.1_926692_928195_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|92aa|down_6|NC_015681.1_928220_928496_-	PRK05435, rpmA, 50S ribosomal protein L27; Validated	NA|104aa|down_7|NC_015681.1_928507_928819_-	PRK05573, rplU, 50S ribosomal protein L21; Validated	NA|382aa|down_8|NC_015681.1_929068_930214_+	TIGR04311, Radical_SAM_domain_protein, putative metalloenzyme radical SAM/SPASM domain maturase	NA|466aa|down_9|NC_015681.1_930235_931633_-	TIGR01479, Mannose-1-phosphate_guanylyltransferase, mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase
GCF_000217795.1_ASM21779v1	NC_015681	Thermodesulfatator indicus DSM 15286, complete sequence	5	1410352-1414024	4,5,3,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,cmr1gr7,cas2,cas1,cas6	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	Type III-C,Type III-A,Type III-B,Type III-D	GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC,GTGAGAAAACCTTGCCTGATTAAGAAGGCATTACGAC	37,37,37,37	0	0	NA	NA	NA:NA:NA:NA	45,47,47,45	47	TypeIII-C,TypeIII-A,TypeIII-B,TypeIII-D	cas6,cmr1gr7,cmr6gr7,cas10,cmr4gr7,cmr5gr11,cmr3gr5,cas8b1,cas7,cas5,cas3,cas4,cas1,cas2,Cas9_archaeal,csx1,csa3,PD-DExK	NA,NA	NA|406aa|up_9|NC_015681.1_1399289_1400507_-	pfam13635, DUF4143, Domain of unknown function (DUF4143)	csx1|422aa|up_8|NC_015681.1_1400688_1401954_-	cd09747, Csx1_III-U, CRISPR/Cas system-associated protein Csx1	cmr6gr7|308aa|up_7|NC_015681.1_1401943_1402867_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|142aa|up_6|NC_015681.1_1402859_1403285_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|296aa|up_5|NC_015681.1_1403281_1404169_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	NA|122aa|up_4|NC_015681.1_1404180_1404546_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|80aa|up_3|NC_015681.1_1404542_1404782_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	cmr3gr5|347aa|up_2|NC_015681.1_1404794_1405835_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3	cas10|937aa|up_1|NC_015681.1_1405821_1408632_-	cd09679, Cas10_III, CRISPR/Cas system-associated protein Cas10	cmr1gr7|366aa|up_0|NC_015681.1_1408618_1409716_-	COG1367, COG1367, CRISPR system related protein, RAMP superfamily [Defense mechanisms]	NA|410aa|down_0|NC_015681.1_1414452_1415682_-	COG1373, COG1373, Predicted ATPase (AAA+ superfamily) [General function prediction only]	cas2|96aa|down_1|NC_015681.1_1415917_1416205_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|572aa|down_2|NC_015681.1_1416201_1417917_+	sd00006, TPR, Tetratricopeptide repeat	cas1|339aa|down_3|NC_015681.1_1417916_1418933_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|97aa|down_4|NC_015681.1_1418937_1419228_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas6|315aa|down_5|NC_015681.1_1419176_1420121_-	pfam10040, CRISPR_Cas6, CRISPR-associated endoribonuclease Cas6	NA|255aa|down_6|NC_015681.1_1420172_1420937_-	cd01990, Alpha_ANH_like_I, This is a subfamily of Adenine nucleotide alpha hydrolases superfamily	NA|395aa|down_7|NC_015681.1_1420936_1422121_-	pfam01837, HcyBio, Homocysteine biosynthesis enzyme, sulfur-incorporation	NA|200aa|down_8|NC_015681.1_1422129_1422729_-	cd04584, CBS_pair_AcuB_like, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the ACT domain	NA|1393aa|down_9|NC_015681.1_1422796_1426975_-	COG1924, COG1924, Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) [Lipid metabolism]
