assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003555505.1_ASM355550v1	CP032152	Thermosynechococcus elongatus PKUAC-SCTE542 chromosome, complete genome	1	4409-5157	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	Orphan	GTGCTTCTACCTCTGATGCCGCAAGGCGTTGAGCAC,GTGCTTCTACCTCTGATGCCGCAAGGCGTTGAGCAC,GTGCTTCTACCTCTGATGCCGCAAGGCGTTGAGCAC	36,36,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	10,10,10	10	Orphan	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	NA,NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|416aa|up_0|CP032152.1_2794_4042_-	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|258aa|down_0|CP032152.1_5334_6108_+	PRK00748, PRK00748, 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated	NA|605aa|down_1|CP032152.1_6146_7961_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|94aa|down_2|CP032152.1_11880_12162_+	PRK00033, clpS, ATP-dependent Clp protease adaptor protein ClpS; Reviewed	NA|153aa|down_3|CP032152.1_14284_14743_-	pfam12049, DUF3531, Protein of unknown function (DUF3531)	NA|187aa|down_4|CP032152.1_14739_15300_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|216aa|down_5|CP032152.1_15394_16042_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|214aa|down_6|CP032152.1_16130_16772_-	COG0009, SUA5, Putative translation factor (SUA5) [Translation, ribosomal structure and biogenesis]	NA|328aa|down_7|CP032152.1_16955_17939_+	CHL00144, odpB, pyruvate dehydrogenase E1 component beta subunit; Validated	NA|108aa|down_8|CP032152.1_22621_22945_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|32aa|down_9|CP032152.1_23089_23185_-	pfam07465, PsaM, Photosystem I protein M (PsaM)
GCA_003555505.1_ASM355550v1	CP032152	Thermosynechococcus elongatus PKUAC-SCTE542 chromosome, complete genome	2	502808-503919	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no		PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	Orphan	GTTTCCATTTATTCGGCTGGGAAAGGTTCCCAGCAC,GTTTCCATTTATTCGGCTGGGAAAGGTTCCCAGCAC,GTTTCCATTTATTCGGCTGGGAAAGGTTCCCAGCAC	36,36,36	0	0	NA	NA	NA:NA:NA	15,15,15	15	Orphan	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	NA|129aa|up_1|CP032152.1_498824_499211_+,NA|121aa|down_1|CP032152.1_504670_505033_-,NA|107aa|down_2|CP032152.1_506948_507269_+	NA|252aa|up_9|CP032152.1_490427_491183_+	pfam13483, Lactamase_B_3, Beta-lactamase superfamily domain	NA|164aa|up_8|CP032152.1_491185_491677_+	COG3431, COG3431, Predicted membrane protein [Function unknown]	NA|288aa|up_7|CP032152.1_491653_492517_-	PRK07417, PRK07417, prephenate/arogenate dehydrogenase	NA|121aa|up_6|CP032152.1_495062_495425_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|287aa|up_5|CP032152.1_495548_496409_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|263aa|up_4|CP032152.1_496410_497199_-	TIGR02454, Uncharacterized_protein_MJ1089, cobalt ECF transporter T component CbiQ	NA|100aa|up_3|CP032152.1_497206_497506_-	PRK02898, PRK02898, energy-coupling factor ABC transporter substrate-binding protein	NA|102aa|up_2|CP032152.1_498511_498817_+	pfam04472, SepF, Cell division protein SepF	NA|129aa|up_1|CP032152.1_498824_499211_+	NA	NA|218aa|up_0|CP032152.1_500832_501486_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|74aa|down_0|CP032152.1_504406_504628_+	pfam14217, DUF4327, Domain of unknown function (DUF4327)	NA|121aa|down_1|CP032152.1_504670_505033_-	NA	NA|107aa|down_2|CP032152.1_506948_507269_+	NA	NA|164aa|down_3|CP032152.1_507473_507965_+	COG2268, COG2268, Uncharacterized protein conserved in bacteria [Function unknown]	NA|316aa|down_4|CP032152.1_509537_510485_+	PRK07405, PRK07405, RNA polymerase sigma factor SigD; Validated	NA|201aa|down_5|CP032152.1_510488_511091_+	PRK05986, PRK05986, cob(I)yrinic acid a,c-diamide adenosyltransferase	NA|151aa|down_6|CP032152.1_511931_512384_+	cd04413, NDPk_I, Nucleoside diphosphate kinase Group I (NDPk_I)-like: NDP kinase domains are present in a large family of structurally and functionally conserved proteins from bacteria to humans that generally catalyze the transfer of gamma-phosphates of a nucleoside triphosphate (NTP) donor onto a nucleoside diphosphate (NDP) acceptor through a phosphohistidine intermediate	NA|247aa|down_7|CP032152.1_512358_513099_-	TIGR03716, R_switched_YkoY, integral membrane protein, YkoY family	NA|304aa|down_8|CP032152.1_513770_514682_-	sd00006, TPR, Tetratricopeptide repeat	NA|350aa|down_9|CP032152.1_520077_521127_+	cd01005, PBP2_CysP, Substrate binding domain of an active sulfate transporter, a member of the type 2 periplasmic binding fold superfamily
GCA_003555505.1_ASM355550v1	CP032152	Thermosynechococcus elongatus PKUAC-SCTE542 chromosome, complete genome	4	1393703-1395039	3,4,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no		PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	Orphan	GTGCTGAGGCTTGTCCTCAGCCGATTAAATGGAAAC,GTGCTGAGGCTTGTCCTCAGCCGATTAAATGGAAAC,GTGCTGAGGCTTGTCCTCAGCCGATTAAATGGAAAC,GTGCTGAGGCTTGTCCTCAGCCGATTAAATGGAAAC	36,36,36,36	0	0	NA	NA	NA:NA:NA:NA	16,18,18,16	18	Orphan	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	NA,NA|107aa|down_0|CP032152.1_1398307_1398628_-,NA|285aa|down_1|CP032152.1_1398727_1399582_-,NA|60aa|down_4|CP032152.1_1402560_1402740_+,NA|62aa|down_6|CP032152.1_1406296_1406482_+	NA|295aa|up_9|CP032152.1_1373662_1374547_-	pfam12183, NotI, Restriction endonuclease NotI	NA|305aa|up_8|CP032152.1_1374592_1375507_-	COG2177, FtsX, Cell division protein [Cell division and chromosome partitioning]	NA|183aa|up_7|CP032152.1_1378378_1378927_-	pfam00908, dTDP_sugar_isom, dTDP-4-dehydrorhamnose 3,5-epimerase	NA|146aa|up_6|CP032152.1_1380994_1381432_+	cd01285, nucleoside_deaminase, Nucleoside deaminases include adenosine, guanine and cytosine deaminases	NA|288aa|up_5|CP032152.1_1383723_1384587_+	pfam01027, Bax1-I, Inhibitor of apoptosis-promoting Bax1	NA|113aa|up_4|CP032152.1_1385688_1386027_-	TIGR00049, Uncharacterized_protein_in_nifU_5'region, Iron-sulfur cluster assembly accessory protein	NA|159aa|up_3|CP032152.1_1386982_1387459_-	pfam06228, ChuX_HutX, Haem utilisation ChuX/HutX	NA|181aa|up_2|CP032152.1_1388778_1389321_-	COG0456, RimI, Acetyltransferases [General function prediction only]	NA|158aa|up_1|CP032152.1_1389765_1390239_+	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides	NA|241aa|up_0|CP032152.1_1390341_1391064_+	pfam06051, DUF928, Domain of Unknown Function (DUF928)	NA|107aa|down_0|CP032152.1_1398307_1398628_-	NA	NA|285aa|down_1|CP032152.1_1398727_1399582_-	NA	NA|257aa|down_2|CP032152.1_1399574_1400345_-	COG0411, LivG, ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism]	NA|251aa|down_3|CP032152.1_1401395_1402148_-	COG0565, LasT, rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|60aa|down_4|CP032152.1_1402560_1402740_+	NA	NA|370aa|down_5|CP032152.1_1403904_1405014_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|62aa|down_6|CP032152.1_1406296_1406482_+	NA	NA|229aa|down_7|CP032152.1_1406375_1407062_-	pfam01300, Sua5_yciO_yrdC, Telomere recombination	NA|610aa|down_8|CP032152.1_1408890_1410720_+	PRK07390, PRK07390, NAD(P)H-quinone oxidoreductase subunit F; Validated	NA|426aa|down_9|CP032152.1_1413407_1414685_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed
GCA_003555505.1_ASM355550v1	CP032152	Thermosynechococcus elongatus PKUAC-SCTE542 chromosome, complete genome	5	1874616-1876304	5,4,5	CRISPRCasFinder,CRT,PILER-CR	no	c2c9_V-U4	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	Type V-U4	GTTTCCATTTAATCGGCTGAGGACAAGCCTCAGCAC,GTTTCCATTTAATCGGCTGAGGACAAGCCTCAGCAC,GTGCTGAGGCTTGTCCTCAGCCGATTAAATGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	23,23,23	23	TypeV-U4	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	NA|131aa|up_4|CP032152.1_1863337_1863730_+,NA	NA|542aa|up_9|CP032152.1_1856523_1858149_-	PRK00095, mutL, DNA mismatch repair endonuclease MutL	NA|389aa|up_8|CP032152.1_1858145_1859312_-	PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated	NA|82aa|up_7|CP032152.1_1859348_1859594_-	pfam10742, DUF2555, Protein of unknown function (DUF2555)	NA|385aa|up_6|CP032152.1_1859851_1861006_+	PRK09776, PRK09776, putative diguanylate cyclase; Provisional	NA|758aa|up_5|CP032152.1_1860981_1863255_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|131aa|up_4|CP032152.1_1863337_1863730_+	NA	NA|159aa|up_3|CP032152.1_1863726_1864203_-	cd01043, DPS, DPS protein, ferritin-like diiron-binding domain	NA|319aa|up_2|CP032152.1_1864406_1865363_+	pfam00498, FHA, FHA domain	NA|463aa|up_1|CP032152.1_1867827_1869216_+	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|683aa|up_0|CP032152.1_1869212_1871261_-	COG0475, KefB, Kef-type K+ transport systems, membrane components [Inorganic ion transport and metabolism]	NA|241aa|down_0|CP032152.1_1876584_1877307_-	pfam02683, DsbD, Cytochrome C biogenesis protein transmembrane region	NA|124aa|down_1|CP032152.1_1880056_1880428_-	COG3883, COG3883, Uncharacterized protein conserved in bacteria [Function unknown]	NA|192aa|down_2|CP032152.1_1883976_1884552_+	pfam01475, FUR, Ferric uptake regulator family	NA|102aa|down_3|CP032152.1_1884536_1884842_+	pfam10989, DUF2808, Protein of unknown function (DUF2808)	NA|300aa|down_4|CP032152.1_1884843_1885743_+	COG1619, LdcA, Uncharacterized proteins, homologs of microcin C7 resistance protein MccF [Defense mechanisms]	c2c9_V-U4|427aa|down_5|CP032152.1_1885945_1887226_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|131aa|down_6|CP032152.1_1887492_1887885_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|516aa|down_7|CP032152.1_1887980_1889528_+	PRK02504, PRK02504, NAD(P)H-quinone oxidoreductase subunit N	NA|112aa|down_8|CP032152.1_1889580_1889916_+	cd01528, RHOD_2, Member of the Rhodanese Homology Domain superfamily, subgroup 2	NA|341aa|down_9|CP032152.1_1889939_1890962_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]
GCA_003555505.1_ASM355550v1	CP032152	Thermosynechococcus elongatus PKUAC-SCTE542 chromosome, complete genome	6	2003823-2004841	6,5,6	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas2,cas6,cas8b3,cas7	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	Unclear	GTGCCTCTACCTCTGATGCCGTAAGGCGTTGAGCAC,GTGCCTCTACCTCTGATGCCGTAAGGCGTTGAGCAC,GTGCCTCTACCTCTGATGCCGTAAGGCGTTGAGCAC	36,36,36	0	0	NA	NA	I-A,I-B,II-B:I-A,I-B,II-B:I-A,I-B,II-B	14,14,13	14	Unclear	PD-DExK,DinG,csa3,cmr5gr11,cmr3gr5,cas10,c2c9_V-U4,csx1,cas1,cas2,cas6,cas8b3,cas7	NA|71aa|up_7|CP032152.1_1991328_1991541_+,NA|66aa|up_6|CP032152.1_1991541_1991739_+,NA	NA|158aa|up_9|CP032152.1_1986156_1986630_-	pfam00226, DnaJ, DnaJ domain	NA|365aa|up_8|CP032152.1_1987893_1988988_-	COG2805, PilT, Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|71aa|up_7|CP032152.1_1991328_1991541_+	NA	NA|66aa|up_6|CP032152.1_1991541_1991739_+	NA	NA|329aa|up_5|CP032152.1_1993271_1994258_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|331aa|up_4|CP032152.1_1994313_1995306_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|351aa|up_3|CP032152.1_1995897_1996950_-	COG1816, Add, Adenosine deaminase [Nucleotide transport and metabolism]	NA|395aa|up_2|CP032152.1_2000196_2001381_+	PRK11705, PRK11705, cyclopropane fatty acyl phospholipid synthase	cas1|553aa|up_1|CP032152.1_2001524_2003183_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_0|CP032152.1_2003186_2003480_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas6|219aa|down_0|CP032152.1_2005052_2005709_+	pfam09559, Cas6, Cas6 Crispr	cas8b3|509aa|down_1|CP032152.1_2007983_2009510_+	TIGR03485, hypothetical_protein_L8106_30105, CRISPR-associated protein Cas8a1/Csx13, MYXAN subtype	cas7|287aa|down_2|CP032152.1_2009538_2010399_+	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	NA|244aa|down_3|CP032152.1_2016220_2016952_-	cd19165, HemeO, heme oxygenase in eukaryotes and some bacteria	NA|355aa|down_4|CP032152.1_2017223_2018288_+	PRK13654, PRK13654, magnesium-protoporphyrin IX monomethyl ester cyclase; Provisional	NA|101aa|down_5|CP032152.1_2018290_2018593_-	pfam11378, DUF3181, Protein of unknown function (DUF3181)	NA|799aa|down_6|CP032152.1_2018920_2021317_+	PRK09866, PRK09866, clamp-binding protein CrfC	NA|311aa|down_7|CP032152.1_2021276_2022209_+	pfam13990, YjcZ, YjcZ-like protein	NA|355aa|down_8|CP032152.1_2022208_2023273_+	cd07209, Pat_hypo_Ecoli_Z1214_like, Hypothetical patatin similar to Z1214 protein of Escherichia coli	NA|140aa|down_9|CP032152.1_2023332_2023752_-	pfam02531, PsaD, PsaD
