assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001997385.1_ASM199738v1	NZ_CP019633	Sedimentisphaera cyanobacteriorum strain L21-RPul-D3 chromosome, complete genome	1	170623-171844	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas9	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	Type II-A, Type II-B, or Type II-C?,Type II-C,Type II-B	GTTGTAACTTGCTTTCATTTTCATATCGCTTACAAT,GTTGTAACTTGCTTTCATTTTCATATCGCTTACAAT,GTTGTAACTTGCTTTCATTTTCATATCGCTTACAAT	36,36,36	5	5	170659-170687|171055-171084|171450-171478|171581-171610|171647-171676	NZ_CP019633.1_1251026-1250998|NZ_CP019633.1_2569521-2569550|NZ_CP019633.1_1269615-1269643|NZ_CP019633.1_726614-726585|NZ_CP019633.1_726614-726585	NA:NA:NA	18,18,5	18	TypeII-A,TypeII-B,orTypeII-C?,TypeII-C,TypeII-B	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	NA|178aa|up_6|NZ_CP019633.1_165215_165749_+,NA	NA|131aa|up_9|NZ_CP019633.1_161527_161920_+	pfam08308, PEGA, PEGA domain	NA|403aa|up_8|NZ_CP019633.1_161903_163112_+	COG0826, COG0826, Collagenase and related proteases [Posttranslational modification, protein turnover, chaperones]	NA|588aa|up_7|NZ_CP019633.1_163150_164914_+	PRK13507, PRK13507, formate--tetrahydrofolate ligase; Provisional	NA|178aa|up_6|NZ_CP019633.1_165215_165749_+	NA	NA|307aa|up_5|NZ_CP019633.1_165758_166679_-	TIGR00276, iron-sulfur_cluster-binding_protein, epoxyqueuosine reductase	NA|122aa|up_4|NZ_CP019633.1_166949_167315_+	pfam01627, Hpt, Hpt domain	NA|460aa|up_3|NZ_CP019633.1_167475_168855_+	pfam02673, BacA, Bacitracin resistance protein BacA	NA|139aa|up_2|NZ_CP019633.1_168935_169352_+	COG0432, COG0432, Uncharacterized conserved protein [Function unknown]	cas2|111aa|up_1|NZ_CP019633.1_169356_169689_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|300aa|up_0|NZ_CP019633.1_169696_170596_-	cd09720, Cas1_II, CRISPR/Cas system-associated protein Cas1	cas9|1112aa|down_0|NZ_CP019633.1_171926_175262_-	COG3513, COG3513, Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms]	NA|155aa|down_1|NZ_CP019633.1_175593_176058_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|430aa|down_2|NZ_CP019633.1_176432_177722_+	PRK05342, clpX, ATP-dependent Clp protease ATP-binding subunit ClpX	NA|201aa|down_3|NZ_CP019633.1_177739_178342_+	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|403aa|down_4|NZ_CP019633.1_178370_179579_+	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|423aa|down_5|NZ_CP019633.1_179590_180859_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|258aa|down_6|NZ_CP019633.1_180891_181665_+	pfam06044, DpnI, Dam-replacing family	NA|327aa|down_7|NZ_CP019633.1_181695_182676_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|337aa|down_8|NZ_CP019633.1_182706_183717_-	TIGR04337, Radical_SAM, AmmeMemoRadiSam system radical SAM enzyme	NA|165aa|down_9|NZ_CP019633.1_183723_184218_-	pfam05991, NYN_YacP, YacP-like NYN domain
GCF_001997385.1_ASM199738v1	NZ_CP019633	Sedimentisphaera cyanobacteriorum strain L21-RPul-D3 chromosome, complete genome	2	395173-395271	2	CRISPRCasFinder	no	DEDDh	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	Unclear	GTATAACTCACCTTCGGGGTTTCCATCTCTTTTTAT	36	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	NA|50aa|up_6|NZ_CP019633.1_386638_386788_+,NA|166aa|up_3|NZ_CP019633.1_388806_389304_+,NA|221aa|up_1|NZ_CP019633.1_390369_391032_-,NA|124aa|down_0|NZ_CP019633.1_395839_396211_-,NA|52aa|down_2|NZ_CP019633.1_397226_397382_+	NA|485aa|up_9|NZ_CP019633.1_383699_385154_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|87aa|up_8|NZ_CP019633.1_385275_385536_+	TIGR00009, 50S_ribosomal_protein_L28, ribosomal protein L28	NA|284aa|up_7|NZ_CP019633.1_385765_386617_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|50aa|up_6|NZ_CP019633.1_386638_386788_+	NA	NA|347aa|up_5|NZ_CP019633.1_386892_387933_+	PRK13927, PRK13927, rod shape-determining protein MreB; Provisional	NA|276aa|up_4|NZ_CP019633.1_387982_388810_+	PRK13922, PRK13922, rod shape-determining protein MreC; Provisional	NA|166aa|up_3|NZ_CP019633.1_388806_389304_+	NA	NA|74aa|up_2|NZ_CP019633.1_389930_390152_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|221aa|up_1|NZ_CP019633.1_390369_391032_-	NA	NA|1019aa|up_0|NZ_CP019633.1_391056_394113_-	pfam13229, Beta_helix, Right handed beta helix region	NA|124aa|down_0|NZ_CP019633.1_395839_396211_-	NA	NA|192aa|down_1|NZ_CP019633.1_396688_397264_-	COG0619, CbiQ, ABC-type cobalt transport system, permease component CbiQ and related transporters [Inorganic ion transport and metabolism]	NA|52aa|down_2|NZ_CP019633.1_397226_397382_+	NA	NA|400aa|down_3|NZ_CP019633.1_397501_398701_-	PRK05415, PRK05415, hypothetical protein; Provisional	NA|454aa|down_4|NZ_CP019633.1_398712_400074_-	pfam04317, DUF463, YcjX-like family, DUF463	NA|830aa|down_5|NZ_CP019633.1_400113_402603_-	COG3250, LacZ, Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism]	NA|449aa|down_6|NZ_CP019633.1_402800_404147_+	COG3014, COG3014, Uncharacterized protein conserved in bacteria [Function unknown]	NA|141aa|down_7|NZ_CP019633.1_404168_404591_+	cd09030, DUF1425, Putative periplasmic lipoprotein	NA|205aa|down_8|NZ_CP019633.1_404606_405221_+	pfam13036, LpoB, Peptidoglycan-synthase activator LpoB	NA|770aa|down_9|NZ_CP019633.1_405235_407545_+	COG1462, CsgG, Uncharacterized protein involved in formation of curli polymers [Cell envelope biogenesis, outer membrane]
GCF_001997385.1_ASM199738v1	NZ_CP019633	Sedimentisphaera cyanobacteriorum strain L21-RPul-D3 chromosome, complete genome	3	2551794-2551892	3	CRISPRCasFinder	no		cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	Orphan	CTCTCCGGACTTCTCGGAACTTGCAAAACCGATAAC	36	1	1	2551830-2551856	NZ_CP019633.1_1365222-1365248	NA	1	1	Orphan	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	NA|103aa|up_9|NZ_CP019633.1_2542180_2542489_-,NA|61aa|up_8|NZ_CP019633.1_2542516_2542699_+,NA|420aa|up_7|NZ_CP019633.1_2543098_2544358_+,NA|480aa|up_2|NZ_CP019633.1_2547901_2549341_+,NA|205aa|up_1|NZ_CP019633.1_2549499_2550114_+,NA|292aa|down_1|NZ_CP019633.1_2554171_2555047_+	NA|103aa|up_9|NZ_CP019633.1_2542180_2542489_-	NA	NA|61aa|up_8|NZ_CP019633.1_2542516_2542699_+	NA	NA|420aa|up_7|NZ_CP019633.1_2543098_2544358_+	NA	NA|176aa|up_6|NZ_CP019633.1_2544640_2545168_-	pfam13289, SIR2_2, SIR2-like domain	NA|331aa|up_5|NZ_CP019633.1_2545529_2546522_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|132aa|up_4|NZ_CP019633.1_2546614_2547010_+	COG1725, COG1725, Predicted transcriptional regulators [Transcription]	NA|301aa|up_3|NZ_CP019633.1_2547002_2547905_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|480aa|up_2|NZ_CP019633.1_2547901_2549341_+	NA	NA|205aa|up_1|NZ_CP019633.1_2549499_2550114_+	NA	NA|304aa|up_0|NZ_CP019633.1_2550143_2551055_+	COG2159, COG2159, Predicted metal-dependent hydrolase of the TIM-barrel fold [General function prediction only]	NA|304aa|down_0|NZ_CP019633.1_2553254_2554166_+	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|292aa|down_1|NZ_CP019633.1_2554171_2555047_+	NA	NA|405aa|down_2|NZ_CP019633.1_2555544_2556759_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|541aa|down_3|NZ_CP019633.1_2556886_2558509_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|305aa|down_4|NZ_CP019633.1_2558626_2559541_+	COG5421, COG5421, Transposase [DNA replication, recombination, and repair]	NA|323aa|down_5|NZ_CP019633.1_2559877_2560846_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|291aa|down_6|NZ_CP019633.1_2561324_2562197_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|529aa|down_7|NZ_CP019633.1_2563117_2564704_+	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|347aa|down_8|NZ_CP019633.1_2564711_2565752_+	PRK02615, PRK02615, thiamine phosphate synthase	NA|430aa|down_9|NZ_CP019633.1_2565880_2567170_+	COG1253, TlyC, Hemolysins and related proteins containing CBS domains [General function prediction only]
GCF_001997385.1_ASM199738v1	NZ_CP019633	Sedimentisphaera cyanobacteriorum strain L21-RPul-D3 chromosome, complete genome	4	2883530-2883631	4	CRISPRCasFinder	no		cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	Orphan	GTATTCCACTCCCGCGACCCCCTCGGCTATATAGAC	36	0	0	NA	NA	NA	1	1	Orphan	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	NA|217aa|up_9|NZ_CP019633.1_2875994_2876645_+,NA|206aa|up_7|NZ_CP019633.1_2877536_2878154_+,NA|117aa|up_5|NZ_CP019633.1_2879252_2879603_+,NA|110aa|up_3|NZ_CP019633.1_2881354_2881684_+,NA|228aa|up_2|NZ_CP019633.1_2881674_2882358_+,NA|117aa|up_0|NZ_CP019633.1_2882978_2883329_+,NA|161aa|down_0|NZ_CP019633.1_2884215_2884698_+,NA|173aa|down_1|NZ_CP019633.1_2884700_2885219_+	NA|217aa|up_9|NZ_CP019633.1_2875994_2876645_+	NA	NA|309aa|up_8|NZ_CP019633.1_2876610_2877537_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|206aa|up_7|NZ_CP019633.1_2877536_2878154_+	NA	NA|333aa|up_6|NZ_CP019633.1_2878193_2879192_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|117aa|up_5|NZ_CP019633.1_2879252_2879603_+	NA	NA|159aa|up_4|NZ_CP019633.1_2879648_2880125_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|110aa|up_3|NZ_CP019633.1_2881354_2881684_+	NA	NA|228aa|up_2|NZ_CP019633.1_2881674_2882358_+	NA	NA|203aa|up_1|NZ_CP019633.1_2882360_2882969_+	TIGR03696, tRNA_nuclease_WapA, RHS repeat-associated core domain	NA|117aa|up_0|NZ_CP019633.1_2882978_2883329_+	NA	NA|161aa|down_0|NZ_CP019633.1_2884215_2884698_+	NA	NA|173aa|down_1|NZ_CP019633.1_2884700_2885219_+	NA	NA|445aa|down_2|NZ_CP019633.1_2885398_2886733_-	pfam01264, Chorismate_synt, Chorismate synthase	NA|861aa|down_3|NZ_CP019633.1_2886958_2889541_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|1603aa|down_4|NZ_CP019633.1_2889631_2894440_+	pfam01364, Peptidase_C25, Peptidase family C25	NA|210aa|down_5|NZ_CP019633.1_2894576_2895206_-	PRK14822, PRK14822, XTP/dITP diphosphatase	NA|284aa|down_6|NZ_CP019633.1_2895180_2896032_-	COG0564, RluA, Pseudouridylate synthases, 23S RNA-specific [Translation, ribosomal structure and biogenesis]	NA|474aa|down_7|NZ_CP019633.1_2896046_2897468_-	pfam09224, DUF1961, Domain of unknown function (DUF1961)	NA|194aa|down_8|NZ_CP019633.1_2897487_2898069_-	PRK00076, recR, recombination protein RecR; Reviewed	NA|554aa|down_9|NZ_CP019633.1_2898128_2899790_-	PRK05563, PRK05563, DNA polymerase III subunits gamma and tau; Validated
GCF_001997385.1_ASM199738v1	NZ_CP019633	Sedimentisphaera cyanobacteriorum strain L21-RPul-D3 chromosome, complete genome	5	2924612-2925474	5,2,2	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1,cas4,cas12a	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	Type V-A	ATCTACGACAGTAGAAATTTTATAAGGCCTTTAGAC,ATCTACGACAGTAGAAATTTTATAAGGCCTTTAGAC,ATCTACGACAGTAGAAATTTTATAAGGCCTTTAGAC	36,36,36	0	0	NA	NA	V-A:V-A:V-A	13,13,11	13	TypeV-A	cas2,cas1,cas9,DEDDh,cas3,cas3HD,DinG,csa3,RT,cas4,cas12a	NA|143aa|up_7|NZ_CP019633.1_2917986_2918415_-,NA|71aa|up_4|NZ_CP019633.1_2920582_2920795_-,NA|246aa|up_3|NZ_CP019633.1_2920813_2921551_-,NA|105aa|up_2|NZ_CP019633.1_2923143_2923458_-,NA|107aa|up_1|NZ_CP019633.1_2923476_2923797_-,NA|186aa|down_6|NZ_CP019633.1_2935301_2935859_+,NA|146aa|down_9|NZ_CP019633.1_2937671_2938109_+	NA|269aa|up_9|NZ_CP019633.1_2916686_2917493_-	pfam01555, N6_N4_Mtase, DNA methylase	NA|105aa|up_8|NZ_CP019633.1_2917492_2917807_-	pfam13443, HTH_26, Cro/C1-type HTH DNA-binding domain	NA|143aa|up_7|NZ_CP019633.1_2917986_2918415_-	NA	NA|382aa|up_6|NZ_CP019633.1_2918474_2919620_-	PRK05291, trmE, tRNA uridine-5-carboxymethylaminomethyl(34) synthesis GTPase MnmE	NA|266aa|up_5|NZ_CP019633.1_2919619_2920417_-	pfam03309, Pan_kinase, Type III pantothenate kinase	NA|71aa|up_4|NZ_CP019633.1_2920582_2920795_-	NA	NA|246aa|up_3|NZ_CP019633.1_2920813_2921551_-	NA	NA|105aa|up_2|NZ_CP019633.1_2923143_2923458_-	NA	NA|107aa|up_1|NZ_CP019633.1_2923476_2923797_-	NA	NA|49aa|up_0|NZ_CP019633.1_2924442_2924589_+	TIGR04258, hypothetical_protein, four helix bundle suffix domain	cas2|92aa|down_0|NZ_CP019633.1_2925618_2925894_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|320aa|down_1|NZ_CP019633.1_2925905_2926865_-	TIGR04329, hypothetical_protein_SbacW_07302_partial, CRISPR-associated endonuclease Cas1, subtype PREFRAN	cas4|186aa|down_2|NZ_CP019633.1_2926858_2927416_-	TIGR04328, hypothetical_protein_HMPREF9709_01098, CRISPR-associated protein Cas4, subtype PREFRAN	cas12a|1311aa|down_3|NZ_CP019633.1_2927471_2931404_-	TIGR04330, conserved_hypothetical_protein, CRISPR-associated protein Cpf1, subtype PREFRAN	NA|89aa|down_4|NZ_CP019633.1_2932578_2932845_-	pfam01527, HTH_Tnp_1, Transposase	NA|550aa|down_5|NZ_CP019633.1_2933655_2935305_+	cd07384, MPP_Cdc1_like, Saccharomyces cerevisiae CDC1 and related proteins, metallophosphatase domain	NA|186aa|down_6|NZ_CP019633.1_2935301_2935859_+	NA	NA|408aa|down_7|NZ_CP019633.1_2935861_2937085_+	COG0701, COG0701, Predicted permeases [General function prediction only]	NA|147aa|down_8|NZ_CP019633.1_2937244_2937685_+	COG3600, GepA, Uncharacterized phage-associated protein [Function unknown]	NA|146aa|down_9|NZ_CP019633.1_2937671_2938109_+	NA
