assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	1	327814-327958	1	PILER-CR	no	cas14j	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Unclear	TTTTCAGTATACAACTTTATTGTCT	25	2	2	327839-327868|327894-327935	NC_019753.1_738915-738886|NC_019753.1_738860-738819	NA	2	2	TypeV	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|84aa|up_5|NC_019753.1_321771_322023_-,NA|53aa|up_3|NC_019753.1_323088_323247_+,NA|473aa|down_2|NC_019753.1_329102_330521_+,NA|46aa|down_6|NC_019753.1_335992_336130_+	NA|219aa|up_9|NC_019753.1_316482_317139_-	pfam11318, DUF3120, Protein of unknown function (DUF3120)	NA|313aa|up_8|NC_019753.1_317611_318550_+	PRK00281, PRK00281, undecaprenyl-diphosphate phosphatase	NA|82aa|up_7|NC_019753.1_318845_319091_+	CHL00065, psaC, photosystem I subunit VII	NA|633aa|up_6|NC_019753.1_319346_321245_+	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|84aa|up_5|NC_019753.1_321771_322023_-	NA	NA|269aa|up_4|NC_019753.1_322248_323055_+	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|53aa|up_3|NC_019753.1_323088_323247_+	NA	NA|644aa|up_2|NC_019753.1_323314_325246_+	pfam09299, Mu-transpos_C, Mu transposase, C-terminal	NA|533aa|up_1|NC_019753.1_325235_326834_+	pfam13401, AAA_22, AAA domain	NA|300aa|up_0|NC_019753.1_326848_327748_+	pfam06527, TniQ, TniQ	NA|104aa|down_0|NC_019753.1_328100_328412_-	pfam14072, DndB, DNA-sulfur modification-associated	NA|225aa|down_1|NC_019753.1_328431_329106_+	pfam07929, PRiA4_ORF3, Plasmid pRiA4b ORF-3-like protein	NA|473aa|down_2|NC_019753.1_329102_330521_+	NA	NA|211aa|down_3|NC_019753.1_330647_331280_+	cd01197, INT_FimBE_like, FimB and FimE and related proteins, integrase/recombinases	NA|699aa|down_4|NC_019753.1_332211_334308_-	pfam01804, Penicil_amidase, Penicillin amidase	NA|441aa|down_5|NC_019753.1_334363_335686_-	cd06173, MFS_MefA_like, Macrolide efflux protein A and similar proteins of the Major Facilitator Superfamily of transporters	NA|46aa|down_6|NC_019753.1_335992_336130_+	NA	NA|2853aa|down_7|NC_019753.1_336235_344794_+	PRK12467, PRK12467, peptide synthase; Provisional	NA|2677aa|down_8|NC_019753.1_344842_352873_+	PRK12467, PRK12467, peptide synthase; Provisional	NA|1560aa|down_9|NC_019753.1_352874_357554_+	PRK12316, PRK12316, peptide synthase; Provisional
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	2	653799-656145	2,1,1	PILER-CR,CRISPRCasFinder,CRT	no	csx3,cas10,csm3gr7,csx19,csm6,cas6	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type III-B,Type III-D,Type III-A,Type III-C	CTTCCACTAACCATTTCCCCGTAAGGGGACGGAAAC,CTTCCACTAACCATTTCCCCGTAAGGGGACGGAAAC,CTTCCACTAACCATTTCCCCGTAAGGGGACGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	31,31,31	31	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|256aa|up_9|NC_019753.1_642625_643393_+,NA|67aa|up_8|NC_019753.1_643446_643647_+,NA|325aa|up_7|NC_019753.1_643680_644655_+,NA|203aa|up_5|NC_019753.1_646406_647015_+,NA|117aa|up_3|NC_019753.1_647555_647906_+,NA|308aa|up_2|NC_019753.1_647875_648799_+,NA|170aa|up_0|NC_019753.1_652413_652923_+,NA|56aa|down_0|NC_019753.1_659668_659836_-,csx3|109aa|down_1|NC_019753.1_659834_660161_+,NA|182aa|down_9|NC_019753.1_675374_675920_+	NA|256aa|up_9|NC_019753.1_642625_643393_+	NA	NA|67aa|up_8|NC_019753.1_643446_643647_+	NA	NA|325aa|up_7|NC_019753.1_643680_644655_+	NA	NA|491aa|up_6|NC_019753.1_644771_646244_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|203aa|up_5|NC_019753.1_646406_647015_+	NA	NA|129aa|up_4|NC_019753.1_647108_647495_+	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	NA|117aa|up_3|NC_019753.1_647555_647906_+	NA	NA|308aa|up_2|NC_019753.1_647875_648799_+	NA	NA|230aa|up_1|NC_019753.1_649781_650471_+	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|170aa|up_0|NC_019753.1_652413_652923_+	NA	NA|56aa|down_0|NC_019753.1_659668_659836_-	NA	csx3|109aa|down_1|NC_019753.1_659834_660161_+	NA	cas10|560aa|down_2|NC_019753.1_660857_662537_+	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	csm3gr7|771aa|down_3|NC_019753.1_662600_664913_+	pfam03787, RAMPs, RAMP superfamily	csm3gr7|510aa|down_4|NC_019753.1_664920_666450_+	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx19|195aa|down_5|NC_019753.1_666442_667027_+	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|781aa|down_6|NC_019753.1_667026_669369_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csm6|381aa|down_7|NC_019753.1_672071_673214_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	cas6|396aa|down_8|NC_019753.1_673232_674420_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_9|NC_019753.1_675374_675920_+	NA
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	3	657331-657509	2	CRISPRCasFinder	no	csx3,cas10,csm3gr7,csx19,csm6,cas6	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type III-B,Type III-D,Type III-A,Type III-C	CTTCCACTAACCATTTCCCCGTAAGGGGACGGAAAC	36	0	0	NA	NA	NA	2	2	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|256aa|up_9|NC_019753.1_642625_643393_+,NA|67aa|up_8|NC_019753.1_643446_643647_+,NA|325aa|up_7|NC_019753.1_643680_644655_+,NA|203aa|up_5|NC_019753.1_646406_647015_+,NA|117aa|up_3|NC_019753.1_647555_647906_+,NA|308aa|up_2|NC_019753.1_647875_648799_+,NA|170aa|up_0|NC_019753.1_652413_652923_+,NA|56aa|down_0|NC_019753.1_659668_659836_-,csx3|109aa|down_1|NC_019753.1_659834_660161_+,NA|182aa|down_9|NC_019753.1_675374_675920_+	NA|256aa|up_9|NC_019753.1_642625_643393_+	NA	NA|67aa|up_8|NC_019753.1_643446_643647_+	NA	NA|325aa|up_7|NC_019753.1_643680_644655_+	NA	NA|491aa|up_6|NC_019753.1_644771_646244_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|203aa|up_5|NC_019753.1_646406_647015_+	NA	NA|129aa|up_4|NC_019753.1_647108_647495_+	pfam05635, 23S_rRNA_IVP, 23S rRNA-intervening sequence protein	NA|117aa|up_3|NC_019753.1_647555_647906_+	NA	NA|308aa|up_2|NC_019753.1_647875_648799_+	NA	NA|230aa|up_1|NC_019753.1_649781_650471_+	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|170aa|up_0|NC_019753.1_652413_652923_+	NA	NA|56aa|down_0|NC_019753.1_659668_659836_-	NA	csx3|109aa|down_1|NC_019753.1_659834_660161_+	NA	cas10|560aa|down_2|NC_019753.1_660857_662537_+	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	csm3gr7|771aa|down_3|NC_019753.1_662600_664913_+	pfam03787, RAMPs, RAMP superfamily	csm3gr7|510aa|down_4|NC_019753.1_664920_666450_+	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx19|195aa|down_5|NC_019753.1_666442_667027_+	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|781aa|down_6|NC_019753.1_667026_669369_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csm6|381aa|down_7|NC_019753.1_672071_673214_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	cas6|396aa|down_8|NC_019753.1_673232_674420_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_9|NC_019753.1_675374_675920_+	NA
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	4	669542-671689	3,2,3	CRISPRCasFinder,CRT,PILER-CR	no	csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type III-B,Type III-D,Type III-A,Type III-C	GTTTCCGTCCCCTTGCGGGGAAAGTGGATTCGAGAC,GTTTCCGTCCCCTTGCGGGGAAAGTGGATTCGAGAC,GTCTCGAATCCACTTTCCCCGCAAGGGGACGGAAAC	36,36,36	0	0	NA	NA	NA:NA:NA	28,28,26	28	TypeIII-B,TypeIII-D,TypeIII-A,TypeIII-C	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|308aa|up_9|NC_019753.1_647875_648799_+,NA|170aa|up_7|NC_019753.1_652413_652923_+,NA|56aa|up_6|NC_019753.1_659668_659836_-,csx3|109aa|up_5|NC_019753.1_659834_660161_+,NA|182aa|down_2|NC_019753.1_675374_675920_+,NA|289aa|down_5|NC_019753.1_679270_680137_-,csx3|260aa|down_9|NC_019753.1_684968_685748_+	NA|308aa|up_9|NC_019753.1_647875_648799_+	NA	NA|230aa|up_8|NC_019753.1_649781_650471_+	cd14840, D-Ala-D-Ala_dipeptidase_Aad, D-Ala-D-Ala dipeptidase (includes Lactobacillus plantarum Aad peptidase)	NA|170aa|up_7|NC_019753.1_652413_652923_+	NA	NA|56aa|up_6|NC_019753.1_659668_659836_-	NA	csx3|109aa|up_5|NC_019753.1_659834_660161_+	NA	cas10|560aa|up_4|NC_019753.1_660857_662537_+	COG1353, COG1353, Predicted CRISPR-associated polymerase [Defense mechanisms]	csm3gr7|771aa|up_3|NC_019753.1_662600_664913_+	pfam03787, RAMPs, RAMP superfamily	csm3gr7|510aa|up_2|NC_019753.1_664920_666450_+	cd09726, RAMP_I_III, CRISPR/Cas system-associated RAMP superfamily protein	csx19|195aa|up_1|NC_019753.1_666442_667027_+	TIGR03984, hypothetical_protein_FrEUN1fDRAFT_5778, CRISPR-associated protein, TIGR03984 family	csm3gr7|781aa|up_0|NC_019753.1_667026_669369_+	TIGR03986, CRISPR-associated_protein, CRISPR-associated protein	csm6|381aa|down_0|NC_019753.1_672071_673214_-	cd09742, Csm6_III-A, CRISPR/Cas system-associated protein Csm6	cas6|396aa|down_1|NC_019753.1_673232_674420_-	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	NA|182aa|down_2|NC_019753.1_675374_675920_+	NA	NA|350aa|down_3|NC_019753.1_676207_677257_-	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|591aa|down_4|NC_019753.1_677288_679061_-	pfam00004, AAA, ATPase family associated with various cellular activities (AAA)	NA|289aa|down_5|NC_019753.1_679270_680137_-	NA	NA|339aa|down_6|NC_019753.1_680352_681369_-	TIGR02210, Rod_shape-determining_protein_RodA, rod shape-determining protein RodA	NA|283aa|down_7|NC_019753.1_681371_682220_-	pfam05685, Uma2, Putative restriction endonuclease	WYL|519aa|down_8|NC_019753.1_682774_684331_-	TIGR03985, hypothetical_protein_sll7078, CRISPR-associated protein, TIGR03985 family	csx3|260aa|down_9|NC_019753.1_684968_685748_+	NA
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	5	831026-831188	4	CRISPRCasFinder	no	cas14j	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Unclear	CAGAGTTGATGGAAATCCCTTTTTGGCAAGCAATGATTAGAGAAGATATTTC	52	0	0	NA	NA	NA	1	1	TypeV	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA,NA|274aa|down_4|NC_019753.1_836096_836918_+,NA|374aa|down_5|NC_019753.1_837131_838253_-	NA|367aa|up_9|NC_019753.1_817587_818688_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|77aa|up_8|NC_019753.1_818890_819121_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|137aa|up_7|NC_019753.1_819123_819534_+	cd18748, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	NA|242aa|up_6|NC_019753.1_820332_821058_+	cd07709, flavodiiron_proteins_MBL-fold, catalytic domain of flavodiiron proteins (FDPs) and related proteins; MBL-fold metallo-hydrolase domain	NA|301aa|up_5|NC_019753.1_821706_822609_+	cd05259, PCBER_SDR_a, phenylcoumaran benzylic ether reductase (PCBER) like, atypical (a) SDRs	NA|343aa|up_4|NC_019753.1_824703_825732_+	PRK05451, PRK05451, dihydroorotase; Provisional	NA|335aa|up_3|NC_019753.1_825848_826853_-	TIGR03558, oxido_grp_1, luciferase family oxidoreductase, group 1	NA|249aa|up_2|NC_019753.1_826896_827643_-	COG2085, COG2085, Predicted dinucleotide-binding enzymes [General function prediction only]	NA|159aa|up_1|NC_019753.1_827715_828192_-	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	cas14j|403aa|up_0|NC_019753.1_828893_830102_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|403aa|down_0|NC_019753.1_831753_832962_+	pfam05023, Phytochelatin, Phytochelatin synthase	NA|268aa|down_1|NC_019753.1_833101_833905_-	COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]	NA|222aa|down_2|NC_019753.1_834078_834744_-	cd05325, carb_red_sniffer_like_SDR_c, carbonyl reductase sniffer-like, classical (c) SDRs	NA|327aa|down_3|NC_019753.1_834869_835850_-	cd19079, AKR_EcYajO-like, Escherichia coli YajO and similar proteins	NA|274aa|down_4|NC_019753.1_836096_836918_+	NA	NA|374aa|down_5|NC_019753.1_837131_838253_-	NA	NA|154aa|down_6|NC_019753.1_840236_840698_+	pfam10979, DUF2786, Protein of unknown function (DUF2786)	NA|741aa|down_7|NC_019753.1_840878_843101_+	PRK13341, PRK13341, AAA family ATPase	NA|494aa|down_8|NC_019753.1_844725_846207_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|356aa|down_9|NC_019753.1_846592_847660_+	pfam02317, Octopine_DH, NAD/NADP octopine/nopaline dehydrogenase, alpha-helical domain
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	6	992622-993675	4,5,3	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas2,cas14j	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type I-D	GCGAAAATAGCTAATAATCCCTTTTAGGGATTGAAAC,GCGAAAATAGCTAATAATCCCTTTTAGGGATTGAAAC,GCGAAAATAGCTAATAATCCCTTTTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	14,14,14	14	TypeI-D,TypeV	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA,NA|63aa|down_3|NC_019753.1_996924_997113_-	NA|153aa|up_9|NC_019753.1_980602_981061_+	cd04586, CBS_pair_BON_assoc, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the BON (bacterial OsmY and nodulation domain) domain	NA|222aa|up_8|NC_019753.1_981092_981758_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	WYL|287aa|up_7|NC_019753.1_981822_982683_-	pfam13280, WYL, WYL domain	cas3|670aa|up_6|NC_019753.1_983011_985021_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	cas10d|977aa|up_5|NC_019753.1_985031_987962_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	csc2gr7|333aa|up_4|NC_019753.1_987974_988973_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|223aa|up_3|NC_019753.1_988976_989645_+	TIGR03159, cas_Csc1, CRISPR type I-D/CYANO-associated protein Csc1	cas6|274aa|up_2|NC_019753.1_989656_990478_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|197aa|up_1|NC_019753.1_990474_991065_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas2|90aa|up_0|NC_019753.1_992098_992368_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|253aa|down_0|NC_019753.1_993887_994646_-	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|483aa|down_1|NC_019753.1_994826_996275_+	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|153aa|down_2|NC_019753.1_996371_996830_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|63aa|down_3|NC_019753.1_996924_997113_-	NA	NA|496aa|down_4|NC_019753.1_997163_998651_-	cd17486, MFS_AmpG_like, AmpG and similar transporters of the Major Facilitator Superfamily	NA|211aa|down_5|NC_019753.1_998738_999371_-	COG1974, LexA, SOS-response transcriptional repressors (RecA-mediated autopeptidases) [Transcription / Signal transduction mechanisms]	NA|308aa|down_6|NC_019753.1_999758_1000682_-	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|561aa|down_7|NC_019753.1_1001058_1002741_+	COG4188, COG4188, Predicted dienelactone hydrolase [General function prediction only]	NA|200aa|down_8|NC_019753.1_1002806_1003406_+	pfam13424, TPR_12, Tetratricopeptide repeat	NA|192aa|down_9|NC_019753.1_1003520_1004096_+	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	7	1168466-1168834	6,4,5	CRISPRCasFinder,CRT,PILER-CR	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	GTTTCCGTCCCCTTGCGGGGAAAAGATAGTATCTGAC,GTTTCCGTCCCCTTGCGGGGAAAAGATAGTATCTGAC,GTCAGATACTATCT---------TTTCCCCGCAAGGGGACGGAAAC	37,37,46	0	0	NA	NA	NA:NA:NA	4,4,4	4	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|101aa|up_9|NC_019753.1_1156466_1156769_-,NA|184aa|up_2|NC_019753.1_1166373_1166925_+,NA|134aa|down_3|NC_019753.1_1171442_1171844_-	NA|101aa|up_9|NC_019753.1_1156466_1156769_-	NA	NA|297aa|up_8|NC_019753.1_1157838_1158729_+	cd09025, Aldose_epim_Slr1438, Aldose 1-epimerase, similar to Synechocystis Slr1438	NA|358aa|up_7|NC_019753.1_1159032_1160106_+	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|158aa|up_6|NC_019753.1_1160293_1160767_-	COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms]	NA|311aa|up_5|NC_019753.1_1161441_1162374_+	pfam12920, TcdA_TcdB_pore, TcdA/TcdB pore forming domain	NA|887aa|up_4|NC_019753.1_1162826_1165487_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|104aa|up_3|NC_019753.1_1165822_1166134_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|184aa|up_2|NC_019753.1_1166373_1166925_+	NA	NA|326aa|up_1|NC_019753.1_1167081_1168059_+	PRK00089, era, GTPase Era; Reviewed	NA|105aa|up_0|NC_019753.1_1168070_1168385_+	pfam16277, DUF4926, Domain of unknown function (DUF4926)	NA|367aa|down_0|NC_019753.1_1169175_1170276_-	PRK00002, aroB, 3-dehydroquinate synthase; Reviewed	NA|32aa|down_1|NC_019753.1_1170369_1170465_+	pfam05115, PetL, Cytochrome B6-F complex subunit VI (PetL)	NA|261aa|down_2|NC_019753.1_1170526_1171309_-	PLN03100, PLN03100, Permease subunit of ER-derived-lipid transporter; Provisional	NA|134aa|down_3|NC_019753.1_1171442_1171844_-	NA	NA|271aa|down_4|NC_019753.1_1172600_1173413_+	cd07572, nit, Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases)	NA|136aa|down_5|NC_019753.1_1173729_1174137_-	COG3755, COG3755, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|down_6|NC_019753.1_1174372_1175770_-	pfam11850, DUF3370, Protein of unknown function (DUF3370)	NA|201aa|down_7|NC_019753.1_1175770_1176373_-	cd00501, Peptidase_C15, Pyroglutamyl peptidase (PGP) type I, also known as pyrrolidone carboxyl peptidase (pcp) type I:  Enzymes responsible for cleaving pyroglutamate (pGlu) from the N-terminal end of specialized proteins	NA|210aa|down_8|NC_019753.1_1176496_1177126_+	PRK13141, hisH, imidazole glycerol phosphate synthase subunit HisH; Provisional	NA|189aa|down_9|NC_019753.1_1177202_1177769_+	COG0742, COG0742, N6-adenine-specific methylase [DNA replication, recombination, and repair]
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	8	1339615-1339713	7	CRISPRCasFinder	no	c2c9_V-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type V-U4	TTAGCTGCAATTGCGATGCTTCTGCTGGTTCATCTA	36	0	0	NA	NA	NA	1	1	TypeV-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|163aa|up_9|NC_019753.1_1328870_1329359_+,NA|142aa|up_6|NC_019753.1_1330976_1331402_-,NA|67aa|up_3|NC_019753.1_1333252_1333453_+,NA|210aa|down_5|NC_019753.1_1349676_1350306_-	NA|163aa|up_9|NC_019753.1_1328870_1329359_+	NA	NA|238aa|up_8|NC_019753.1_1329548_1330262_-	cd02910, cupin_Yhhw_N, Escherichia coli YhhW and YhaK and related proteins, pirin-like bicupin, N-terminal cupin domain	NA|141aa|up_7|NC_019753.1_1330514_1330937_+	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|142aa|up_6|NC_019753.1_1330976_1331402_-	NA	NA|261aa|up_5|NC_019753.1_1331667_1332450_+	PLN03084, PLN03084, alpha/beta hydrolase fold protein; Provisional	NA|148aa|up_4|NC_019753.1_1332576_1333020_+	TIGR03042, hypothetical_protein, photosystem II protein PsbQ	NA|67aa|up_3|NC_019753.1_1333252_1333453_+	NA	NA|134aa|up_2|NC_019753.1_1333443_1333845_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	NA|368aa|up_1|NC_019753.1_1333904_1335008_+	pfam01266, DAO, FAD dependent oxidoreductase	NA|692aa|up_0|NC_019753.1_1335245_1337321_-	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|992aa|down_0|NC_019753.1_1343189_1346165_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|177aa|down_1|NC_019753.1_1346234_1346765_-	cd00732, CheW, CheW, a small regulator protein, unique to the chemotaxis signalling in prokaryotes and archea	NA|122aa|down_2|NC_019753.1_1346771_1347137_-	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|425aa|down_3|NC_019753.1_1347273_1348548_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|129aa|down_4|NC_019753.1_1349069_1349456_+	cd08352, VOC_Bs_YwkD_like, vicinal oxygen chelate (VOC) family protein  Bacillus subtilis YwkD and similar proteins	NA|210aa|down_5|NC_019753.1_1349676_1350306_-	NA	NA|304aa|down_6|NC_019753.1_1350988_1351900_-	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|286aa|down_7|NC_019753.1_1352380_1353238_+	cd06582, TM_PBP1_LivH_like, Transmembrane subunit (TM) of Escherichia coli LivH and related proteins	NA|48aa|down_8|NC_019753.1_1353381_1353525_-	pfam08078, PsaX, PsaX family	NA|325aa|down_9|NC_019753.1_1353621_1354596_-	PRK12928, PRK12928, lipoyl synthase; Provisional
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	9	1564327-1564437	8	CRISPRCasFinder	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	GGAATTATAGTTGGAGCTAGAGTTGGAGTTGCATTTGGA	39	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|59aa|up_9|NC_019753.1_1550918_1551095_+,NA|368aa|up_6|NC_019753.1_1554793_1555897_+,NA|190aa|down_1|NC_019753.1_1566821_1567391_+,NA|112aa|down_6|NC_019753.1_1577973_1578309_+,NA|106aa|down_8|NC_019753.1_1579200_1579518_-	NA|59aa|up_9|NC_019753.1_1550918_1551095_+	NA	NA|470aa|up_8|NC_019753.1_1551177_1552587_+	COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|657aa|up_7|NC_019753.1_1552605_1554576_-	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|368aa|up_6|NC_019753.1_1554793_1555897_+	NA	NA|293aa|up_5|NC_019753.1_1556154_1557033_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|249aa|up_4|NC_019753.1_1557201_1557948_-	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|341aa|up_3|NC_019753.1_1558070_1559093_+	cd01137, PsaA, Metal binding protein PsaA	NA|143aa|up_2|NC_019753.1_1559569_1559998_+	pfam00875, DNA_photolyase, DNA photolyase	NA|448aa|up_1|NC_019753.1_1560154_1561498_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|594aa|up_0|NC_019753.1_1562249_1564031_-	PRK06354, PRK06354, pyruvate kinase; Provisional	NA|445aa|down_0|NC_019753.1_1565271_1566606_-	COG2133, COG2133, Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]	NA|190aa|down_1|NC_019753.1_1566821_1567391_+	NA	NA|614aa|down_2|NC_019753.1_1567423_1569265_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|718aa|down_3|NC_019753.1_1569604_1571758_-	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|1538aa|down_4|NC_019753.1_1572249_1576863_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|290aa|down_5|NC_019753.1_1576963_1577833_+	COG4279, COG4279, Uncharacterized conserved protein [Function unknown]	NA|112aa|down_6|NC_019753.1_1577973_1578309_+	NA	NA|243aa|down_7|NC_019753.1_1578428_1579157_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|106aa|down_8|NC_019753.1_1579200_1579518_-	NA	NA|62aa|down_9|NC_019753.1_1580033_1580219_-	pfam08369, PCP_red, Proto-chlorophyllide reductase 57 kD subunit
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	10	1991876-1992057	9	CRISPRCasFinder	no	csa3,c2c9_V-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	 Type V-U4?,Type I-A	GGTGGTAAGTATTGCGGTTGGGTTGTAGGTTG	32	0	0	NA	NA	NA	2	2	TypeV-U4?,TypeI-A	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|173aa|up_9|NC_019753.1_1983302_1983821_+,NA|104aa|up_8|NC_019753.1_1983873_1984185_+,NA|323aa|up_6|NC_019753.1_1986549_1987518_-,NA|56aa|up_5|NC_019753.1_1987529_1987697_-,NA|255aa|up_4|NC_019753.1_1987768_1988533_-,NA|112aa|up_3|NC_019753.1_1988529_1988865_-,NA|119aa|up_2|NC_019753.1_1988864_1989221_-,NA|115aa|up_1|NC_019753.1_1989267_1989612_-,NA|271aa|up_0|NC_019753.1_1989732_1990545_-,NA|110aa|down_3|NC_019753.1_1995634_1995964_+,NA|234aa|down_9|NC_019753.1_2000779_2001481_-	NA|173aa|up_9|NC_019753.1_1983302_1983821_+	NA	NA|104aa|up_8|NC_019753.1_1983873_1984185_+	NA	NA|757aa|up_7|NC_019753.1_1984168_1986439_+	PRK07773, PRK07773, replicative DNA helicase; Validated	NA|323aa|up_6|NC_019753.1_1986549_1987518_-	NA	NA|56aa|up_5|NC_019753.1_1987529_1987697_-	NA	NA|255aa|up_4|NC_019753.1_1987768_1988533_-	NA	NA|112aa|up_3|NC_019753.1_1988529_1988865_-	NA	NA|119aa|up_2|NC_019753.1_1988864_1989221_-	NA	NA|115aa|up_1|NC_019753.1_1989267_1989612_-	NA	NA|271aa|up_0|NC_019753.1_1989732_1990545_-	NA	NA|201aa|down_0|NC_019753.1_1992832_1993435_+	cd03769, SR_IS607_transposase_like, Serine Recombinase (SR) family, IS607-like transposase subfamily, catalytic domain; members contain a DNA binding domain with homology to MerR/SoxR located N-terminal to the catalytic domain	c2c9_V-U4|403aa|down_1|NC_019753.1_1993418_1994627_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|80aa|down_2|NC_019753.1_1995155_1995395_-	pfam08872, KGK, KGK domain	NA|110aa|down_3|NC_019753.1_1995634_1995964_+	NA	NA|118aa|down_4|NC_019753.1_1996066_1996420_+	COG4980, GvpP, Gas vesicle protein [General function prediction only]	NA|268aa|down_5|NC_019753.1_1997327_1998131_+	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|210aa|down_6|NC_019753.1_1998293_1998923_+	PRK05953, PRK05953, Precorrin-8X methylmutase	NA|137aa|down_7|NC_019753.1_1998993_1999404_-	pfam01797, Y1_Tnp, Transposase IS200 like	c2c9_V-U4|427aa|down_8|NC_019753.1_1999432_2000713_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|234aa|down_9|NC_019753.1_2000779_2001481_-	NA
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	11	2133176-2133283	10	CRISPRCasFinder	no	c2c9_V-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type V-U4	GGCGGAGTACTAGCAACGGGCGCGGTAAAT	30	0	0	NA	NA	NA	1	1	TypeV-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|190aa|up_8|NC_019753.1_2116271_2116841_+,NA|120aa|down_5|NC_019753.1_2143201_2143561_+	NA|357aa|up_9|NC_019753.1_2115248_2116319_-	cd07987, LPLAT_MGAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: MGAT-like	NA|190aa|up_8|NC_019753.1_2116271_2116841_+	NA	NA|147aa|up_7|NC_019753.1_2116921_2117362_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	c2c9_V-U4|405aa|up_6|NC_019753.1_2117412_2118627_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|719aa|up_5|NC_019753.1_2118790_2120947_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|103aa|up_4|NC_019753.1_2121474_2121783_+	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|139aa|up_3|NC_019753.1_2127675_2128092_-	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|384aa|up_2|NC_019753.1_2128191_2129343_+	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|109aa|up_1|NC_019753.1_2129473_2129800_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|664aa|up_0|NC_019753.1_2129928_2131920_-	TIGR03030, Cellulose_synthase_UDP-forming, cellulose synthase catalytic subunit (UDP-forming)	NA|476aa|down_0|NC_019753.1_2136192_2137620_-	pfam11329, DUF3131, Protein of unknown function (DUF3131)	NA|481aa|down_1|NC_019753.1_2137878_2139321_-	PRK07994, PRK07994, DNA polymerase III subunits gamma and tau; Validated	NA|447aa|down_2|NC_019753.1_2139562_2140903_+	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|114aa|down_3|NC_019753.1_2140989_2141331_+	PRK05943, PRK05943, 50S ribosomal protein L25; Reviewed	NA|515aa|down_4|NC_019753.1_2141577_2143122_+	COG2187, COG2187, Uncharacterized protein conserved in bacteria [Function unknown]	NA|120aa|down_5|NC_019753.1_2143201_2143561_+	NA	NA|264aa|down_6|NC_019753.1_2143663_2144455_+	cd06259, YdcF-like, YdcF-like	NA|487aa|down_7|NC_019753.1_2144455_2145916_-	pfam15611, EH_Signature, EH_Signature domain	NA|240aa|down_8|NC_019753.1_2145876_2146596_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|727aa|down_9|NC_019753.1_2146599_2148780_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	12	2914018-2914126	11	CRISPRCasFinder	no	c2c5_V-U5	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type V-U5	GTTTCAACTACCATCCCAACTAGGGGTGGGTTGAAAG	37	0	0	NA	NA	V-U5	1	1	TypeV-U5	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|202aa|up_9|NC_019753.1_2901901_2902507_-,NA|199aa|up_3|NC_019753.1_2908111_2908708_-,c2c5_V-U5|615aa|up_0|NC_019753.1_2911657_2913502_+,NA	NA|202aa|up_9|NC_019753.1_2901901_2902507_-	NA	NA|116aa|up_8|NC_019753.1_2902506_2902854_-	pfam14213, DUF4325, STAS-like domain of unknown function (DUF4325)	NA|292aa|up_7|NC_019753.1_2902870_2903746_-	smart00387, HATPase_c, Histidine kinase-like ATPases	NA|142aa|up_6|NC_019753.1_2903833_2904259_-	cd17249, RMtype1_S_EcoR124I-TRD2-CR2_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|241aa|up_5|NC_019753.1_2904219_2904942_-	cd16961, RMtype1_S_TRD-CR_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR) and similar domains	NA|381aa|up_4|NC_019753.1_2905540_2906683_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|199aa|up_3|NC_019753.1_2908111_2908708_-	NA	NA|797aa|up_2|NC_019753.1_2908751_2911142_-	COG4096, HsdR, Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms]	NA|147aa|up_1|NC_019753.1_2911148_2911589_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	c2c5_V-U5|615aa|up_0|NC_019753.1_2911657_2913502_+	NA	NA|277aa|down_0|NC_019753.1_2914501_2915332_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|241aa|down_1|NC_019753.1_2915424_2916147_-	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like	NA|531aa|down_2|NC_019753.1_2916470_2918063_+	PRK11893, PRK11893, methionyl-tRNA synthetase; Reviewed	NA|202aa|down_3|NC_019753.1_2918162_2918768_+	cd10911, PIN_LabA, PIN domain of Synechococcus elongatus LabA (low-amplitude and bright) and related proteins	NA|397aa|down_4|NC_019753.1_2919090_2920281_+	TIGR04409, LptC_YrbK, LPS export ABC transporter periplasmic protein LptC	NA|441aa|down_5|NC_019753.1_2920294_2921617_-	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|345aa|down_6|NC_019753.1_2921787_2922822_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|337aa|down_7|NC_019753.1_2922827_2923838_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|300aa|down_8|NC_019753.1_2924067_2924967_+	TIGR00128, Malonyl_CoA-acyl_carrier_protein_transacylase, malonyl CoA-acyl carrier protein transacylase	NA|218aa|down_9|NC_019753.1_2924953_2925607_+	cd07989, LPLAT_AGPAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: AGPAT-like
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	13	3333723-3333813	12	CRISPRCasFinder	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	AAATTCTGATAGTGCGAGCATCT	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|63aa|up_2|NC_019753.1_3332114_3332303_+,NA|470aa|down_4|NC_019753.1_3341717_3343127_-,NA|309aa|down_5|NC_019753.1_3343364_3344291_+	NA|316aa|up_9|NC_019753.1_3320503_3321451_-	COG4191, COG4191, Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]	NA|343aa|up_8|NC_019753.1_3321709_3322738_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|892aa|up_7|NC_019753.1_3323275_3325951_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|617aa|up_6|NC_019753.1_3325947_3327798_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|366aa|up_5|NC_019753.1_3328111_3329209_+	cd17552, REC_RR468-like, phosphoacceptor receiver (REC) domain of Thermotoga maritima response regulator RR468 and similar domains	NA|470aa|up_4|NC_019753.1_3329258_3330668_+	PRK12273, aspA, aspartate ammonia-lyase; Provisional	NA|323aa|up_3|NC_019753.1_3330780_3331749_+	COG2006, COG2006, Uncharacterized conserved protein [Function unknown]	NA|63aa|up_2|NC_019753.1_3332114_3332303_+	NA	NA|173aa|up_1|NC_019753.1_3332623_3333142_+	cd00412, pyrophosphatase, Inorganic pyrophosphatase	NA|139aa|up_0|NC_019753.1_3333275_3333692_+	pfam02261, Asp_decarbox, Aspartate decarboxylase	NA|150aa|down_0|NC_019753.1_3334048_3334498_-	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|354aa|down_1|NC_019753.1_3334711_3335773_-	PRK07394, PRK07394, hypothetical protein; Provisional	NA|330aa|down_2|NC_019753.1_3335825_3336815_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|1487aa|down_3|NC_019753.1_3337030_3341491_-	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain	NA|470aa|down_4|NC_019753.1_3341717_3343127_-	NA	NA|309aa|down_5|NC_019753.1_3343364_3344291_+	NA	NA|429aa|down_6|NC_019753.1_3344426_3345713_-	pfam13646, HEAT_2, HEAT repeats	NA|82aa|down_7|NC_019753.1_3345758_3346004_-	pfam14279, HNH_5, HNH endonuclease	NA|130aa|down_8|NC_019753.1_3346015_3346405_-	pfam08872, KGK, KGK domain	NA|160aa|down_9|NC_019753.1_3346667_3347147_+	COG1285, SapB, Uncharacterized membrane protein [Function unknown]
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	14	3468402-3468506	13	CRISPRCasFinder	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	AATACTCATAAACCCGCCCATCAACA	26	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|77aa|up_1|NC_019753.1_3467571_3467802_+,NA|117aa|down_0|NC_019753.1_3468835_3469186_-,NA|80aa|down_3|NC_019753.1_3474980_3475220_-	NA|270aa|up_9|NC_019753.1_3457321_3458131_+	TIGR03518, ABC_transporter_permease_protein, gliding motility-associated ABC transporter permease protein GldF	NA|521aa|up_8|NC_019753.1_3458195_3459758_+	COG3225, GldG, ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]	NA|217aa|up_7|NC_019753.1_3459901_3460552_+	pfam14238, DUF4340, Domain of unknown function (DUF4340)	NA|593aa|up_6|NC_019753.1_3460764_3462543_-	TIGR03423, pbp2_mrdA, penicillin-binding protein 2	NA|377aa|up_5|NC_019753.1_3462844_3463975_+	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|367aa|up_4|NC_019753.1_3464098_3465199_+	cd13590, PBP2_PotD_PotF_like, The periplasmic-binding component of ABC transporters involved in uptake of polyamines; possess the type 2 periplasmic binding fold	NA|303aa|up_3|NC_019753.1_3465363_3466272_+	COG1176, PotB, ABC-type spermidine/putrescine transport system, permease component I [Amino acid transport and metabolism]	NA|279aa|up_2|NC_019753.1_3466317_3467154_+	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|77aa|up_1|NC_019753.1_3467571_3467802_+	NA	NA|120aa|up_0|NC_019753.1_3467788_3468148_+	pfam02452, PemK_toxin, PemK-like, MazF-like toxin of type II toxin-antitoxin system	NA|117aa|down_0|NC_019753.1_3468835_3469186_-	NA	NA|1231aa|down_1|NC_019753.1_3469261_3472954_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|572aa|down_2|NC_019753.1_3472953_3474669_-	pfam14516, AAA_35, AAA-like domain	NA|80aa|down_3|NC_019753.1_3474980_3475220_-	NA	NA|327aa|down_4|NC_019753.1_3475319_3476300_-	COG4371, COG4371, Predicted membrane protein [Function unknown]	NA|74aa|down_5|NC_019753.1_3476618_3476840_-	PRK07440, PRK07440, thiamine biosynthesis protein ThiS	NA|363aa|down_6|NC_019753.1_3476832_3477921_-	PRK02615, PRK02615, thiamine phosphate synthase	NA|328aa|down_7|NC_019753.1_3478141_3479125_+	COG4586, COG4586, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|263aa|down_8|NC_019753.1_3479185_3479974_+	COG4587, COG4587, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|1061aa|down_9|NC_019753.1_3480332_3483515_+	TIGR03607, TIGR03607, patatin-related protein
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	15	3609819-3610066	6	PILER-CR	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	TTACAGCGATATGAAAGTGCGATCGCATCTTACGCTCAAGCAAT	44	0	0	NA	NA	NA	2	2	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA,NA|62aa|down_1|NC_019753.1_3611512_3611698_+,NA|75aa|down_4|NC_019753.1_3613296_3613521_-,NA|64aa|down_5|NC_019753.1_3613504_3613696_-,NA|65aa|down_7|NC_019753.1_3617361_3617556_-,NA|138aa|down_8|NC_019753.1_3617682_3618096_+	NA|149aa|up_9|NC_019753.1_3596660_3597107_+	COG1934, COG1934, Uncharacterized protein conserved in bacteria [Function unknown]	NA|243aa|up_8|NC_019753.1_3597178_3597907_+	cd03218, ABC_YhbG, ATP-binding cassette component of YhbG transport system	NA|370aa|up_7|NC_019753.1_3598059_3599169_+	pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family	NA|475aa|up_6|NC_019753.1_3599277_3600702_-	PRK07362, PRK07362, NADP-dependent isocitrate dehydrogenase	NA|238aa|up_5|NC_019753.1_3600963_3601677_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|242aa|up_4|NC_019753.1_3601765_3602491_+	pfam05419, GUN4, GUN4-like	NA|539aa|up_3|NC_019753.1_3602658_3604275_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|466aa|up_2|NC_019753.1_3604362_3605760_+	TIGR00947, 2A73, putative bicarbonate transporter, IctB family	NA|367aa|up_1|NC_019753.1_3605913_3607014_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|156aa|up_0|NC_019753.1_3607390_3607858_-	PRK05422, smpB, SsrA-binding protein SmpB	NA|373aa|down_0|NC_019753.1_3610361_3611480_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|62aa|down_1|NC_019753.1_3611512_3611698_+	NA	NA|333aa|down_2|NC_019753.1_3611963_3612962_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|56aa|down_3|NC_019753.1_3613060_3613228_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|75aa|down_4|NC_019753.1_3613296_3613521_-	NA	NA|64aa|down_5|NC_019753.1_3613504_3613696_-	NA	NA|1010aa|down_6|NC_019753.1_3614058_3617088_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|65aa|down_7|NC_019753.1_3617361_3617556_-	NA	NA|138aa|down_8|NC_019753.1_3617682_3618096_+	NA	NA|151aa|down_9|NC_019753.1_3618147_3618600_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	16	4414992-4415076	14	CRISPRCasFinder	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	TAGCATTAGTTAGCGTAACCAAT	23	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA,NA|73aa|down_3|NC_019753.1_4417744_4417963_+,NA|59aa|down_4|NC_019753.1_4419480_4419657_+,NA|47aa|down_5|NC_019753.1_4419657_4419798_+,NA|737aa|down_6|NC_019753.1_4419865_4422076_-,NA|204aa|down_9|NC_019753.1_4425111_4425723_-	NA|486aa|up_9|NC_019753.1_4404734_4406192_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|401aa|up_8|NC_019753.1_4406361_4407564_-	PRK00073, pgk, phosphoglycerate kinase; Provisional	NA|138aa|up_7|NC_019753.1_4407745_4408159_+	pfam00582, Usp, Universal stress protein family	NA|290aa|up_6|NC_019753.1_4408305_4409175_+	PRK09563, rbgA, GTPase YlqF; Reviewed	NA|114aa|up_5|NC_019753.1_4409320_4409662_+	cd02076, P-type_ATPase_H, plant and fungal plasma membrane H(+)-ATPases, and related bacterial and archaeal putative H(+)-ATPases	NA|256aa|up_4|NC_019753.1_4409740_4410508_+	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|146aa|up_3|NC_019753.1_4410653_4411091_+	cd06987, cupin_MAE_RS03005, Microcystis aeruginosa MAE_RS03005 and related proteins, cupin domain	NA|218aa|up_2|NC_019753.1_4411206_4411860_+	COG5031, COQ4, Uncharacterized protein involved in ubiquinone biosynthesis [Coenzyme metabolism]	NA|425aa|up_1|NC_019753.1_4412383_4413658_+	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|399aa|up_0|NC_019753.1_4413682_4414879_-	cd02511, Beta4Glucosyltransferase, UDP-glucose LOS-beta-1,4 glucosyltransferase is required for biosynthesis of lipooligosaccharide	NA|539aa|down_0|NC_019753.1_4415097_4416714_+	COG4615, PvdE, ABC-type siderophore export system, fused ATPase and permease components [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism]	NA|69aa|down_1|NC_019753.1_4416767_4416974_+	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|74aa|down_2|NC_019753.1_4416970_4417192_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|73aa|down_3|NC_019753.1_4417744_4417963_+	NA	NA|59aa|down_4|NC_019753.1_4419480_4419657_+	NA	NA|47aa|down_5|NC_019753.1_4419657_4419798_+	NA	NA|737aa|down_6|NC_019753.1_4419865_4422076_-	NA	NA|454aa|down_7|NC_019753.1_4422325_4423687_+	PRK02507, PRK02507, proton extrusion protein PcxA; Provisional	NA|414aa|down_8|NC_019753.1_4423700_4424942_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|204aa|down_9|NC_019753.1_4425111_4425723_-	NA
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	17	4510783-4510881	15	CRISPRCasFinder	no		c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Orphan	ATTAATAATATTCCCAGTAGTAATATTTCA	30	0	0	NA	NA	NA	1	1	Orphan	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|109aa|up_3|NC_019753.1_4506316_4506643_+,NA|107aa|up_0|NC_019753.1_4509112_4509433_-,NA|482aa|down_1|NC_019753.1_4512822_4514268_+	NA|311aa|up_9|NC_019753.1_4498996_4499929_-	cd06581, TM_PBP1_LivM_like, Transmembrane subunit (TM) of Escherichia coli LivM and related proteins	NA|249aa|up_8|NC_019753.1_4499963_4500710_-	pfam02633, Creatininase, Creatinine amidohydrolase	NA|251aa|up_7|NC_019753.1_4500992_4501745_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|447aa|up_6|NC_019753.1_4501814_4503155_+	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|195aa|up_5|NC_019753.1_4503365_4503950_+	COG0704, PhoU, Phosphate uptake regulator [Inorganic ion transport and metabolism]	NA|592aa|up_4|NC_019753.1_4504270_4506046_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|109aa|up_3|NC_019753.1_4506316_4506643_+	NA	NA|180aa|up_2|NC_019753.1_4506663_4507203_+	cd10450, GIY-YIG_AtGrxS16_like, GIY-YIG domain found in CAXIP1-like proteins, iron-sulfur cluster assembly proteins, and similar proteins	NA|584aa|up_1|NC_019753.1_4507359_4509111_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|107aa|up_0|NC_019753.1_4509112_4509433_-	NA	NA|345aa|down_0|NC_019753.1_4511397_4512432_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|482aa|down_1|NC_019753.1_4512822_4514268_+	NA	NA|130aa|down_2|NC_019753.1_4514333_4514723_-	CHL00083, rpl12, ribosomal protein L12	NA|190aa|down_3|NC_019753.1_4514781_4515351_-	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed	NA|239aa|down_4|NC_019753.1_4515615_4516332_-	CHL00129, rpl1, ribosomal protein L1; Reviewed	NA|142aa|down_5|NC_019753.1_4516448_4516874_-	PRK00140, rplK, 50S ribosomal protein L11; Validated	NA|217aa|down_6|NC_019753.1_4516880_4517531_-	PRK05609, nusG, transcription antitermination protein NusG; Validated	NA|75aa|down_7|NC_019753.1_4517527_4517752_-	PRK07597, secE, preprotein translocase subunit SecE; Reviewed	NA|121aa|down_8|NC_019753.1_4518455_4518818_-	CHL00084, rpl19, ribosomal protein L19	NA|820aa|down_9|NC_019753.1_4518983_4521443_+	cd06595, GH31_u1, glycosyl hydrolase family 31 (GH31); uncharacterized subgroup
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	18	4687015-4687133	16	CRISPRCasFinder	no	Cas14c_CAS-V-F	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Unclear	CCAATACCCAGCCCAGTCCCAGA	23	0	0	NA	NA	NA	2	2	TypeV	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|52aa|up_9|NC_019753.1_4677041_4677197_-,NA|219aa|down_1|NC_019753.1_4688402_4689059_-,NA|154aa|down_3|NC_019753.1_4690044_4690506_-,NA|117aa|down_4|NC_019753.1_4690746_4691097_+	NA|52aa|up_9|NC_019753.1_4677041_4677197_-	NA	NA|379aa|up_8|NC_019753.1_4677590_4678727_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|144aa|up_7|NC_019753.1_4678872_4679304_-	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|502aa|up_6|NC_019753.1_4679501_4681007_+	PRK14508, PRK14508, 4-alpha-glucanotransferase; Provisional	NA|275aa|up_5|NC_019753.1_4681070_4681895_-	COG1426, COG1426, Predicted transcriptional regulator contains Xre-like HTH domain [Function unknown]	NA|262aa|up_4|NC_019753.1_4681891_4682677_-	COG1187, RsuA, 16S rRNA uridine-516 pseudouridylate synthase and related pseudouridylate synthases [Translation, ribosomal structure and biogenesis]	NA|246aa|up_3|NC_019753.1_4682748_4683486_+	pfam11209, DUF2993, Protein of unknown function (DUF2993)	NA|293aa|up_2|NC_019753.1_4683525_4684404_-	PLN02953, PLN02953, phosphatidate cytidylyltransferase	NA|201aa|up_1|NC_019753.1_4684510_4685113_-	PRK07402, PRK07402, precorrin-6Y C5,15-methyltransferase subunit CbiT	NA|512aa|up_0|NC_019753.1_4685165_4686701_-	COG1982, LdcC, Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism]	NA|364aa|down_0|NC_019753.1_4687238_4688330_+	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|219aa|down_1|NC_019753.1_4688402_4689059_-	NA	NA|176aa|down_2|NC_019753.1_4689299_4689827_+	cd07245, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|154aa|down_3|NC_019753.1_4690044_4690506_-	NA	NA|117aa|down_4|NC_019753.1_4690746_4691097_+	NA	NA|504aa|down_5|NC_019753.1_4691175_4692687_+	CHL00195, ycf46, Ycf46; Provisional	NA|118aa|down_6|NC_019753.1_4692817_4693171_+	pfam06868, DUF1257, Protein of unknown function (DUF1257)	Cas14c_CAS-V-F|408aa|down_7|NC_019753.1_4693364_4694588_+	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|161aa|down_8|NC_019753.1_4694899_4695382_-	pfam11947, DUF3464, Protein of unknown function (DUF3464)	NA|90aa|down_9|NC_019753.1_4695391_4695661_-	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	19	4712168-4712424	7,5	PILER-CR,CRT	no	Cas14c_CAS-V-F	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Unclear	AGCGAAAATTACTAATAATCCCTTTTAGGGATTGAAAC,TAATAATCCCTTTTAGGGATTGAAAC	38,26	0	0	NA	NA	I-D,II-B:I-D,II-B	3,3	3	TypeV	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|106aa|up_4|NC_019753.1_4707930_4708248_+,NA	NA|799aa|up_9|NC_019753.1_4698718_4701115_-	COG5635, COG5635, Predicted NTPase (NACHT family) [Signal transduction mechanisms]	NA|1448aa|up_8|NC_019753.1_4701274_4705618_+	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|94aa|up_7|NC_019753.1_4705614_4705896_+	cd17538, REC_D1_PleD-like, first (D1) phosphoacceptor receiver (REC) domain of response regulator PleD and similar domains	NA|433aa|up_6|NC_019753.1_4705920_4707219_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|83aa|up_5|NC_019753.1_4707397_4707646_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|106aa|up_4|NC_019753.1_4707930_4708248_+	NA	NA|112aa|up_3|NC_019753.1_4708404_4708740_-	pfam00101, RuBisCO_small, Ribulose bisphosphate carboxylase, small chain	NA|137aa|up_2|NC_019753.1_4708786_4709197_-	pfam02341, RcbX, RbcX protein	NA|477aa|up_1|NC_019753.1_4709289_4710720_-	CHL00040, rbcL, ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit	NA|258aa|up_0|NC_019753.1_4711217_4711991_+	PRK00311, panB, 3-methyl-2-oxobutanoate hydroxymethyltransferase; Reviewed	NA|67aa|down_0|NC_019753.1_4712469_4712670_+	COG1724, COG1724, Predicted RNA binding protein (dsRBD-like fold), HicA family    [General function prediction only]	NA|73aa|down_1|NC_019753.1_4712659_4712878_+	pfam15919, HicB_lk_antitox, HicB_like antitoxin of bacterial toxin-antitoxin system	NA|588aa|down_2|NC_019753.1_4712966_4714730_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|174aa|down_3|NC_019753.1_4714972_4715494_-	COG5403, COG5403, Uncharacterized conserved protein [Function unknown]	NA|520aa|down_4|NC_019753.1_4715664_4717224_+	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|192aa|down_5|NC_019753.1_4717414_4717990_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|313aa|down_6|NC_019753.1_4718310_4719249_-	PRK06249, PRK06249, putative 2-dehydropantoate 2-reductase	NA|334aa|down_7|NC_019753.1_4719612_4720614_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|139aa|down_8|NC_019753.1_4720901_4721318_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|486aa|down_9|NC_019753.1_4721434_4722892_-	cd11338, AmyAc_CMD, Alpha amylase catalytic domain found in cyclomaltodextrinases and related proteins
GCF_000317495.1_ASM31749v1	NC_019753	Crinalium epipsammum PCC 9333, complete genome	20	5029582-5031781	8,17,6	PILER-CR,CRISPRCasFinder,CRT	no	c2c9_V-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	Type V-U4	GTTTAAATGCACCTAAATCCCTTTTAGGGATTGAAAC,GTTTAAATGCACCTAAATCCCTTTTAGGGATTGAAAC,GTTTAAATGCACCTAAATCCCTTTTAGGGATTGAAAC	37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B	30,30,30	30	TypeV-U4	c2c9_V-U4,PD-DExK,cas14j,Cas9_archaeal,csx3,cas10,csm3gr7,csx19,csm6,cas6,WYL,RT,cas1,cas2,cas3,cas10d,csc2gr7,csc1gr5,cas4,DinG,csa3,cas14k,c2c5_V-U5,Cas14c_CAS-V-F,2OG_CAS,Cas14u_CAS-V	NA|47aa|up_4|NC_019753.1_5025323_5025464_-,NA|144aa|down_0|NC_019753.1_5032025_5032457_+,NA|118aa|down_3|NC_019753.1_5039607_5039961_-	NA|417aa|up_9|NC_019753.1_5020244_5021495_+	cd06423, CESA_like, CESA_like is  the cellulose synthase superfamily	NA|107aa|up_8|NC_019753.1_5021511_5021832_-	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|573aa|up_7|NC_019753.1_5022011_5023730_+	COG3653, COG3653, N-acyl-D-aspartate/D-glutamate deacylase [Secondary metabolites biosynthesis, transport, and catabolism]	NA|137aa|up_6|NC_019753.1_5023743_5024154_+	pfam06127, DUF962, Protein of unknown function (DUF962)	NA|286aa|up_5|NC_019753.1_5024204_5025062_-	cd01846, fatty_acyltransferase_like, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases	NA|47aa|up_4|NC_019753.1_5025323_5025464_-	NA	NA|511aa|up_3|NC_019753.1_5025511_5027044_-	cd06160, S2P-M50_like_2, Uncharacterized homologs of Site-2 protease (S2P), zinc metalloproteases (MEROPS family M50) which cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|315aa|up_2|NC_019753.1_5027140_5028085_-	PRK05441, murQ, N-acetylmuramic acid-6-phosphate etherase; Reviewed	NA|122aa|up_1|NC_019753.1_5028081_5028447_-	pfam11360, DUF3110, Protein of unknown function (DUF3110)	NA|234aa|up_0|NC_019753.1_5028612_5029314_-	COG2173, DdpX, D-alanyl-D-alanine dipeptidase [Cell envelope biogenesis, outer membrane]	NA|144aa|down_0|NC_019753.1_5032025_5032457_+	NA	NA|1760aa|down_1|NC_019753.1_5032535_5037815_-	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|458aa|down_2|NC_019753.1_5038020_5039394_-	COG0641, AslB, Arylsulfatase regulator (Fe-S oxidoreductase) [General function prediction only]	NA|118aa|down_3|NC_019753.1_5039607_5039961_-	NA	NA|398aa|down_4|NC_019753.1_5040151_5041345_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|388aa|down_5|NC_019753.1_5041424_5042588_+	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|93aa|down_6|NC_019753.1_5042584_5042863_-	pfam01809, Haemolytic, Haemolytic domain	NA|236aa|down_7|NC_019753.1_5042912_5043620_+	COG0170, SEC59, Dolichol kinase [Lipid metabolism]	NA|98aa|down_8|NC_019753.1_5044043_5044337_-	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|83aa|down_9|NC_019753.1_5044326_5044575_-	pfam02604, PhdYeFM_antitox, Antitoxin Phd_YefM, type II toxin-antitoxin system
