assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	1	16624-16758	1	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GTATTTTTGATAGAGTTTACTAAAATTCTAAATATAGATTTTTAAAG	47	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|1006aa|up_0|NC_008312.1_12894_15912_+,NA|59aa|down_3|NC_008312.1_23489_23666_-	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|457aa|up_5|NC_008312.1_26_1397_+	PRK00149, dnaA, chromosomal replication initiator protein DnaA	NA|101aa|up_4|NC_008312.1_9092_9395_+	PRK05643, PRK05643, DNA polymerase III subunit beta; Validated	NA|201aa|up_3|NC_008312.1_10370_10973_+	PRK05643, PRK05643, DNA polymerase III subunit beta; Validated	NA|152aa|up_2|NC_008312.1_11373_11829_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|168aa|up_1|NC_008312.1_11837_12341_-	pfam13565, HTH_32, Homeodomain-like domain	NA|1006aa|up_0|NC_008312.1_12894_15912_+	NA	NA|1249aa|down_0|NC_008312.1_17125_20872_+	pfam13191, AAA_16, AAA ATPase domain	NA|396aa|down_1|NC_008312.1_21245_22433_-	COG0003, ArsA, Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning]	NA|131aa|down_2|NC_008312.1_22679_23072_-	pfam10184, DUF2358, Uncharacterized conserved protein (DUF2358)	NA|59aa|down_3|NC_008312.1_23489_23666_-	NA	NA|177aa|down_4|NC_008312.1_23798_24329_-	pfam00805, Pentapeptide, Pentapeptide repeats (8 copies)	NA|732aa|down_5|NC_008312.1_25440_27636_-	COG3330, COG3330, Uncharacterized protein conserved in bacteria [Function unknown]	NA|562aa|down_6|NC_008312.1_28953_30639_+	PRK00911, PRK00911, dihydroxy-acid dehydratase; Provisional	NA|346aa|down_7|NC_008312.1_31433_32471_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|763aa|down_8|NC_008312.1_33064_35353_+	COG0744, MrcB, Membrane carboxypeptidase (penicillin-binding protein) [Cell envelope biogenesis, outer membrane]	NA|443aa|down_9|NC_008312.1_36829_38158_+	pfam17784, Sulfotransfer_4, Sulfotransferase domain
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	2	438079-438234	2	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CCCATTTTTATTTCCATCGTTAAACATAAAATAGAAGTTAAACACATACATAA	53	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA,NA|65aa|down_1|NC_008312.1_440382_440577_-,NA|82aa|down_3|NC_008312.1_442806_443052_-	NA|406aa|up_9|NC_008312.1_422370_423588_+	cd17474, MFS_YfmO_like, Bacillus subtilis multidrug efflux protein YfmO and similar transporters of the Major Facilitator Superfamily	NA|326aa|up_8|NC_008312.1_423918_424896_+	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|507aa|up_7|NC_008312.1_425177_426698_-	COG2303, BetA, Choline dehydrogenase and related flavoproteins [Amino acid transport and metabolism]	NA|205aa|up_6|NC_008312.1_427228_427843_-	COG1845, CyoC, Heme/copper-type cytochrome/quinol oxidase, subunit 3 [Energy production and conversion]	NA|563aa|up_5|NC_008312.1_428292_429981_-	TIGR02891, Probable_cytochrome_c_oxidase_subunit_1-beta, cytochrome c oxidase, subunit I	NA|303aa|up_4|NC_008312.1_430085_430994_-	COG1622, CyoA, Heme/copper-type cytochrome/quinol oxidases, subunit 2 [Energy production and conversion]	NA|202aa|up_3|NC_008312.1_431012_431618_-	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|167aa|up_2|NC_008312.1_431614_432115_-	COG4244, COG4244, Predicted membrane protein [Function unknown]	NA|358aa|up_1|NC_008312.1_433207_434281_-	COG0429, COG0429, Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only]	NA|390aa|up_0|NC_008312.1_436167_437337_+	PRK12309, PRK12309, transaldolase	NA|368aa|down_0|NC_008312.1_438502_439606_-	TIGR02352, Glycine_oxidase, glycine oxidase ThiO	NA|65aa|down_1|NC_008312.1_440382_440577_-	NA	NA|535aa|down_2|NC_008312.1_440974_442579_+	PRK12344, PRK12344, putative alpha-isopropylmalate/homocitrate synthase family transferase; Provisional	NA|82aa|down_3|NC_008312.1_442806_443052_-	NA	NA|250aa|down_4|NC_008312.1_443534_444284_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|413aa|down_5|NC_008312.1_446240_447479_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|287aa|down_6|NC_008312.1_447470_448331_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|391aa|down_7|NC_008312.1_449413_450586_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|433aa|down_8|NC_008312.1_451429_452728_-	pfam01937, DUF89, Protein of unknown function DUF89	NA|867aa|down_9|NC_008312.1_452824_455425_+	COG0308, PepN, Aminopeptidase N [Amino acid transport and metabolism]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	3	612915-616884	1,2	CRT,CRT	no	cas14k	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	GGACCTACAGGACCTACNGGACCAG,GGACCTANAGGACCNNNNGGACCAG	25,25	31	77	613003-613022|613003-613022|613093-613121|613093-613121|613147-613166|613147-613166|613147-613166|613147-613166|613147-613166|613408-613445|613408-613445|613633-613670|613633-613670|613777-613823|613777-613823|613777-613823|614119-614147|614218-614246|614218-614246|614272-614300|614326-614345|614326-614345|614326-614345|614326-614345|614371-614408|614371-614408|614371-614408|614371-614408|614434-614453|614668-614687|614668-614687|614668-614687|614668-614687|614713-614741|614713-614741|614830-614858|614830-614858|614884-614912|614938-614957|614938-614957|614938-614957|614938-614957|614983-615020|614983-615020|614983-615020|614983-615020|615046-615065|615217-615254|615217-615254|615217-615254|615217-615254|615280-615344|615280-615344|615280-615344|615433-615461|615433-615461|615487-615515|615541-615560|615541-615560|615541-615560|615541-615560|615586-615605|615586-615605|615586-615605|615586-615605|615631-615668|615631-615668|615775-615848|615775-615848|616237-616274|616237-616274|616237-616274|616237-616274|616237-616274|616381-616454|616750-616796|616822-616859	NC_008312.1_616948-616967|NC_008312.1_617533-617552|NC_008312.1_616948-616976|NC_008312.1_617533-617561|NC_008312.1_616912-616931|NC_008312.1_617497-617516|NC_008312.1_617155-617174|NC_008312.1_617686-617705|NC_008312.1_617839-617858|NC_008312.1_617533-617570|NC_008312.1_616948-616985|NC_008312.1_617767-617804|NC_008312.1_616849-616886|NC_008312.1_617092-617138|NC_008312.1_617767-617813|NC_008312.1_616849-616895|NC_008312.1_617767-617795|NC_008312.1_616948-616976|NC_008312.1_617533-617561|NC_008312.1_617002-617030|NC_008312.1_617488-617507|NC_008312.1_617677-617696|NC_008312.1_617830-617849|NC_008312.1_617866-617885|NC_008312.1_617767-617804|NC_008312.1_616849-616886|NC_008312.1_617092-617129|NC_008312.1_617596-617633|NC_008312.1_617146-617165|NC_008312.1_617488-617507|NC_008312.1_617677-617696|NC_008312.1_617830-617849|NC_008312.1_617866-617885|NC_008312.1_616948-616976|NC_008312.1_617533-617561|NC_008312.1_617092-617120|NC_008312.1_617596-617624|NC_008312.1_617002-617030|NC_008312.1_617488-617507|NC_008312.1_617677-617696|NC_008312.1_617830-617849|NC_008312.1_617866-617885|NC_008312.1_617767-617804|NC_008312.1_616849-616886|NC_008312.1_617092-617129|NC_008312.1_617596-617633|NC_008312.1_617146-617165|NC_008312.1_617767-617804|NC_008312.1_616849-616886|NC_008312.1_617092-617129|NC_008312.1_617596-617633|NC_008312.1_616912-616976|NC_008312.1_617497-617561|NC_008312.1_617155-617219|NC_008312.1_617092-617120|NC_008312.1_617596-617624|NC_008312.1_617002-617030|NC_008312.1_617488-617507|NC_008312.1_617677-617696|NC_008312.1_617830-617849|NC_008312.1_617866-617885|NC_008312.1_617488-617507|NC_008312.1_617677-617696|NC_008312.1_617830-617849|NC_008312.1_617866-617885|NC_008312.1_617533-617570|NC_008312.1_616948-616985|NC_008312.1_616912-616985|NC_008312.1_617497-617570|NC_008312.1_617101-617138|NC_008312.1_617542-617579|NC_008312.1_616858-616895|NC_008312.1_616957-616994|NC_008312.1_617776-617813|NC_008312.1_616912-616985|NC_008312.1_617056-617102|NC_008312.1_617065-617102	NA:NA	63,63	63	TypeV	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|133aa|up_8|NC_008312.1_599379_599778_+,NA|55aa|up_6|NC_008312.1_603465_603630_-,NA|617aa|up_5|NC_008312.1_603833_605684_+,NA|53aa|down_0|NC_008312.1_620096_620255_+,NA|59aa|down_6|NC_008312.1_630016_630193_+,NA|57aa|down_8|NC_008312.1_633908_634079_-	NA|363aa|up_9|NC_008312.1_597557_598646_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|133aa|up_8|NC_008312.1_599379_599778_+	NA	NA|939aa|up_7|NC_008312.1_600304_603121_+	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|55aa|up_6|NC_008312.1_603465_603630_-	NA	NA|617aa|up_5|NC_008312.1_603833_605684_+	NA	NA|56aa|up_4|NC_008312.1_605998_606166_-	cd09025, Aldose_epim_Slr1438, Aldose 1-epimerase, similar to Synechocystis Slr1438	NA|77aa|up_3|NC_008312.1_606397_606628_-	TIGR01097, PhnE, phosphonate ABC transporter, permease protein PhnE	NA|251aa|up_2|NC_008312.1_606873_607626_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	cas14k|143aa|up_1|NC_008312.1_608203_608632_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|178aa|up_0|NC_008312.1_609070_609604_-	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|53aa|down_0|NC_008312.1_620096_620255_+	NA	NA|419aa|down_1|NC_008312.1_620587_621844_+	PLN02572, PLN02572, UDP-sulfoquinovose synthase	NA|378aa|down_2|NC_008312.1_622276_623410_+	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|865aa|down_3|NC_008312.1_623996_626591_+	cd07302, CHD, cyclase homology domain	NA|242aa|down_4|NC_008312.1_627136_627862_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|256aa|down_5|NC_008312.1_628070_628838_+	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|59aa|down_6|NC_008312.1_630016_630193_+	NA	NA|742aa|down_7|NC_008312.1_631425_633651_+	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|57aa|down_8|NC_008312.1_633908_634079_-	NA	NA|710aa|down_9|NC_008312.1_634050_636180_-	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	4	617166-617496	3	CRT	no	cas14k	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	GGACCTACAGGACCTANNGGACCNNCAGGACCAG	34	5	8	617200-617219|617200-617219|617200-617219|617254-617282|617317-617345|617380-617399|617380-617399|617434-617462	NC_008312.1_616957-616976|NC_008312.1_617542-617561|NC_008312.1_617776-617795|NC_008312.1_617632-617660|NC_008312.1_617767-617795|NC_008312.1_617641-617660|NC_008312.1_617146-617165|NC_008312.1_617767-617795	NA	5	5	TypeV	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|133aa|up_8|NC_008312.1_599379_599778_+,NA|55aa|up_6|NC_008312.1_603465_603630_-,NA|617aa|up_5|NC_008312.1_603833_605684_+,NA|53aa|down_0|NC_008312.1_620096_620255_+,NA|59aa|down_6|NC_008312.1_630016_630193_+,NA|57aa|down_8|NC_008312.1_633908_634079_-	NA|363aa|up_9|NC_008312.1_597557_598646_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|133aa|up_8|NC_008312.1_599379_599778_+	NA	NA|939aa|up_7|NC_008312.1_600304_603121_+	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|55aa|up_6|NC_008312.1_603465_603630_-	NA	NA|617aa|up_5|NC_008312.1_603833_605684_+	NA	NA|56aa|up_4|NC_008312.1_605998_606166_-	cd09025, Aldose_epim_Slr1438, Aldose 1-epimerase, similar to Synechocystis Slr1438	NA|77aa|up_3|NC_008312.1_606397_606628_-	TIGR01097, PhnE, phosphonate ABC transporter, permease protein PhnE	NA|251aa|up_2|NC_008312.1_606873_607626_+	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	cas14k|143aa|up_1|NC_008312.1_608203_608632_+	TIGR01766, Putative_transposase_MJ0751, transposase, IS605 OrfB family, central region	NA|178aa|up_0|NC_008312.1_609070_609604_-	pfam01625, PMSR, Peptide methionine sulfoxide reductase	NA|53aa|down_0|NC_008312.1_620096_620255_+	NA	NA|419aa|down_1|NC_008312.1_620587_621844_+	PLN02572, PLN02572, UDP-sulfoquinovose synthase	NA|378aa|down_2|NC_008312.1_622276_623410_+	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|865aa|down_3|NC_008312.1_623996_626591_+	cd07302, CHD, cyclase homology domain	NA|242aa|down_4|NC_008312.1_627136_627862_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|256aa|down_5|NC_008312.1_628070_628838_+	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|59aa|down_6|NC_008312.1_630016_630193_+	NA	NA|742aa|down_7|NC_008312.1_631425_633651_+	cd02767, MopB_ydeP, The MopB_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site	NA|57aa|down_8|NC_008312.1_633908_634079_-	NA	NA|710aa|down_9|NC_008312.1_634050_636180_-	COG2931, COG2931, RTX toxins and related Ca2+-binding proteins [Secondary metabolites biosynthesis, transport, and catabolism]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	5	684856-684965	3	CRISPRCasFinder	no	RT	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	AAAATTGCCTGTAATAATCTCTTCAA	26	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|47aa|up_3|NC_008312.1_680661_680802_-,NA|92aa|up_1|NC_008312.1_682571_682847_+,NA	NA|1023aa|up_9|NC_008312.1_659041_662110_+	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|118aa|up_8|NC_008312.1_663632_663986_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family	NA|100aa|up_7|NC_008312.1_664959_665259_-	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|1018aa|up_6|NC_008312.1_666286_669340_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	RT|590aa|up_5|NC_008312.1_675848_677618_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|94aa|up_4|NC_008312.1_678195_678477_-	cd01676, RNR_II_monomer, Class II ribonucleotide reductase, monomeric form	NA|47aa|up_3|NC_008312.1_680661_680802_-	NA	RT|503aa|up_2|NC_008312.1_680791_682300_-	TIGR04416, hypothetical_protein, group II intron reverse transcriptase/maturase	NA|92aa|up_1|NC_008312.1_682571_682847_+	NA	NA|86aa|up_0|NC_008312.1_682940_683198_-	cd01676, RNR_II_monomer, Class II ribonucleotide reductase, monomeric form	NA|171aa|down_0|NC_008312.1_685182_685695_-	PRK02304, PRK02304, adenine phosphoribosyltransferase; Provisional	NA|202aa|down_1|NC_008312.1_687516_688122_+	pfam11237, DUF3038, Protein of unknown function (DUF3038)	NA|539aa|down_2|NC_008312.1_688949_690566_-	TIGR00976, Hypothetical_protein_Rv1835c/MT1883/Mb1866c	NA|150aa|down_3|NC_008312.1_692629_693079_-	cd06987, cupin_MAE_RS03005, Microcystis aeruginosa MAE_RS03005 and related proteins, cupin domain	NA|383aa|down_4|NC_008312.1_694773_695922_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|154aa|down_5|NC_008312.1_696401_696863_+	PRK05273, PRK05273, D-tyrosyl-tRNA(Tyr) deacylase; Provisional	NA|108aa|down_6|NC_008312.1_697750_698074_+	pfam08855, DUF1825, Domain of unknown function (DUF1825)	NA|605aa|down_7|NC_008312.1_698788_700603_-	TIGR02074, Includes:_Penicillin-insensitive_transglycosylase, penicillin-binding protein, 1A family	NA|405aa|down_8|NC_008312.1_700845_702060_+	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|243aa|down_9|NC_008312.1_702712_703441_+	PRK00230, PRK00230, orotidine-5'-phosphate decarboxylase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	6	904018-904108	4	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GAGGTTTCTGAGACAGAAAATACTGATAATT	31	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|344aa|up_2|NC_008312.1_889555_890587_+,NA|76aa|up_1|NC_008312.1_890950_891178_+,NA|72aa|down_1|NC_008312.1_923659_923875_+	NA|472aa|up_9|NC_008312.1_878410_879826_+	PRK14360, glmU, bifunctional UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase GlmU	NA|333aa|up_8|NC_008312.1_880081_881080_+	cd08276, MDR7, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|327aa|up_7|NC_008312.1_881100_882081_+	cd08271, MDR5, Medium chain dehydrogenases/reductase (MDR)/zinc-dependent alcohol dehydrogenase-like family	NA|110aa|up_6|NC_008312.1_882333_882663_+	cd06260, DUF820, Domain of unknown function (DUF820)	NA|259aa|up_5|NC_008312.1_882851_883628_-	pfam14517, Tachylectin, Tachylectin	NA|595aa|up_4|NC_008312.1_884920_886705_+	cd11477, SLC5sbd_u1, Uncharacterized bacterial solute carrier 5 subfamily; putative solute-binding domain	NA|446aa|up_3|NC_008312.1_887811_889149_+	sd00006, TPR, Tetratricopeptide repeat	NA|344aa|up_2|NC_008312.1_889555_890587_+	NA	NA|76aa|up_1|NC_008312.1_890950_891178_+	NA	NA|3536aa|up_0|NC_008312.1_891342_901950_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|3368aa|down_0|NC_008312.1_913572_923676_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|72aa|down_1|NC_008312.1_923659_923875_+	NA	NA|201aa|down_2|NC_008312.1_925475_926078_+	pfam07466, DUF1517, Protein of unknown function (DUF1517)	NA|324aa|down_3|NC_008312.1_926398_927370_-	cd03469, Rieske_RO_Alpha_N, Rieske non-heme iron oxygenase (RO) family, N-terminal Rieske domain of the oxygenase alpha subunit; The RO family comprise a large class of aromatic ring-hydroxylating dioxygenases found predominantly in microorganisms	NA|518aa|down_4|NC_008312.1_927460_929014_+	TIGR00442, hisS, histidyl-tRNA synthetase	NA|339aa|down_5|NC_008312.1_929568_930585_+	cd13593, PBP2_HisGL3, The catalytic domain of hexameric long form HisGL3; contains the type 2 periplasmic binding protein fold	NA|76aa|down_6|NC_008312.1_930907_931135_+	COG1146, COG1146, Ferredoxin [Energy production and conversion]	NA|195aa|down_7|NC_008312.1_931528_932113_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|410aa|down_8|NC_008312.1_932939_934169_+	PRK04346, PRK04346, tryptophan synthase subunit beta; Validated	NA|432aa|down_9|NC_008312.1_934520_935816_+	pfam00144, Beta-lactamase, Beta-lactamase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	7	1110157-1110246	5	CRISPRCasFinder	no	Cas14u_CAS-V	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	GTTGGGTAGGTTGTATTGGTTGAACTGGAA	30	0	0	NA	NA	NA	1	1	Unclear	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|185aa|up_5|NC_008312.1_1103207_1103762_+,NA|84aa|up_2|NC_008312.1_1107524_1107776_+,NA	NA|892aa|up_9|NC_008312.1_1094073_1096749_-	TIGR00845, Sodium/calcium_exchanger_1, sodium/calcium exchanger 1	NA|229aa|up_8|NC_008312.1_1096924_1097611_+	COG0170, SEC59, Dolichol kinase [Lipid metabolism]	Cas14u_CAS-V|192aa|up_7|NC_008312.1_1098291_1098867_-	pfam12323, HTH_OrfB_IS605, Helix-turn-helix domain	NA|123aa|up_6|NC_008312.1_1102445_1102814_+	pfam08865, DUF1830, Domain of unknown function (DUF1830)	NA|185aa|up_5|NC_008312.1_1103207_1103762_+	NA	NA|250aa|up_4|NC_008312.1_1103769_1104519_+	COG1135, AbcC, ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism]	NA|253aa|up_3|NC_008312.1_1105111_1105870_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|84aa|up_2|NC_008312.1_1107524_1107776_+	NA	NA|112aa|up_1|NC_008312.1_1108731_1109067_-	TIGR00365, TIGR00365, monothiol glutaredoxin, Grx4 family	NA|87aa|up_0|NC_008312.1_1109389_1109650_-	COG0271, BolA, Stress-induced morphogen (activity unknown) [Signal transduction mechanisms]	NA|228aa|down_0|NC_008312.1_1110562_1111246_-	PLN02811, PLN02811, hydrolase	NA|98aa|down_1|NC_008312.1_1112417_1112711_+	smart00886, Dabb, Stress responsive A/B Barrel Domain	NA|355aa|down_2|NC_008312.1_1114287_1115352_+	PRK09293, PRK09293, class 1 fructose-bisphosphatase	NA|381aa|down_3|NC_008312.1_1115548_1116691_+	PRK03343, PRK03343, transaldolase; Validated	NA|510aa|down_4|NC_008312.1_1117040_1118570_+	PRK05722, PRK05722, glucose-6-phosphate 1-dehydrogenase; Validated	NA|441aa|down_5|NC_008312.1_1118944_1120267_+	pfam10128, OpcA_G6PD_assem, Glucose-6-phosphate dehydrogenase subunit	NA|202aa|down_6|NC_008312.1_1121931_1122537_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|121aa|down_7|NC_008312.1_1123315_1123678_-	pfam02152, FolB, Dihydroneopterin aldolase	NA|172aa|down_8|NC_008312.1_1124314_1124830_+	COG0801, FolK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase [Coenzyme metabolism]	NA|159aa|down_9|NC_008312.1_1127117_1127594_+	pfam02261, Asp_decarbox, Aspartate decarboxylase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	8	1276978-1277054	6	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CAAGGTGATGATCATCTTATTGG	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|46aa|up_7|NC_008312.1_1263815_1263953_+,NA|67aa|down_4|NC_008312.1_1284894_1285095_+,NA|299aa|down_5|NC_008312.1_1286185_1287082_-,NA|59aa|down_7|NC_008312.1_1289413_1289590_+	NA|83aa|up_9|NC_008312.1_1261082_1261331_+	pfam01455, HupF_HypC, HupF/HypC family	NA|378aa|up_8|NC_008312.1_1261719_1262853_+	PRK15062, PRK15062, hydrogenase isoenzymes formation protein HypD; Provisional	NA|46aa|up_7|NC_008312.1_1263815_1263953_+	NA	NA|369aa|up_6|NC_008312.1_1264285_1265392_+	TIGR02124, Hydrogenase_expression/formation_protein_HypE, hydrogenase expression/formation protein HypE	NA|115aa|up_5|NC_008312.1_1265647_1265992_+	pfam01155, HypA, Hydrogenase/urease nickel incorporation, metallochaperone, hypA	NA|335aa|up_4|NC_008312.1_1266309_1267314_-	PRK09478, mglC, galactose/methyl galactoside ABC transporter permease MglC	NA|510aa|up_3|NC_008312.1_1267523_1269053_-	PRK10982, PRK10982, galactose/methyl galaxtoside transporter ATP-binding protein; Provisional	NA|343aa|up_2|NC_008312.1_1269286_1270315_-	cd01539, PBP1_GGBP, periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species	NA|28aa|up_1|NC_008312.1_1271189_1271273_+	pfam11103, DUF2887, Protein of unknown function (DUF2887)	NA|280aa|up_0|NC_008312.1_1271552_1272392_+	PRK10463, PRK10463, hydrogenase nickel incorporation protein HypB; Provisional	NA|313aa|down_0|NC_008312.1_1279952_1280891_-	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|214aa|down_1|NC_008312.1_1281380_1282022_+	cd19368, TenA_C_AtTH2-like, TenA_C family similar to the N-terminal TenA_C domain of Arabidopsis thaliana thiamine requiring 2	NA|431aa|down_2|NC_008312.1_1282671_1283964_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|188aa|down_3|NC_008312.1_1284062_1284626_+	PRK00150, def, peptide deformylase; Reviewed	NA|67aa|down_4|NC_008312.1_1284894_1285095_+	NA	NA|299aa|down_5|NC_008312.1_1286185_1287082_-	NA	NA|644aa|down_6|NC_008312.1_1287238_1289170_+	COG3264, COG3264, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|59aa|down_7|NC_008312.1_1289413_1289590_+	NA	NA|495aa|down_8|NC_008312.1_1289780_1291265_-	pfam14516, AAA_35, AAA-like domain	NA|718aa|down_9|NC_008312.1_1291959_1294113_+	cd00147, cPLA2_like, Cytosolic phospholipase A2, catalytic domain; hydrolyses arachidonyl phospholipids
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	9	1311829-1311915	7	CRISPRCasFinder	no	PD-DExK	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	TTATTGCTGAGAAAAATAATAATATAAATA	30	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|97aa|up_7|NC_008312.1_1296179_1296470_+,PD-DExK|206aa|up_5|NC_008312.1_1299129_1299747_-,NA|373aa|up_4|NC_008312.1_1299864_1300983_-,NA|727aa|up_3|NC_008312.1_1303102_1305283_-,NA|302aa|up_2|NC_008312.1_1305630_1306536_-,NA|52aa|up_0|NC_008312.1_1307574_1307730_-,NA|143aa|down_2|NC_008312.1_1317055_1317484_+,NA|263aa|down_3|NC_008312.1_1318025_1318814_-,NA|268aa|down_6|NC_008312.1_1322984_1323788_+,NA|78aa|down_7|NC_008312.1_1326044_1326278_+,NA|396aa|down_9|NC_008312.1_1330520_1331708_-	NA|718aa|up_9|NC_008312.1_1291959_1294113_+	cd00147, cPLA2_like, Cytosolic phospholipase A2, catalytic domain; hydrolyses arachidonyl phospholipids	NA|454aa|up_8|NC_008312.1_1294563_1295925_+	pfam01823, MACPF, MAC/Perforin domain	NA|97aa|up_7|NC_008312.1_1296179_1296470_+	NA	NA|464aa|up_6|NC_008312.1_1297258_1298650_-	cd00198, vWFA, Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF)	PD-DExK|206aa|up_5|NC_008312.1_1299129_1299747_-	NA	NA|373aa|up_4|NC_008312.1_1299864_1300983_-	NA	NA|727aa|up_3|NC_008312.1_1303102_1305283_-	NA	NA|302aa|up_2|NC_008312.1_1305630_1306536_-	NA	NA|272aa|up_1|NC_008312.1_1306594_1307410_-	smart00962, SRP54, SRP54-type protein, GTPase domain	NA|52aa|up_0|NC_008312.1_1307574_1307730_-	NA	NA|436aa|down_0|NC_008312.1_1312664_1313972_-	PRK00861, PRK00861, putative lipid kinase; Reviewed	NA|599aa|down_1|NC_008312.1_1314880_1316677_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|143aa|down_2|NC_008312.1_1317055_1317484_+	NA	NA|263aa|down_3|NC_008312.1_1318025_1318814_-	NA	NA|332aa|down_4|NC_008312.1_1318856_1319852_-	cd11024, CuRO_1_2DMCO_NIR_like, The cupredoxin domain 1 of a two-domain laccase related to nitrite reductase	NA|696aa|down_5|NC_008312.1_1320188_1322276_-	PRK14948, PRK14948, DNA polymerase III subunit gamma/tau	NA|268aa|down_6|NC_008312.1_1322984_1323788_+	NA	NA|78aa|down_7|NC_008312.1_1326044_1326278_+	NA	NA|1039aa|down_8|NC_008312.1_1327222_1330339_+	pfam00311, PEPcase, Phosphoenolpyruvate carboxylase	NA|396aa|down_9|NC_008312.1_1330520_1331708_-	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	10	1399189-1399253	8	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTCAAAGTTAGAGTTTAGAAGTT	23	1	9	1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230|1399212-1399230	NC_008312.1_3181554-3181536|NC_008312.1_312986-312968|NC_008312.1_2082386-2082368|NC_008312.1_2267182-2267200|NC_008312.1_2825631-2825649|NC_008312.1_3001789-3001807|NC_008312.1_4961158-4961176|NC_008312.1_6605857-6605875|NC_008312.1_7534704-7534722	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|51aa|up_6|NC_008312.1_1386864_1387017_-,NA|50aa|up_5|NC_008312.1_1387058_1387208_-,NA|74aa|down_3|NC_008312.1_1404360_1404582_+	NA|494aa|up_9|NC_008312.1_1381499_1382981_+	COG1928, PMT1, Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones]	NA|221aa|up_8|NC_008312.1_1384182_1384845_-	PRK00121, trmB, tRNA (guanine-N(7)-)-methyltransferase; Reviewed	NA|69aa|up_7|NC_008312.1_1385318_1385525_-	pfam01385, OrfB_IS605, Probable transposase	NA|51aa|up_6|NC_008312.1_1386864_1387017_-	NA	NA|50aa|up_5|NC_008312.1_1387058_1387208_-	NA	NA|365aa|up_4|NC_008312.1_1387488_1388583_+	cd02035, ArsA, Arsenical pump-driving ATPase ArsA	NA|327aa|up_3|NC_008312.1_1388872_1389853_+	TIGR02056, chlorophyll_synthase_33_kD_subunit, chlorophyll synthase, ChlG	NA|689aa|up_2|NC_008312.1_1391234_1393301_+	TIGR02074, Includes:_Penicillin-insensitive_transglycosylase, penicillin-binding protein, 1A family	NA|371aa|up_1|NC_008312.1_1394262_1395375_+	TIGR02032, Uncharacterized_protein_MJ1520, geranylgeranyl reductase family	NA|1069aa|up_0|NC_008312.1_1395949_1399156_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|268aa|down_0|NC_008312.1_1399968_1400772_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|401aa|down_1|NC_008312.1_1401368_1402571_-	PRK00509, PRK00509, argininosuccinate synthase; Provisional	NA|311aa|down_2|NC_008312.1_1403155_1404088_+	cd06420, GT2_Chondriotin_Pol_N, N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase	NA|74aa|down_3|NC_008312.1_1404360_1404582_+	NA	NA|311aa|down_4|NC_008312.1_1406489_1407422_+	PLN02823, PLN02823, spermine synthase	NA|56aa|down_5|NC_008312.1_1407783_1407951_+	pfam01679, Pmp3, Proteolipid membrane potential modulator	NA|155aa|down_6|NC_008312.1_1408254_1408719_+	pfam05154, TM2, TM2 domain	NA|356aa|down_7|NC_008312.1_1409260_1410328_+	PRK13357, PRK13357, branched-chain amino acid aminotransferase; Provisional	NA|286aa|down_8|NC_008312.1_1410905_1411763_+	COG2981, CysZ, Uncharacterized protein involved in cysteine biosynthesis [Amino acid transport and metabolism]	NA|153aa|down_9|NC_008312.1_1412247_1412706_+	pfam09055, Sod_Ni, Nickel-containing superoxide dismutase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	11	1511014-1511160	9	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CTACCACTAAAATATTTTCACCATTTACTCTGTAGGTCATTTAGTCAATCA	51	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|95aa|up_4|NC_008312.1_1501446_1501731_+,NA	NA|110aa|up_9|NC_008312.1_1491214_1491544_-	pfam00085, Thioredoxin, Thioredoxin	NA|65aa|up_8|NC_008312.1_1493240_1493435_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|346aa|up_7|NC_008312.1_1494751_1495789_+	PRK09479, glpX, fructose 1,6-bisphosphatase II; Reviewed	NA|432aa|up_6|NC_008312.1_1496537_1497833_+	PRK00045, hemA, glutamyl-tRNA reductase; Reviewed	NA|502aa|up_5|NC_008312.1_1498059_1499565_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|95aa|up_4|NC_008312.1_1501446_1501731_+	NA	NA|215aa|up_3|NC_008312.1_1502662_1503307_-	pfam11152, CCB2_CCB4, Cofactor assembly of complex C subunit B, CCB2/CCB4	NA|244aa|up_2|NC_008312.1_1503513_1504245_-	CHL00148, orf27, Ycf27; Reviewed	NA|518aa|up_1|NC_008312.1_1505692_1507246_+	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|361aa|up_0|NC_008312.1_1509380_1510463_-	COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]	NA|430aa|down_0|NC_008312.1_1512047_1513337_+	smart00854, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|111aa|down_1|NC_008312.1_1514702_1515035_-	pfam18135, Type_ISP_C, Type ISP C-terminal specificity domain	NA|159aa|down_2|NC_008312.1_1517231_1517708_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|85aa|down_3|NC_008312.1_1518213_1518468_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|414aa|down_4|NC_008312.1_1519888_1521130_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|325aa|down_5|NC_008312.1_1522900_1523875_+	PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated	NA|244aa|down_6|NC_008312.1_1525248_1525980_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|480aa|down_7|NC_008312.1_1527064_1528504_+	cd05800, PGM_like2, This PGM-like (phosphoglucomutase-like) protein of unknown function belongs to the alpha-D-phosphohexomutase superfamily and is found in both archaea and bacteria	NA|572aa|down_8|NC_008312.1_1531675_1533391_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|434aa|down_9|NC_008312.1_1534515_1535817_-	COG1070, XylB, Sugar (pentulose and hexulose) kinases [Carbohydrate transport and metabolism]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	12	1606273-1607292	4	CRT	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTTTTNANTTATNAGCTTTT	20	0	0	NA	NA	NA	14	14	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|100aa|up_6|NC_008312.1_1594182_1594482_-,NA|58aa|up_3|NC_008312.1_1597721_1597895_+,NA|57aa|down_3|NC_008312.1_1616567_1616738_-	NA|306aa|up_9|NC_008312.1_1585963_1586881_+	PRK01212, PRK01212, homoserine kinase; Provisional	NA|1325aa|up_8|NC_008312.1_1587406_1591381_-	PLN02666, PLN02666, 5-oxoprolinase	NA|466aa|up_7|NC_008312.1_1591705_1593103_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|100aa|up_6|NC_008312.1_1594182_1594482_-	NA	NA|378aa|up_5|NC_008312.1_1594656_1595790_+	cd08550, GlyDH-like, Glycerol_dehydrogenase-like	NA|395aa|up_4|NC_008312.1_1596120_1597305_+	PRK05942, PRK05942, aspartate aminotransferase; Provisional	NA|58aa|up_3|NC_008312.1_1597721_1597895_+	NA	NA|227aa|up_2|NC_008312.1_1601166_1601847_+	pfam04955, HupE_UreJ, HupE / UreJ protein	NA|349aa|up_1|NC_008312.1_1602685_1603732_+	TIGR02475, Probable_cobalamine_biosynthesis_protein, cobalamin biosynthesis protein CobW	NA|394aa|up_0|NC_008312.1_1605056_1606238_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|196aa|down_0|NC_008312.1_1607633_1608221_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|72aa|down_1|NC_008312.1_1608193_1608409_-	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|101aa|down_2|NC_008312.1_1615166_1615469_+	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	NA|57aa|down_3|NC_008312.1_1616567_1616738_-	NA	NA|59aa|down_4|NC_008312.1_1617907_1618084_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|618aa|down_5|NC_008312.1_1619339_1621193_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|228aa|down_6|NC_008312.1_1624554_1625238_+	COG0664, Crp, cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]	NA|601aa|down_7|NC_008312.1_1625439_1627242_+	COG3975, COG3975, Predicted protease with the C-terminal PDZ domain [General function prediction only]	NA|305aa|down_8|NC_008312.1_1627717_1628632_+	smart00327, VWA, von Willebrand factor (vWF) type A domain	NA|165aa|down_9|NC_008312.1_1629978_1630473_+	cd07177, terB_like, tellurium resistance terB-like protein
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	13	1645960-1646085	10	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CAAATTTACCTAACATTAGAAAACCAAAAATTATGT	36	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|293aa|up_5|NC_008312.1_1632116_1632995_+,NA|117aa|up_1|NC_008312.1_1638480_1638831_-,NA|56aa|down_2|NC_008312.1_1650526_1650694_-,NA|65aa|down_3|NC_008312.1_1650964_1651159_+,NA|48aa|down_4|NC_008312.1_1651768_1651912_-	NA|601aa|up_9|NC_008312.1_1625439_1627242_+	COG3975, COG3975, Predicted protease with the C-terminal PDZ domain [General function prediction only]	NA|305aa|up_8|NC_008312.1_1627717_1628632_+	smart00327, VWA, von Willebrand factor (vWF) type A domain	NA|165aa|up_7|NC_008312.1_1629978_1630473_+	cd07177, terB_like, tellurium resistance terB-like protein	NA|184aa|up_6|NC_008312.1_1631176_1631728_-	PLN00072, PLN00072, 3-isopropylmalate isomerase/dehydratase small subunit; Provisional	NA|293aa|up_5|NC_008312.1_1632116_1632995_+	NA	NA|560aa|up_4|NC_008312.1_1633211_1634891_+	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|118aa|up_3|NC_008312.1_1636297_1636651_-	pfam06868, DUF1257, Protein of unknown function (DUF1257)	NA|507aa|up_2|NC_008312.1_1636807_1638328_-	CHL00195, ycf46, Ycf46; Provisional	NA|117aa|up_1|NC_008312.1_1638480_1638831_-	NA	NA|86aa|up_0|NC_008312.1_1643889_1644147_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|239aa|down_0|NC_008312.1_1647639_1648356_-	cd08556, GDPD, Glycerophosphodiester phosphodiesterase domain as found in prokaryota and eukaryota, and similar proteins	NA|423aa|down_1|NC_008312.1_1648440_1649709_+	pfam00395, SLH, S-layer homology domain	NA|56aa|down_2|NC_008312.1_1650526_1650694_-	NA	NA|65aa|down_3|NC_008312.1_1650964_1651159_+	NA	NA|48aa|down_4|NC_008312.1_1651768_1651912_-	NA	NA|575aa|down_5|NC_008312.1_1652248_1653973_+	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|551aa|down_6|NC_008312.1_1654058_1655711_-	COG0659, SUL1, Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism]	NA|97aa|down_7|NC_008312.1_1655921_1656212_+	pfam13384, HTH_23, Homeodomain-like domain	NA|69aa|down_8|NC_008312.1_1656677_1656884_-	pfam13737, DDE_Tnp_1_5, Transposase DDE domain	NA|212aa|down_9|NC_008312.1_1661571_1662207_-	TIGR03725, T6A_YeaZ, tRNA threonylcarbamoyl adenosine modification protein YeaZ
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	14	1851510-1851619	11	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GTTCTGGTGTTGGTTCTGGTGTTGGT	26	1	7	1851536-1851551|1851536-1851551|1851536-1851551|1851536-1851551|1851536-1851551|1851536-1851551|1851536-1851551	NC_008312.1_1851620-1851635|NC_008312.1_6468189-6468204|NC_008312.1_6468237-6468252|NC_008312.1_6468249-6468264|NC_008312.1_6468261-6468276|NC_008312.1_6468273-6468288|NC_008312.1_6468285-6468300	NA	2	2	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|51aa|up_7|NC_008312.1_1834963_1835116_+,NA|54aa|up_6|NC_008312.1_1835656_1835818_+,NA|464aa|up_0|NC_008312.1_1849483_1850875_-,NA|428aa|down_2|NC_008312.1_1854402_1855686_-	NA|71aa|up_9|NC_008312.1_1833561_1833774_-	cd01716, Hfq, bacterial Hfq-like	NA|279aa|up_8|NC_008312.1_1833836_1834673_+	PLN02536, PLN02536, diaminopimelate epimerase	NA|51aa|up_7|NC_008312.1_1834963_1835116_+	NA	NA|54aa|up_6|NC_008312.1_1835656_1835818_+	NA	NA|253aa|up_5|NC_008312.1_1837407_1838166_+	PRK06136, PRK06136, uroporphyrinogen-III C-methyltransferase	NA|206aa|up_4|NC_008312.1_1838474_1839092_+	PRK02726, PRK02726, molybdenum cofactor guanylyltransferase	NA|57aa|up_3|NC_008312.1_1840226_1840397_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|72aa|up_2|NC_008312.1_1843886_1844102_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|988aa|up_1|NC_008312.1_1844522_1847486_+	COG1026, COG1026, Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]	NA|464aa|up_0|NC_008312.1_1849483_1850875_-	NA	NA|110aa|down_0|NC_008312.1_1852806_1853136_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|339aa|down_1|NC_008312.1_1853301_1854318_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|428aa|down_2|NC_008312.1_1854402_1855686_-	NA	NA|578aa|down_3|NC_008312.1_1855866_1857600_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|321aa|down_4|NC_008312.1_1857686_1858649_-	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|491aa|down_5|NC_008312.1_1862475_1863948_-	pfam13809, Tubulin_2, Tubulin like	NA|86aa|down_6|NC_008312.1_1866981_1867239_-	pfam08865, DUF1830, Domain of unknown function (DUF1830)	NA|423aa|down_7|NC_008312.1_1867439_1868708_-	PRK07379, PRK07379, coproporphyrinogen III oxidase; Provisional	NA|360aa|down_8|NC_008312.1_1869678_1870758_+	COG4956, COG4956, Integral membrane protein (PIN domain superfamily) [General function prediction only]	NA|218aa|down_9|NC_008312.1_1871193_1871847_+	TIGR04282, hypothetical_protein, transferase 1, rSAM/selenodomain-associated
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	15	1921946-1922051	12	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GAATACTTCTTTTATCCCTTTGAAA	25	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|167aa|up_3|NC_008312.1_1916944_1917445_-,NA|63aa|down_8|NC_008312.1_1943206_1943395_+	NA|89aa|up_9|NC_008312.1_1902053_1902320_-	PRK13683, PRK13683, hypothetical protein; Provisional	NA|353aa|up_8|NC_008312.1_1904237_1905296_+	TIGR01152, Photosystem_II_D2_protein, Photosystem II, DII subunit (also called Q(A))	NA|94aa|up_7|NC_008312.1_1907659_1907941_+	PRK00823, phhB, pterin-4-alpha-carbinolamine dehydratase; Validated	NA|545aa|up_6|NC_008312.1_1908789_1910424_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|640aa|up_5|NC_008312.1_1911567_1913487_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|321aa|up_4|NC_008312.1_1914920_1915883_+	TIGR01139, Cysteine_synthase, cysteine synthase A	NA|167aa|up_3|NC_008312.1_1916944_1917445_-	NA	NA|607aa|up_2|NC_008312.1_1917667_1919488_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|100aa|up_1|NC_008312.1_1919536_1919836_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|601aa|up_0|NC_008312.1_1919853_1921656_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|225aa|down_0|NC_008312.1_1922202_1922877_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|413aa|down_1|NC_008312.1_1926210_1927449_-	COG2304, COG2304, Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]	NA|230aa|down_2|NC_008312.1_1930018_1930708_+	PRK12552, PRK12552, ATP-dependent Clp protease proteolytic subunit	NA|197aa|down_3|NC_008312.1_1930922_1931513_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|209aa|down_4|NC_008312.1_1934395_1935022_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|416aa|down_5|NC_008312.1_1935766_1937014_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|315aa|down_6|NC_008312.1_1938099_1939044_+	COG0530, ECM27, Ca2+/Na+ antiporter [Inorganic ion transport and metabolism]	NA|78aa|down_7|NC_008312.1_1942928_1943162_+	pfam13592, HTH_33, Winged helix-turn helix	NA|63aa|down_8|NC_008312.1_1943206_1943395_+	NA	NA|122aa|down_9|NC_008312.1_1944501_1944867_-	COG1950, COG1950, Predicted membrane protein [Function unknown]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	16	1930655-1930752	13	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CAAGTTTTAGCAAGTAGAAAAGAACTA	27	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|167aa|up_5|NC_008312.1_1916944_1917445_-,NA|63aa|down_5|NC_008312.1_1943206_1943395_+	NA|94aa|up_9|NC_008312.1_1907659_1907941_+	PRK00823, phhB, pterin-4-alpha-carbinolamine dehydratase; Validated	NA|545aa|up_8|NC_008312.1_1908789_1910424_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|640aa|up_7|NC_008312.1_1911567_1913487_+	pfam04966, OprB, Carbohydrate-selective porin, OprB family	NA|321aa|up_6|NC_008312.1_1914920_1915883_+	TIGR01139, Cysteine_synthase, cysteine synthase A	NA|167aa|up_5|NC_008312.1_1916944_1917445_-	NA	NA|607aa|up_4|NC_008312.1_1917667_1919488_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|100aa|up_3|NC_008312.1_1919536_1919836_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|601aa|up_2|NC_008312.1_1919853_1921656_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|225aa|up_1|NC_008312.1_1922202_1922877_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|413aa|up_0|NC_008312.1_1926210_1927449_-	COG2304, COG2304, Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]	NA|197aa|down_0|NC_008312.1_1930922_1931513_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|209aa|down_1|NC_008312.1_1934395_1935022_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|416aa|down_2|NC_008312.1_1935766_1937014_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|315aa|down_3|NC_008312.1_1938099_1939044_+	COG0530, ECM27, Ca2+/Na+ antiporter [Inorganic ion transport and metabolism]	NA|78aa|down_4|NC_008312.1_1942928_1943162_+	pfam13592, HTH_33, Winged helix-turn helix	NA|63aa|down_5|NC_008312.1_1943206_1943395_+	NA	NA|122aa|down_6|NC_008312.1_1944501_1944867_-	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|364aa|down_7|NC_008312.1_1944893_1945985_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|50aa|down_8|NC_008312.1_1951618_1951768_+	pfam01710, HTH_Tnp_IS630, Transposase	NA|101aa|down_9|NC_008312.1_1954985_1955288_+	pfam13551, HTH_29, Winged helix-turn helix
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	17	1941216-1941324	14	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	ATTGAAAACTTTTGGTTAAAAGTGAAAGCTATTCTAA	37	1	1	1941253-1941287	NC_008312.1_7654959-7654993	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA,NA|63aa|down_1|NC_008312.1_1943206_1943395_+,NA|56aa|down_7|NC_008312.1_1955777_1955945_+	NA|607aa|up_9|NC_008312.1_1917667_1919488_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|100aa|up_8|NC_008312.1_1919536_1919836_-	cd10229, HSPA12_like_NBD, Nucleotide-binding domain of HSPA12A, HSPA12B and similar proteins	NA|601aa|up_7|NC_008312.1_1919853_1921656_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|225aa|up_6|NC_008312.1_1922202_1922877_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|413aa|up_5|NC_008312.1_1926210_1927449_-	COG2304, COG2304, Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only]	NA|230aa|up_4|NC_008312.1_1930018_1930708_+	PRK12552, PRK12552, ATP-dependent Clp protease proteolytic subunit	NA|197aa|up_3|NC_008312.1_1930922_1931513_+	PRK00277, clpP, ATP-dependent Clp protease proteolytic subunit; Reviewed	NA|209aa|up_2|NC_008312.1_1934395_1935022_+	COG0484, DnaJ, DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]	NA|416aa|up_1|NC_008312.1_1935766_1937014_+	TIGR01444, 2-O-methyltransferase_NoeI, methyltransferase, FkbM family	NA|315aa|up_0|NC_008312.1_1938099_1939044_+	COG0530, ECM27, Ca2+/Na+ antiporter [Inorganic ion transport and metabolism]	NA|78aa|down_0|NC_008312.1_1942928_1943162_+	pfam13592, HTH_33, Winged helix-turn helix	NA|63aa|down_1|NC_008312.1_1943206_1943395_+	NA	NA|122aa|down_2|NC_008312.1_1944501_1944867_-	COG1950, COG1950, Predicted membrane protein [Function unknown]	NA|364aa|down_3|NC_008312.1_1944893_1945985_-	PRK09601, PRK09601, redox-regulated ATPase YchF	NA|50aa|down_4|NC_008312.1_1951618_1951768_+	pfam01710, HTH_Tnp_IS630, Transposase	NA|101aa|down_5|NC_008312.1_1954985_1955288_+	pfam13551, HTH_29, Winged helix-turn helix	NA|145aa|down_6|NC_008312.1_1955349_1955784_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|56aa|down_7|NC_008312.1_1955777_1955945_+	NA	NA|47aa|down_8|NC_008312.1_1956045_1956186_-	COG3415, COG3415, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|70aa|down_9|NC_008312.1_1956797_1957007_+	pfam13597, NRDD, Anaerobic ribonucleoside-triphosphate reductase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	18	2180791-2180905	15	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TGGAGACGGGCTCTGGCTGCTGCAATTTTTTCTGGTC	37	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|358aa|up_8|NC_008312.1_2165107_2166181_-,NA|184aa|up_6|NC_008312.1_2167778_2168330_+,NA|59aa|down_0|NC_008312.1_2182027_2182204_+,NA|83aa|down_2|NC_008312.1_2183804_2184053_-,NA|60aa|down_5|NC_008312.1_2187782_2187962_+,NA|66aa|down_6|NC_008312.1_2188291_2188489_+	NA|346aa|up_9|NC_008312.1_2164054_2165092_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|358aa|up_8|NC_008312.1_2165107_2166181_-	NA	NA|360aa|up_7|NC_008312.1_2166251_2167331_-	cd09008, MTAN, 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidases	NA|184aa|up_6|NC_008312.1_2167778_2168330_+	NA	NA|67aa|up_5|NC_008312.1_2169562_2169763_+	PRK02576, psbZ, photosystem II reaction center protein PsbZ	NA|194aa|up_4|NC_008312.1_2170543_2171125_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|37aa|up_3|NC_008312.1_2172813_2172924_-	pfam13612, DDE_Tnp_1_3, Transposase DDE domain	NA|271aa|up_2|NC_008312.1_2173202_2174015_-	COG1398, OLE1, Fatty-acid desaturase [Lipid metabolism]	NA|417aa|up_1|NC_008312.1_2174745_2175996_+	pfam06838, Met_gamma_lyase, Methionine gamma-lyase	NA|465aa|up_0|NC_008312.1_2177666_2179061_-	pfam00931, NB-ARC, NB-ARC domain	NA|59aa|down_0|NC_008312.1_2182027_2182204_+	NA	NA|44aa|down_1|NC_008312.1_2182687_2182819_-	pfam01710, HTH_Tnp_IS630, Transposase	NA|83aa|down_2|NC_008312.1_2183804_2184053_-	NA	NA|157aa|down_3|NC_008312.1_2184007_2184478_-	pfam14516, AAA_35, AAA-like domain	NA|143aa|down_4|NC_008312.1_2184529_2184958_-	pfam14516, AAA_35, AAA-like domain	NA|60aa|down_5|NC_008312.1_2187782_2187962_+	NA	NA|66aa|down_6|NC_008312.1_2188291_2188489_+	NA	NA|918aa|down_7|NC_008312.1_2188782_2191536_-	pfam14252, DUF4347, Domain of unknown function (DUF4347)	NA|659aa|down_8|NC_008312.1_2191816_2193793_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|425aa|down_9|NC_008312.1_2193827_2195102_-	cd19978, PBP1_ABC_ligand_binding-like, periplasmic ligand-binding domain of uncharacterized ABC-type transport systems predicted to be involved in the uptake of amino acids, peptides, or inorganic ions
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	19	2353674-2353765	16	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GTTACAGGCTGAACAAGAGCGAGAGCGGG	29	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|46aa|up_9|NC_008312.1_2328887_2329025_+,NA|80aa|up_8|NC_008312.1_2329101_2329341_+,NA|701aa|up_7|NC_008312.1_2330507_2332610_-,NA|118aa|down_1|NC_008312.1_2359258_2359612_+,NA|462aa|down_2|NC_008312.1_2360071_2361457_+,NA|125aa|down_8|NC_008312.1_2373079_2373454_+	NA|46aa|up_9|NC_008312.1_2328887_2329025_+	NA	NA|80aa|up_8|NC_008312.1_2329101_2329341_+	NA	NA|701aa|up_7|NC_008312.1_2330507_2332610_-	NA	NA|365aa|up_6|NC_008312.1_2333972_2335067_+	TIGR01208, rmlA_long, glucose-1-phosphate thymidylylransferase, long form	NA|281aa|up_5|NC_008312.1_2335383_2336226_+	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|361aa|up_4|NC_008312.1_2336376_2337459_+	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|306aa|up_3|NC_008312.1_2338370_2339288_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|718aa|up_2|NC_008312.1_2339840_2341994_-	PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase; Provisional	NA|1532aa|up_1|NC_008312.1_2343202_2347798_+	COG0553, HepA, Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair]	NA|392aa|up_0|NC_008312.1_2348273_2349449_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|1746aa|down_0|NC_008312.1_2354017_2359255_+	pfam12770, CHAT, CHAT domain	NA|118aa|down_1|NC_008312.1_2359258_2359612_+	NA	NA|462aa|down_2|NC_008312.1_2360071_2361457_+	NA	NA|162aa|down_3|NC_008312.1_2361895_2362381_+	pfam18306, LDcluster4, SLOG cluster4 family	NA|379aa|down_4|NC_008312.1_2362917_2364054_-	COG1873, COG1873, Protein implicated in RNA metabolism, contains PRC-barrel domain [General    function prediction only]	NA|1220aa|down_5|NC_008312.1_2364598_2368258_-	TIGR02169, chromosome_segregation_protein_related_ptotein, chromosome segregation protein SMC, primarily archaeal type	NA|266aa|down_6|NC_008312.1_2369007_2369805_-	COG2875, CobM, Precorrin-4 methylase [Coenzyme metabolism]	NA|289aa|down_7|NC_008312.1_2369917_2370784_-	pfam01790, LGT, Prolipoprotein diacylglyceryl transferase	NA|125aa|down_8|NC_008312.1_2373079_2373454_+	NA	NA|369aa|down_9|NC_008312.1_2374238_2375345_-	PRK04447, PRK04447, hypothetical protein; Provisional
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	20	2407713-2407829	17	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTTATTATATATCGGGTCTATAAATAATTTTCT	33	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA,NA|67aa|down_0|NC_008312.1_2409950_2410151_-,NA|21aa|down_9|NC_008312.1_2422414_2422477_-	NA|226aa|up_9|NC_008312.1_2389694_2390372_+	PRK00698, tmk, thymidylate kinase; Validated	NA|320aa|up_8|NC_008312.1_2390590_2391550_+	PRK07399, PRK07399, DNA polymerase III subunit delta'; Validated	NA|133aa|up_7|NC_008312.1_2391935_2392334_-	pfam03745, DUF309, Domain of unknown function (DUF309)	NA|122aa|up_6|NC_008312.1_2392916_2393282_-	CHL00165, ftrB, ferredoxin thioreductase subunit beta; Validated	NA|821aa|up_5|NC_008312.1_2394704_2397167_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|385aa|up_4|NC_008312.1_2397803_2398958_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|516aa|up_3|NC_008312.1_2399102_2400650_-	COG0758, Smf, Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion]	NA|694aa|up_2|NC_008312.1_2402845_2404927_-	COG0514, RecQ, Superfamily II DNA helicase [DNA replication, recombination, and repair]	NA|169aa|up_1|NC_008312.1_2405020_2405527_+	PRK00168, coaD, phosphopantetheine adenylyltransferase; Provisional	NA|223aa|up_0|NC_008312.1_2405481_2406150_+	cd06503, ATP-synt_Fo_b, F-type ATP synthase, membrane subunit b	NA|67aa|down_0|NC_008312.1_2409950_2410151_-	NA	NA|104aa|down_1|NC_008312.1_2411298_2411610_-	CHL00015, ndhE, NADH dehydrogenase subunit 4L	NA|208aa|down_2|NC_008312.1_2411650_2412274_-	CHL00016, ndhG, NADH dehydrogenase subunit 6	NA|210aa|down_3|NC_008312.1_2412393_2413023_-	TIGR00403, NADPH-quinone_oxidoreductase_subunit_I, NADH-plastoquinone oxidoreductase subunit I protein	NA|373aa|down_4|NC_008312.1_2415283_2416402_-	CHL00032, ndhA, NADH dehydrogenase subunit 1	NA|384aa|down_5|NC_008312.1_2416672_2417824_-	PRK14036, PRK14036, citrate synthase; Provisional	NA|168aa|down_6|NC_008312.1_2417921_2418425_-	COG2062, SixA, Phosphohistidine phosphatase SixA [Signal transduction mechanisms]	NA|166aa|down_7|NC_008312.1_2419307_2419805_-	COG1403, McrA, Restriction endonuclease [Defense mechanisms]	NA|396aa|down_8|NC_008312.1_2420397_2421585_+	PRK00053, alr, alanine racemase; Reviewed	NA|21aa|down_9|NC_008312.1_2422414_2422477_-	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	21	2627134-2631455	5	CRT	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	NNTTTGAGGTTTNANGTT	18	25	54	2627152-2627177|2627152-2627177|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627443-2627467|2627587-2627605|2627587-2627605|2627624-2627649|2628213-2628231|2628213-2628231|2628213-2628231|2628213-2628231|2628294-2628312|2628294-2628312|2628331-2628370|2628475-2628493|2628475-2628493|2628475-2628493|2628475-2628493|2628512-2628530|2628592-2628610|2628629-2628647|2628629-2628647|2628710-2628735|2628754-2628772|2629023-2629041|2629023-2629041|2629118-2629142|2629205-2629223|2629205-2629223|2629300-2629324|2630042-2630066|2630418-2630436|2630759-2630777|2630998-2631016|2631035-2631059|2631078-2631102|2631121-2631139|2631121-2631139|2631303-2631321|2631303-2631321	NC_008312.1_2631447-2631472|NC_008312.1_2941890-2941915|NC_008312.1_848634-848658|NC_008312.1_1538289-1538313|NC_008312.1_1538872-1538848|NC_008312.1_2705216-2705240|NC_008312.1_2705223-2705247|NC_008312.1_2941325-2941349|NC_008312.1_2941883-2941907|NC_008312.1_3331280-3331256|NC_008312.1_3331287-3331263|NC_008312.1_3331534-3331510|NC_008312.1_3331541-3331517|NC_008312.1_3331830-3331806|NC_008312.1_3375660-3375636|NC_008312.1_4271774-4271750|NC_008312.1_5423269-5423293|NC_008312.1_7559000-7558976|NC_008312.1_2941869-2941887|NC_008312.1_7158754-7158772|NC_008312.1_2941341-2941366|NC_008312.1_1922567-1922549|NC_008312.1_4987984-4987966|NC_008312.1_5832738-5832720|NC_008312.1_7325462-7325480|NC_008312.1_4987984-4987966|NC_008312.1_7325462-7325480|NC_008312.1_2631433-2631472|NC_008312.1_1922567-1922549|NC_008312.1_4987984-4987966|NC_008312.1_5832738-5832720|NC_008312.1_7325462-7325480|NC_008312.1_3958059-3958077|NC_008312.1_3958059-3958077|NC_008312.1_1282054-1282036|NC_008312.1_1993335-1993353|NC_008312.1_2631433-2631458|NC_008312.1_2698739-2698721|NC_008312.1_2627131-2627149|NC_008312.1_2396366-2396384|NC_008312.1_2627125-2627149|NC_008312.1_2627131-2627149|NC_008312.1_2396366-2396384|NC_008312.1_2627125-2627149|NC_008312.1_2627125-2627149|NC_008312.1_2698739-2698721|NC_008312.1_2698739-2698721|NC_008312.1_2698739-2698721|NC_008312.1_2941335-2941359|NC_008312.1_2627125-2627149|NC_008312.1_2627131-2627149|NC_008312.1_2396366-2396384|NC_008312.1_2941341-2941359|NC_008312.1_5816731-5816713	NA	85	85	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|338aa|up_1|NC_008312.1_2624875_2625889_+,NA|62aa|down_1|NC_008312.1_2632599_2632785_+,NA|117aa|down_6|NC_008312.1_2640086_2640437_-,NA|117aa|down_7|NC_008312.1_2640870_2641221_-,NA|50aa|down_8|NC_008312.1_2641514_2641664_-	NA|753aa|up_9|NC_008312.1_2611092_2613351_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|86aa|up_8|NC_008312.1_2613496_2613754_-	pfam00931, NB-ARC, NB-ARC domain	NA|68aa|up_7|NC_008312.1_2616547_2616751_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|50aa|up_6|NC_008312.1_2616707_2616857_-	pfam02643, DUF192, Uncharacterized ACR, COG1430	NA|506aa|up_5|NC_008312.1_2617043_2618561_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|495aa|up_4|NC_008312.1_2618664_2620149_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|534aa|up_3|NC_008312.1_2621231_2622833_+	pfam13282, DUF4070, Domain of unknown function (DUF4070)	NA|391aa|up_2|NC_008312.1_2623271_2624444_+	PRK00064, recF, recombination protein F; Reviewed	NA|338aa|up_1|NC_008312.1_2624875_2625889_+	NA	NA|68aa|up_0|NC_008312.1_2626613_2626817_-	pfam05685, Uma2, Putative restriction endonuclease	NA|293aa|down_0|NC_008312.1_2631604_2632483_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|62aa|down_1|NC_008312.1_2632599_2632785_+	NA	NA|114aa|down_2|NC_008312.1_2633968_2634310_-	NF033474, DivGenRetAVD, diversity-generating retroelement protein Avd	NA|878aa|down_3|NC_008312.1_2634406_2637040_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|351aa|down_4|NC_008312.1_2637043_2638096_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|644aa|down_5|NC_008312.1_2638092_2640024_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|117aa|down_6|NC_008312.1_2640086_2640437_-	NA	NA|117aa|down_7|NC_008312.1_2640870_2641221_-	NA	NA|50aa|down_8|NC_008312.1_2641514_2641664_-	NA	NA|396aa|down_9|NC_008312.1_2643844_2645032_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	22	2921383-2921463	18	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTTTACTAAGGTATCAGATAATAT	24	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|105aa|up_7|NC_008312.1_2886803_2887118_+,NA|394aa|down_0|NC_008312.1_2921965_2923147_+,NA|839aa|down_1|NC_008312.1_2923767_2926284_+,NA|507aa|down_2|NC_008312.1_2926395_2927916_+	NA|166aa|up_9|NC_008312.1_2883902_2884400_+	pfam13565, HTH_32, Homeodomain-like domain	NA|531aa|up_8|NC_008312.1_2884845_2886438_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|105aa|up_7|NC_008312.1_2886803_2887118_+	NA	NA|2685aa|up_6|NC_008312.1_2888236_2896291_+	PRK09532, PRK09532, DNA polymerase III subunit alpha; Reviewed	NA|664aa|up_5|NC_008312.1_2897003_2898995_-	smart00187, INB, Integrin beta subunits (N-terminal portion of extracellular region)	NA|864aa|up_4|NC_008312.1_2902685_2905277_-	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|2633aa|up_3|NC_008312.1_2905930_2913829_-	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain	NA|706aa|up_2|NC_008312.1_2916259_2918377_-	COG1523, PulA, Type II secretory pathway, pullulanase PulA and related glycosidases [Carbohydrate transport and metabolism]	NA|69aa|up_1|NC_008312.1_2918579_2918786_+	pfam00582, Usp, Universal stress protein family	NA|37aa|up_0|NC_008312.1_2918932_2919043_+	COG0589, UspA, Universal stress protein UspA and related nucleotide-binding proteins [Signal transduction mechanisms]	NA|394aa|down_0|NC_008312.1_2921965_2923147_+	NA	NA|839aa|down_1|NC_008312.1_2923767_2926284_+	NA	NA|507aa|down_2|NC_008312.1_2926395_2927916_+	NA	NA|874aa|down_3|NC_008312.1_2928041_2930663_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|133aa|down_4|NC_008312.1_2932408_2932807_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|745aa|down_5|NC_008312.1_2933587_2935822_-	TIGR00930, Solute_carrier_family_12_member_1, K-Cl cotransporter	NA|546aa|down_6|NC_008312.1_2936255_2937893_+	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|518aa|down_7|NC_008312.1_2939720_2941274_-	PRK07373, PRK07373, DNA polymerase III subunit alpha; Reviewed	NA|1010aa|down_8|NC_008312.1_2947741_2950771_+	TIGR00845, Sodium/calcium_exchanger_1, sodium/calcium exchanger 1	NA|292aa|down_9|NC_008312.1_2954959_2955835_+	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	23	2941350-2941874	6	CRT	no	cas14j	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	AGGTGTNAATTTTTTGAGGT	20	1	18	2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407|2941370-2941407	NC_008312.1_2941312-2941349|NC_008312.1_3331293-3331256|NC_008312.1_3331547-3331510|NC_008312.1_2705217-2705254|NC_008312.1_3331265-3331228|NC_008312.1_3331272-3331235|NC_008312.1_3331279-3331242|NC_008312.1_3331286-3331249|NC_008312.1_3331300-3331263|NC_008312.1_3331307-3331270|NC_008312.1_3331314-3331277|NC_008312.1_3331406-3331369|NC_008312.1_3331420-3331383|NC_008312.1_3331427-3331390|NC_008312.1_3331533-3331496|NC_008312.1_3331540-3331503|NC_008312.1_3331702-3331665|NC_008312.1_3331723-3331686	NA	8	8	TypeV	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|734aa|up_8|NC_008312.1_2919565_2921767_+,NA|394aa|up_7|NC_008312.1_2921965_2923147_+,NA|839aa|up_6|NC_008312.1_2923767_2926284_+,NA|507aa|up_5|NC_008312.1_2926395_2927916_+,NA|97aa|down_2|NC_008312.1_2956693_2956984_-	NA|37aa|up_9|NC_008312.1_2918932_2919043_+	COG0589, UspA, Universal stress protein UspA and related nucleotide-binding proteins [Signal transduction mechanisms]	NA|734aa|up_8|NC_008312.1_2919565_2921767_+	NA	NA|394aa|up_7|NC_008312.1_2921965_2923147_+	NA	NA|839aa|up_6|NC_008312.1_2923767_2926284_+	NA	NA|507aa|up_5|NC_008312.1_2926395_2927916_+	NA	NA|874aa|up_4|NC_008312.1_2928041_2930663_-	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|133aa|up_3|NC_008312.1_2932408_2932807_+	cd17548, REC_DivK-like, phosphoacceptor receiver (REC) domain of DivK and similar proteins	NA|745aa|up_2|NC_008312.1_2933587_2935822_-	TIGR00930, Solute_carrier_family_12_member_1, K-Cl cotransporter	NA|546aa|up_1|NC_008312.1_2936255_2937893_+	pfam05128, DUF697, Domain of unknown function (DUF697)	NA|518aa|up_0|NC_008312.1_2939720_2941274_-	PRK07373, PRK07373, DNA polymerase III subunit alpha; Reviewed	NA|1010aa|down_0|NC_008312.1_2947741_2950771_+	TIGR00845, Sodium/calcium_exchanger_1, sodium/calcium exchanger 1	NA|292aa|down_1|NC_008312.1_2954959_2955835_+	cd07325, M48_Ste24p_like, M48 Ste24 endopeptidase-like, integral membrane metallopeptidase	NA|97aa|down_2|NC_008312.1_2956693_2956984_-	NA	cas14j|272aa|down_3|NC_008312.1_2957063_2957879_-	pfam12323, HTH_OrfB_IS605, Helix-turn-helix domain	NA|57aa|down_4|NC_008312.1_2958688_2958859_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|189aa|down_5|NC_008312.1_2959028_2959595_+	pfam12263, DUF3611, Protein of unknown function (DUF3611)	NA|247aa|down_6|NC_008312.1_2960959_2961700_+	pfam13413, HTH_25, Helix-turn-helix domain	NA|370aa|down_7|NC_008312.1_2961696_2962806_-	PRK05429, PRK05429, gamma-glutamyl kinase; Provisional	NA|357aa|down_8|NC_008312.1_2963590_2964661_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|118aa|down_9|NC_008312.1_2965670_2966024_-	pfam01187, MIF, Macrophage migration inhibitory factor (MIF)
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	24	3021220-3021456	19	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GCCTGAAGCGGAATTAATGGAAAC	24	0	0	NA	NA	NA	3	3	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|116aa|up_9|NC_008312.1_3003397_3003745_+,NA|223aa|up_8|NC_008312.1_3005399_3006068_-,NA|80aa|up_4|NC_008312.1_3014152_3014392_-,NA|194aa|up_3|NC_008312.1_3015716_3016298_+,NA|51aa|up_0|NC_008312.1_3020410_3020563_-,NA|68aa|down_0|NC_008312.1_3022167_3022371_-	NA|116aa|up_9|NC_008312.1_3003397_3003745_+	NA	NA|223aa|up_8|NC_008312.1_3005399_3006068_-	NA	NA|295aa|up_7|NC_008312.1_3006412_3007297_-	cd01945, ribokinase_group_B, Ribokinase-like subgroup B	NA|451aa|up_6|NC_008312.1_3008679_3010032_+	TIGR04095, type_III_restriction_protein_res_subunit, DNA phosphorothioation system restriction enzyme	NA|890aa|up_5|NC_008312.1_3010873_3013543_-	pfam03200, Glyco_hydro_63, Glycosyl hydrolase family 63 C-terminal domain	NA|80aa|up_4|NC_008312.1_3014152_3014392_-	NA	NA|194aa|up_3|NC_008312.1_3015716_3016298_+	NA	NA|261aa|up_2|NC_008312.1_3016553_3017336_+	COG3694, COG3694, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|565aa|up_1|NC_008312.1_3018029_3019724_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|51aa|up_0|NC_008312.1_3020410_3020563_-	NA	NA|68aa|down_0|NC_008312.1_3022167_3022371_-	NA	NA|74aa|down_1|NC_008312.1_3023055_3023277_-	pfam02941, FeThRed_A, Ferredoxin thioredoxin reductase variable alpha chain	NA|646aa|down_2|NC_008312.1_3023399_3025337_-	PRK14559, PRK14559, serine/threonine phosphatase	NA|155aa|down_3|NC_008312.1_3025996_3026461_+	COG1585, COG1585, Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|322aa|down_4|NC_008312.1_3026641_3027607_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|132aa|down_5|NC_008312.1_3027868_3028264_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|88aa|down_6|NC_008312.1_3028347_3028611_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|232aa|down_7|NC_008312.1_3028863_3029559_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|411aa|down_8|NC_008312.1_3031976_3033209_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|293aa|down_9|NC_008312.1_3033920_3034799_-	PRK05481, PRK05481, lipoyl synthase; Provisional
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	25	3030162-3030266	20	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AAAATGTTATTGAATCTATTCATTTTCAGC	30	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|51aa|up_8|NC_008312.1_3020410_3020563_-,NA|68aa|up_7|NC_008312.1_3022167_3022371_-,NA|53aa|down_4|NC_008312.1_3039942_3040101_-	NA|565aa|up_9|NC_008312.1_3018029_3019724_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|51aa|up_8|NC_008312.1_3020410_3020563_-	NA	NA|68aa|up_7|NC_008312.1_3022167_3022371_-	NA	NA|74aa|up_6|NC_008312.1_3023055_3023277_-	pfam02941, FeThRed_A, Ferredoxin thioredoxin reductase variable alpha chain	NA|646aa|up_5|NC_008312.1_3023399_3025337_-	PRK14559, PRK14559, serine/threonine phosphatase	NA|155aa|up_4|NC_008312.1_3025996_3026461_+	COG1585, COG1585, Membrane protein implicated in regulation of membrane protease activity [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]	NA|322aa|up_3|NC_008312.1_3026641_3027607_+	COG0330, HflC, Membrane protease subunits, stomatin/prohibitin homologs [Posttranslational modification, protein turnover, chaperones]	NA|132aa|up_2|NC_008312.1_3027868_3028264_-	COG0735, Fur, Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]	NA|88aa|up_1|NC_008312.1_3028347_3028611_+	PRK05974, PRK05974, phosphoribosylformylglycinamidine synthase subunit PurS; Reviewed	NA|232aa|up_0|NC_008312.1_3028863_3029559_+	PRK03619, PRK03619, phosphoribosylformylglycinamidine synthase subunit PurQ	NA|411aa|down_0|NC_008312.1_3031976_3033209_-	PRK07598, PRK07598, RNA polymerase sigma factor SigC; Validated	NA|293aa|down_1|NC_008312.1_3033920_3034799_-	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|171aa|down_2|NC_008312.1_3035690_3036203_+	pfam01475, FUR, Ferric uptake regulator family	NA|488aa|down_3|NC_008312.1_3036359_3037823_-	COG3409, COG3409, Putative peptidoglycan-binding domain-containing protein [Cell envelope biogenesis, outer membrane]	NA|53aa|down_4|NC_008312.1_3039942_3040101_-	NA	NA|228aa|down_5|NC_008312.1_3040868_3041552_-	PRK00026, trmD, tRNA (guanine-N(1)-)-methyltransferase; Reviewed	NA|292aa|down_6|NC_008312.1_3042693_3043569_+	TIGR02069, cyanophycinase, cyanophycinase	NA|903aa|down_7|NC_008312.1_3043879_3046588_+	TIGR02068, Cyanophycin_synthetase, cyanophycin synthetase	NA|48aa|down_8|NC_008312.1_3047083_3047227_-	PLN00014, PLN00014, light-harvesting-like protein 3; Provisional	NA|863aa|down_9|NC_008312.1_3048339_3050928_+	cd13653, PBP2_phosphate_like_1, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	26	3607264-3607379	21	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CAATATATAGTTTAAGTAAGAATAATCAGGCTAATAAAG	39	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|75aa|up_1|NC_008312.1_3605595_3605820_+,NA|61aa|down_2|NC_008312.1_3609392_3609575_+,NA|265aa|down_8|NC_008312.1_3618886_3619681_-	NA|55aa|up_9|NC_008312.1_3597654_3597819_+	PRK09368, PRK09368, gas vesicle structural protein GvpA	NA|76aa|up_8|NC_008312.1_3598478_3598706_+	PRK09371, PRK09371, gas vesicle structural protein GvpA	NA|73aa|up_7|NC_008312.1_3599248_3599467_+	PRK09371, PRK09371, gas vesicle structural protein GvpA	NA|191aa|up_6|NC_008312.1_3600514_3601087_+	NF012221, MARTX_Nterm, MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin RtxA	NA|628aa|up_5|NC_008312.1_3601649_3603533_+	cd02035, ArsA, Arsenical pump-driving ATPase ArsA	NA|139aa|up_4|NC_008312.1_3603599_3604016_+	pfam05667, DUF812, Protein of unknown function (DUF812)	NA|312aa|up_3|NC_008312.1_3604201_3605137_+	TIGR02640, Gas_vesicle_protein_GvpN, gas vesicle protein GvpN	NA|104aa|up_2|NC_008312.1_3605133_3605445_+	pfam00741, Gas_vesicle, Gas vesicle protein	NA|75aa|up_1|NC_008312.1_3605595_3605820_+	NA	NA|154aa|up_0|NC_008312.1_3606013_3606475_+	pfam05121, GvpK, Gas vesicle protein K	NA|312aa|down_0|NC_008312.1_3608131_3609067_+	TIGR02640, Gas_vesicle_protein_GvpN, gas vesicle protein GvpN	NA|105aa|down_1|NC_008312.1_3609063_3609378_+	pfam00741, Gas_vesicle, Gas vesicle protein	NA|61aa|down_2|NC_008312.1_3609392_3609575_+	NA	NA|640aa|down_3|NC_008312.1_3610553_3612473_-	TIGR01243, Cell_division_cycle_protein_48_homolog_MJ1156, AAA family ATPase, CDC48 subfamily	NA|86aa|down_4|NC_008312.1_3613721_3613979_-	pfam05120, GvpG, Gas vesicle protein G	NA|267aa|down_5|NC_008312.1_3614310_3615111_+	pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF	NA|226aa|down_6|NC_008312.1_3615411_3616089_+	pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF	NA|157aa|down_7|NC_008312.1_3617963_3618434_-	PRK05422, smpB, SsrA-binding protein SmpB	NA|265aa|down_8|NC_008312.1_3618886_3619681_-	NA	NA|110aa|down_9|NC_008312.1_3619957_3620287_-	pfam11282, DUF3082, Protein of unknown function (DUF3082)
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	27	3627622-3627843	22	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GCAGGTGGTAGGGTTGAATTTCTACTAAGGTTCATTGGTGCTTTTATTTATTTA	54	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|265aa|up_5|NC_008312.1_3618886_3619681_-,NA|184aa|up_2|NC_008312.1_3622432_3622984_-,NA	NA|86aa|up_9|NC_008312.1_3613721_3613979_-	pfam05120, GvpG, Gas vesicle protein G	NA|267aa|up_8|NC_008312.1_3614310_3615111_+	pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF	NA|226aa|up_7|NC_008312.1_3615411_3616089_+	pfam06386, GvpL_GvpF, Gas vesicle synthesis protein GvpL/GvpF	NA|157aa|up_6|NC_008312.1_3617963_3618434_-	PRK05422, smpB, SsrA-binding protein SmpB	NA|265aa|up_5|NC_008312.1_3618886_3619681_-	NA	NA|110aa|up_4|NC_008312.1_3619957_3620287_-	pfam11282, DUF3082, Protein of unknown function (DUF3082)	NA|414aa|up_3|NC_008312.1_3620966_3622208_-	cd10798, GH57N_like_1, Uncharacterized subfamily of  glycoside hydrolase family 57 (GH57)	NA|184aa|up_2|NC_008312.1_3622432_3622984_-	NA	NA|310aa|up_1|NC_008312.1_3624339_3625269_+	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|433aa|up_0|NC_008312.1_3626090_3627389_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|650aa|down_0|NC_008312.1_3628412_3630362_-	sd00006, TPR, Tetratricopeptide repeat	NA|187aa|down_1|NC_008312.1_3631291_3631852_-	pfam14218, COP23, Circadian oscillating protein COP23	NA|206aa|down_2|NC_008312.1_3632058_3632676_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|469aa|down_3|NC_008312.1_3633779_3635186_-	TIGR00400, mgtE, Mg2+ transporter (mgtE)	NA|542aa|down_4|NC_008312.1_3635608_3637234_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|596aa|down_5|NC_008312.1_3638103_3639891_+	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|770aa|down_6|NC_008312.1_3640630_3642940_-	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|113aa|down_7|NC_008312.1_3643249_3643588_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|125aa|down_8|NC_008312.1_3643589_3643964_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|66aa|down_9|NC_008312.1_3644125_3644323_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	28	3650625-3650737	23	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	ATCTGAGTTATATGTGGGATAATTATA	27	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|68aa|up_2|NC_008312.1_3646964_3647168_-,NA|72aa|down_1|NC_008312.1_3653337_3653553_+,NA|73aa|down_3|NC_008312.1_3656183_3656402_-,NA|71aa|down_6|NC_008312.1_3659572_3659785_-,NA|232aa|down_7|NC_008312.1_3659860_3660556_-	NA|542aa|up_9|NC_008312.1_3635608_3637234_-	COG0025, NhaP, NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism]	NA|596aa|up_8|NC_008312.1_3638103_3639891_+	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|770aa|up_7|NC_008312.1_3640630_3642940_-	pfam02624, YcaO, YcaO cyclodehydratase, ATP-ad Mg2+-binding	NA|113aa|up_6|NC_008312.1_3643249_3643588_-	pfam18480, DUF5615, Domain of unknown function (DUF5615)	NA|125aa|up_5|NC_008312.1_3643589_3643964_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|66aa|up_4|NC_008312.1_3644125_3644323_+	cd16382, XisI-like, XisI is FdxN element excision controlling factor protein	NA|476aa|up_3|NC_008312.1_3644580_3646008_+	TIGR03605, antibiot_sagB, SagB-type dehydrogenase domain	NA|68aa|up_2|NC_008312.1_3646964_3647168_-	NA	NA|667aa|up_1|NC_008312.1_3647883_3649884_+	TIGR03895, hypothetical_protein, cyanobactin maturation protease, PatA/PatG family	NA|71aa|up_0|NC_008312.1_3650113_3650326_-	TIGR04447, hypothetical_protein, cyanobactin cluster PatC/TenC/TruC protein	NA|703aa|down_0|NC_008312.1_3651036_3653145_+	TIGR03895, hypothetical_protein, cyanobactin maturation protease, PatA/PatG family	NA|72aa|down_1|NC_008312.1_3653337_3653553_+	NA	NA|523aa|down_2|NC_008312.1_3654023_3655592_-	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|73aa|down_3|NC_008312.1_3656183_3656402_-	NA	NA|255aa|down_4|NC_008312.1_3656449_3657214_-	pfam13672, PP2C_2, Protein phosphatase 2C	NA|221aa|down_5|NC_008312.1_3657637_3658300_-	COG4245, TerY, Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]	NA|71aa|down_6|NC_008312.1_3659572_3659785_-	NA	NA|232aa|down_7|NC_008312.1_3659860_3660556_-	NA	NA|656aa|down_8|NC_008312.1_3660871_3662839_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|441aa|down_9|NC_008312.1_3663416_3664739_+	COG3597, COG3597, Uncharacterized protein/domain associated with GTPases [Function unknown]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	29	3671484-3671793	24	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AATATTAATAGTGCTCAAGCTAAGGTTGACATTGCTG	37	1	1	3671521-3671546	NC_008312.1_3671479-3671504	NA	4	4	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|73aa|up_8|NC_008312.1_3656183_3656402_-,NA|71aa|up_5|NC_008312.1_3659572_3659785_-,NA|232aa|up_4|NC_008312.1_3659860_3660556_-,NA|346aa|up_1|NC_008312.1_3665249_3666287_+,NA|290aa|up_0|NC_008312.1_3668885_3669755_+,NA|107aa|down_1|NC_008312.1_3676130_3676451_+,NA|81aa|down_3|NC_008312.1_3679357_3679600_-,NA|82aa|down_7|NC_008312.1_3686720_3686966_-,NA|89aa|down_8|NC_008312.1_3686925_3687192_-,NA|66aa|down_9|NC_008312.1_3687914_3688112_+	NA|523aa|up_9|NC_008312.1_3654023_3655592_-	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|73aa|up_8|NC_008312.1_3656183_3656402_-	NA	NA|255aa|up_7|NC_008312.1_3656449_3657214_-	pfam13672, PP2C_2, Protein phosphatase 2C	NA|221aa|up_6|NC_008312.1_3657637_3658300_-	COG4245, TerY, Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only]	NA|71aa|up_5|NC_008312.1_3659572_3659785_-	NA	NA|232aa|up_4|NC_008312.1_3659860_3660556_-	NA	NA|656aa|up_3|NC_008312.1_3660871_3662839_+	PRK00174, PRK00174, acetyl-CoA synthetase; Provisional	NA|441aa|up_2|NC_008312.1_3663416_3664739_+	COG3597, COG3597, Uncharacterized protein/domain associated with GTPases [Function unknown]	NA|346aa|up_1|NC_008312.1_3665249_3666287_+	NA	NA|290aa|up_0|NC_008312.1_3668885_3669755_+	NA	NA|1045aa|down_0|NC_008312.1_3672571_3675706_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|107aa|down_1|NC_008312.1_3676130_3676451_+	NA	NA|636aa|down_2|NC_008312.1_3676729_3678637_+	COG3596, COG3596, Predicted GTPase [General function prediction only]	NA|81aa|down_3|NC_008312.1_3679357_3679600_-	NA	NA|220aa|down_4|NC_008312.1_3679707_3680367_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|1422aa|down_5|NC_008312.1_3680799_3685065_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|236aa|down_6|NC_008312.1_3685332_3686040_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|82aa|down_7|NC_008312.1_3686720_3686966_-	NA	NA|89aa|down_8|NC_008312.1_3686925_3687192_-	NA	NA|66aa|down_9|NC_008312.1_3687914_3688112_+	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	30	4031033-4031148	25	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AAAAAAATTAACTATTGGATACCTCAAAAAAAC	33	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|124aa|up_5|NC_008312.1_4023774_4024146_-,NA|135aa|down_1|NC_008312.1_4034053_4034458_+,NA|147aa|down_2|NC_008312.1_4034599_4035040_-,NA|50aa|down_5|NC_008312.1_4037877_4038027_-,NA|50aa|down_6|NC_008312.1_4038176_4038326_-,NA|362aa|down_7|NC_008312.1_4039901_4040987_+	NA|211aa|up_9|NC_008312.1_4018527_4019160_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|211aa|up_8|NC_008312.1_4019273_4019906_-	pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily	NA|199aa|up_7|NC_008312.1_4020220_4020817_-	cd02109, arch_bact_SO_family_Moco, bacterial and archael members of the sulfite oxidase (SO) family of molybdopterin binding domains	NA|410aa|up_6|NC_008312.1_4022247_4023477_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|124aa|up_5|NC_008312.1_4023774_4024146_-	NA	NA|237aa|up_4|NC_008312.1_4024834_4025545_+	PRK05581, PRK05581, ribulose-phosphate 3-epimerase; Validated	NA|250aa|up_3|NC_008312.1_4025709_4026459_-	COG1651, DsbG, Protein-disulfide isomerase [Posttranslational modification, protein turnover, chaperones]	NA|457aa|up_2|NC_008312.1_4026712_4028083_-	pfam06271, RDD, RDD family	NA|416aa|up_1|NC_008312.1_4028144_4029392_-	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|198aa|up_0|NC_008312.1_4029616_4030210_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|633aa|down_0|NC_008312.1_4031580_4033479_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|135aa|down_1|NC_008312.1_4034053_4034458_+	NA	NA|147aa|down_2|NC_008312.1_4034599_4035040_-	NA	NA|267aa|down_3|NC_008312.1_4035390_4036191_+	pfam00702, Hydrolase, haloacid dehalogenase-like hydrolase	NA|405aa|down_4|NC_008312.1_4036499_4037714_+	smart00933, NurA, NurA nuclease	NA|50aa|down_5|NC_008312.1_4037877_4038027_-	NA	NA|50aa|down_6|NC_008312.1_4038176_4038326_-	NA	NA|362aa|down_7|NC_008312.1_4039901_4040987_+	NA	NA|601aa|down_8|NC_008312.1_4041421_4043224_+	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|624aa|down_9|NC_008312.1_4043561_4045433_+	cd16383, GUN4, porphyrin-binding protein domain GUN4
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	31	4129226-4129391	26	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AAGTTTTGAGTAGACTAATAGATTGATTTTATGAATAGCTTCGTAAGTTAAAT	53	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|129aa|up_7|NC_008312.1_4113225_4113612_+,NA|70aa|down_6|NC_008312.1_4141096_4141306_-,NA|125aa|down_9|NC_008312.1_4144887_4145262_-	NA|460aa|up_9|NC_008312.1_4108472_4109852_-	cd07136, ALDH_YwdH-P39616, Bacillus subtilis aldehyde dehydrogenase ywdH-like	NA|751aa|up_8|NC_008312.1_4110198_4112451_-	COG0317, SpoT, Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription]	NA|129aa|up_7|NC_008312.1_4113225_4113612_+	NA	NA|204aa|up_6|NC_008312.1_4113608_4114220_+	pfam13453, zf-TFIIB, Transcription factor zinc-finger	NA|553aa|up_5|NC_008312.1_4114447_4116106_+	COG1123, COG1123, ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only]	NA|203aa|up_4|NC_008312.1_4118495_4119104_+	cd02137, MhqN-like, nitroreductase family protein similar to the NAD(P)H nitroreductase MhqN	NA|426aa|up_3|NC_008312.1_4122021_4123299_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|354aa|up_2|NC_008312.1_4123683_4124745_+	cd06853, GT_WecA_like, This subfamily contains Escherichia coli WecA, Bacillus subtilis TagO and related proteins	NA|426aa|up_1|NC_008312.1_4126185_4127463_+	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|149aa|up_0|NC_008312.1_4127923_4128370_+	PRK05395, PRK05395, type II 3-dehydroquinate dehydratase	NA|380aa|down_0|NC_008312.1_4129511_4130651_+	pfam03747, ADP_ribosyl_GH, ADP-ribosylglycohydrolase	NA|837aa|down_1|NC_008312.1_4130681_4133192_+	COG1480, COG1480, Predicted membrane-associated HD superfamily hydrolase [General function prediction only]	NA|244aa|down_2|NC_008312.1_4133384_4134116_-	PRK00024, PRK00024, DNA repair protein RadC	NA|182aa|down_3|NC_008312.1_4134326_4134872_+	cd02232, cupin_ARD, acireductone dioxygenase (ARD), cupin domain	NA|346aa|down_4|NC_008312.1_4135836_4136874_+	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|829aa|down_5|NC_008312.1_4138434_4140921_-	pfam12770, CHAT, CHAT domain	NA|70aa|down_6|NC_008312.1_4141096_4141306_-	NA	NA|414aa|down_7|NC_008312.1_4142264_4143506_-	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|341aa|down_8|NC_008312.1_4143819_4144842_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|125aa|down_9|NC_008312.1_4144887_4145262_-	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	32	4183016-4183244	1	PILER-CR	no	DinG	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Type IV-A	TATTAAATGGTGGTCAAGGAAATGATATTTTAAATGGTAAT	41	0	0	NA	NA	NA	2	2	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA,NA|312aa|down_5|NC_008312.1_4199214_4200150_-	NA|378aa|up_9|NC_008312.1_4166809_4167943_-	cd00616, AHBA_syn, 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn)	NA|361aa|up_8|NC_008312.1_4169524_4170607_+	PRK09354, recA, recombinase A; Provisional	NA|292aa|up_7|NC_008312.1_4170842_4171718_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|122aa|up_6|NC_008312.1_4172627_4172993_-	pfam08844, DUF1815, Domain of unknown function (DUF1815)	NA|71aa|up_5|NC_008312.1_4173785_4173998_-	pfam10999, DUF2839, Protein of unknown function (DUF2839)	DinG|513aa|up_4|NC_008312.1_4174531_4176070_-	COG1199, DinG, Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair]	NA|518aa|up_3|NC_008312.1_4176774_4178328_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|283aa|up_2|NC_008312.1_4178364_4179213_+	PRK07417, PRK07417, prephenate/arogenate dehydrogenase	NA|354aa|up_1|NC_008312.1_4180081_4181143_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|184aa|up_0|NC_008312.1_4181554_4182106_-	cd03017, PRX_BCP, Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of  thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides	NA|518aa|down_0|NC_008312.1_4184502_4186056_+	cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) similar ot the Spinach thylakoid lumen protein TLP40	NA|232aa|down_1|NC_008312.1_4186207_4186903_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|1487aa|down_2|NC_008312.1_4187461_4191922_-	cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis	NA|980aa|down_3|NC_008312.1_4192356_4195296_-	pfam04577, DUF563, Protein of unknown function (DUF563)	NA|328aa|down_4|NC_008312.1_4196544_4197528_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|312aa|down_5|NC_008312.1_4199214_4200150_-	NA	NA|303aa|down_6|NC_008312.1_4201887_4202796_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|413aa|down_7|NC_008312.1_4203101_4204340_+	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|167aa|down_8|NC_008312.1_4204509_4205010_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|86aa|down_9|NC_008312.1_4205398_4205656_+	pfam13239, 2TM, 2TM domain
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	33	4206483-4206579	27	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TCAGAAATTACGGATATTTTATTGGAAAAGTTT	33	1	9	4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546|4206516-4206546	NC_008312.1_4208808-4208838|NC_008312.1_4206720-4206750|NC_008312.1_4206924-4206954|NC_008312.1_4207128-4207158|NC_008312.1_4207788-4207818|NC_008312.1_4207992-4208022|NC_008312.1_4208196-4208226|NC_008312.1_4208604-4208634|NC_008312.1_4209012-4209042	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|312aa|up_5|NC_008312.1_4199214_4200150_-,NA|336aa|down_3|NC_008312.1_4213177_4214185_-,NA|279aa|down_6|NC_008312.1_4220714_4221551_+	NA|232aa|up_9|NC_008312.1_4186207_4186903_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|1487aa|up_8|NC_008312.1_4187461_4191922_-	cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis	NA|980aa|up_7|NC_008312.1_4192356_4195296_-	pfam04577, DUF563, Protein of unknown function (DUF563)	NA|328aa|up_6|NC_008312.1_4196544_4197528_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|312aa|up_5|NC_008312.1_4199214_4200150_-	NA	NA|303aa|up_4|NC_008312.1_4201887_4202796_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|413aa|up_3|NC_008312.1_4203101_4204340_+	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|167aa|up_2|NC_008312.1_4204509_4205010_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|86aa|up_1|NC_008312.1_4205398_4205656_+	pfam13239, 2TM, 2TM domain	NA|103aa|up_0|NC_008312.1_4205706_4206015_+	pfam11378, DUF3181, Protein of unknown function (DUF3181)	NA|388aa|down_0|NC_008312.1_4209105_4210269_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|198aa|down_1|NC_008312.1_4210708_4211302_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|98aa|down_2|NC_008312.1_4211320_4211614_-	cd01339, LDH-like_MDH, L-lactate dehydrogenase-like malate dehydrogenase proteins	NA|336aa|down_3|NC_008312.1_4213177_4214185_-	NA	NA|490aa|down_4|NC_008312.1_4214708_4216178_+	pfam14356, DUF4403, Domain of unknown function (DUF4403)	NA|540aa|down_5|NC_008312.1_4218704_4220324_-	cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|279aa|down_6|NC_008312.1_4220714_4221551_+	NA	NA|251aa|down_7|NC_008312.1_4222508_4223261_+	PRK00347, PRK00347, DNA/RNA nuclease SfsA	NA|270aa|down_8|NC_008312.1_4224521_4225331_+	COG1864, NUC1, DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism]	NA|66aa|down_9|NC_008312.1_4225652_4225850_-	pfam07924, NuiA, Nuclease A inhibitor-like protein
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	34	4303513-4303584	28	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTTCCACTCATACCTGTTACACT	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|334aa|up_5|NC_008312.1_4289907_4290909_-,NA|286aa|down_9|NC_008312.1_4319924_4320782_-	NA|210aa|up_9|NC_008312.1_4283569_4284199_+	pfam04313, HSDR_N, Type I restriction enzyme R protein N-terminus (HSDR_N)	NA|892aa|up_8|NC_008312.1_4284495_4287171_-	sd00006, TPR, Tetratricopeptide repeat	NA|412aa|up_7|NC_008312.1_4287743_4288979_-	COG3825, COG3825, Uncharacterized protein conserved in bacteria [Function unknown]	NA|316aa|up_6|NC_008312.1_4288975_4289923_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|334aa|up_5|NC_008312.1_4289907_4290909_-	NA	NA|982aa|up_4|NC_008312.1_4291520_4294466_-	cd06268, PBP1_ABC_transporter_LIVBP-like, periplasmic binding domain of ATP-binding cassette transporter-like systems that belong to the type 1 periplasmic binding fold protein superfamily	NA|455aa|up_3|NC_008312.1_4294716_4296081_-	pfam08811, DUF1800, Protein of unknown function (DUF1800)	NA|689aa|up_2|NC_008312.1_4296675_4298742_-	PRK12740, PRK12740, elongation factor G-like protein EF-G2	NA|449aa|up_1|NC_008312.1_4299818_4301165_-	pfam00067, p450, Cytochrome P450	NA|491aa|up_0|NC_008312.1_4301647_4303120_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|495aa|down_0|NC_008312.1_4303654_4305139_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|114aa|down_1|NC_008312.1_4306233_4306575_+	COG4995, COG4995, Uncharacterized protein conserved in bacteria [Function unknown]	NA|1373aa|down_2|NC_008312.1_4307060_4311179_-	cd08602, GDPD_ScGlpQ1_like, Glycerophosphodiester phosphodiesterase domain of Streptomycin coelicolor (GlpQ1) and similar proteins	NA|349aa|down_3|NC_008312.1_4313546_4314593_+	pfam00520, Ion_trans, Ion transport protein	NA|245aa|down_4|NC_008312.1_4314811_4315546_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|263aa|down_5|NC_008312.1_4315772_4316561_-	TIGR02454, Uncharacterized_protein_MJ1089, cobalt ECF transporter T component CbiQ	NA|215aa|down_6|NC_008312.1_4316787_4317432_-	PRK06265, PRK06265, cobalt transporter CbiM	NA|169aa|down_7|NC_008312.1_4317525_4318032_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|346aa|down_8|NC_008312.1_4318431_4319469_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|286aa|down_9|NC_008312.1_4319924_4320782_-	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	35	4398820-4398952	29	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AGATAAAAATTCCCCAGTAGCGATCGCCGCTCAAAAA	37	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|78aa|up_9|NC_008312.1_4382743_4382977_+,NA|222aa|up_5|NC_008312.1_4388851_4389517_-,NA|112aa|up_4|NC_008312.1_4391623_4391959_-,NA|52aa|up_3|NC_008312.1_4392159_4392315_-,NA|71aa|down_2|NC_008312.1_4403665_4403878_+	NA|78aa|up_9|NC_008312.1_4382743_4382977_+	NA	NA|224aa|up_8|NC_008312.1_4384040_4384712_-	COG4340, COG4340, Uncharacterized protein conserved in bacteria [Function unknown]	NA|255aa|up_7|NC_008312.1_4385467_4386232_-	COG3836, HpcH, 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase [Carbohydrate transport and metabolism]	NA|522aa|up_6|NC_008312.1_4387207_4388773_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|222aa|up_5|NC_008312.1_4388851_4389517_-	NA	NA|112aa|up_4|NC_008312.1_4391623_4391959_-	NA	NA|52aa|up_3|NC_008312.1_4392159_4392315_-	NA	NA|452aa|up_2|NC_008312.1_4392553_4393909_+	cd13149, MATE_like_2, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|710aa|up_1|NC_008312.1_4394348_4396478_+	COG4248, COG4248, Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains [General function prediction only]	NA|594aa|up_0|NC_008312.1_4396696_4398478_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|619aa|down_0|NC_008312.1_4399305_4401162_+	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|172aa|down_1|NC_008312.1_4402323_4402839_+	cd06121, cupin_YML079wp, Saccharomyces cerevisiae YML079wp and related proteins, cupin domain	NA|71aa|down_2|NC_008312.1_4403665_4403878_+	NA	NA|383aa|down_3|NC_008312.1_4404603_4405752_+	cd13661, PBP2_PotD_PotF_like_1, The periplasmic substrate-binding component of an uncharacterized active transport system closely related to spermidine and putrescine transporters; contains the type 2 periplasmic binding fold	NA|187aa|down_4|NC_008312.1_4405802_4406363_-	PRK00300, gmk, guanylate kinase; Provisional	NA|84aa|down_5|NC_008312.1_4406495_4406747_-	PRK04323, PRK04323, hypothetical protein; Provisional	NA|584aa|down_6|NC_008312.1_4406935_4408687_+	pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA)	NA|268aa|down_7|NC_008312.1_4409175_4409979_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|283aa|down_8|NC_008312.1_4410978_4411827_-	TIGR02821, S-formylglutathione_hydrolase, S-formylglutathione hydrolase	NA|210aa|down_9|NC_008312.1_4412469_4413099_+	cd00593, RIBOc, RIBOc
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	36	4479665-4479755	30	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AAAAGTTAAAATTATTAATTCCGA	24	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|109aa|up_3|NC_008312.1_4477043_4477370_-,NA|146aa|up_2|NC_008312.1_4477416_4477854_-,NA|84aa|up_0|NC_008312.1_4479398_4479650_-,NA|69aa|down_6|NC_008312.1_4493144_4493351_+,NA|119aa|down_8|NC_008312.1_4495868_4496225_+	NA|44aa|up_9|NC_008312.1_4471356_4471488_-	pfam02468, PsbN, Photosystem II reaction centre N protein (psbN)	NA|68aa|up_8|NC_008312.1_4471575_4471779_+	PRK02624, psbH, photosystem II reaction center protein PsbH	NA|88aa|up_7|NC_008312.1_4471892_4472156_+	PRK14857, tatA, TatA/E family twin arginine-targeting protein translocase	NA|220aa|up_6|NC_008312.1_4472263_4472923_+	PRK05426, PRK05426, peptidyl-tRNA hydrolase; Provisional	NA|89aa|up_5|NC_008312.1_4473755_4474022_+	pfam11344, DUF3146, Protein of unknown function (DUF3146)	NA|572aa|up_4|NC_008312.1_4474214_4475930_+	PRK05380, pyrG, CTP synthetase; Validated	NA|109aa|up_3|NC_008312.1_4477043_4477370_-	NA	NA|146aa|up_2|NC_008312.1_4477416_4477854_-	NA	NA|304aa|up_1|NC_008312.1_4478486_4479398_-	COG1940, NagC, Transcriptional regulator/sugar kinase [Transcription / Carbohydrate transport and metabolism]	NA|84aa|up_0|NC_008312.1_4479398_4479650_-	NA	NA|1077aa|down_0|NC_008312.1_4479978_4483209_+	PRK05294, carB, carbamoyl-phosphate synthase large subunit	NA|607aa|down_1|NC_008312.1_4484080_4485901_-	COG0370, FeoB, Fe2+ transport system protein B [Inorganic ion transport and metabolism]	NA|90aa|down_2|NC_008312.1_4486041_4486311_-	pfam04023, FeoA, FeoA domain	NA|343aa|down_3|NC_008312.1_4488287_4489316_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|125aa|down_4|NC_008312.1_4489708_4490083_+	pfam08853, DUF1823, Domain of unknown function (DUF1823)	NA|116aa|down_5|NC_008312.1_4490873_4491221_-	pfam10664, NdhM, Cyanobacterial and plastid NDH-1 subunit M	NA|69aa|down_6|NC_008312.1_4493144_4493351_+	NA	NA|177aa|down_7|NC_008312.1_4494783_4495314_+	COG2236, COG2236, Predicted phosphoribosyltransferases [General function prediction only]	NA|119aa|down_8|NC_008312.1_4495868_4496225_+	NA	NA|416aa|down_9|NC_008312.1_4498809_4500057_+	COG4908, COG4908, Uncharacterized protein containing a NRPS condensation (elongation) domain [General function prediction only]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	37	4605178-4605259	31	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GCCCCTAACCGTTATCTCATATT	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|154aa|up_0|NC_008312.1_4597744_4598206_+,NA|34aa|down_0|NC_008312.1_4605414_4605516_-,NA|178aa|down_2|NC_008312.1_4606541_4607075_-	NA|787aa|up_9|NC_008312.1_4578904_4581265_-	cd11301, Fut1_Fut2_like, Alpha-1,2-fucosyltransferase	NA|322aa|up_8|NC_008312.1_4581355_4582321_+	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|377aa|up_7|NC_008312.1_4584743_4585874_+	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|712aa|up_6|NC_008312.1_4588812_4590948_+	sd00006, TPR, Tetratricopeptide repeat	NA|187aa|up_5|NC_008312.1_4591369_4591930_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|152aa|up_4|NC_008312.1_4591986_4592442_-	pfam13565, HTH_32, Homeodomain-like domain	NA|305aa|up_3|NC_008312.1_4592698_4593613_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|356aa|up_2|NC_008312.1_4593895_4594963_-	COG1173, DppC, ABC-type dipeptide/oligopeptide/nickel transport systems, permease components [Amino acid transport and metabolism / Inorganic ion transport and metabolism]	NA|311aa|up_1|NC_008312.1_4596039_4596972_-	PRK00089, era, GTPase Era; Reviewed	NA|154aa|up_0|NC_008312.1_4597744_4598206_+	NA	NA|34aa|down_0|NC_008312.1_4605414_4605516_-	NA	NA|42aa|down_1|NC_008312.1_4605830_4605956_+	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|178aa|down_2|NC_008312.1_4606541_4607075_-	NA	NA|293aa|down_3|NC_008312.1_4609990_4610869_+	pfam03881, Fructosamin_kin, Fructosamine kinase	NA|332aa|down_4|NC_008312.1_4612246_4613242_-	pfam00891, Methyltransf_2, O-methyltransferase	NA|1332aa|down_5|NC_008312.1_4614965_4618961_-	PRK12467, PRK12467, peptide synthase; Provisional	NA|323aa|down_6|NC_008312.1_4619068_4620037_-	COG2084, MmsB, 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism]	NA|362aa|down_7|NC_008312.1_4620564_4621650_-	PRK01372, ddl, D-alanine--D-alanine ligase; Reviewed	NA|346aa|down_8|NC_008312.1_4621710_4622748_-	COG1181, DdlA, D-alanine-D-alanine ligase and related ATP-grasp enzymes [Cell envelope biogenesis, outer membrane]	NA|364aa|down_9|NC_008312.1_4622977_4624069_-	PLN00139, PLN00139, hypothetical protein; Provisional
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	38	5739022-5739126	32	CRISPRCasFinder	no	cas3	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	GCTAGTAAATATTAGCAGTTTTTGTCTGACCAAT	34	0	0	NA	NA	NA	1	1	Unclear	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|51aa|up_5|NC_008312.1_5727948_5728101_+,NA|317aa|down_2|NC_008312.1_5743981_5744932_+,NA|69aa|down_3|NC_008312.1_5745007_5745214_+,NA|82aa|down_4|NC_008312.1_5745245_5745491_-,NA|230aa|down_5|NC_008312.1_5745693_5746383_+,NA|46aa|down_7|NC_008312.1_5747935_5748073_+	NA|73aa|up_9|NC_008312.1_5716919_5717138_+	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|675aa|up_8|NC_008312.1_5717511_5719536_+	COG0557, VacB, Exoribonuclease R [Transcription]	NA|258aa|up_7|NC_008312.1_5722163_5722937_-	PRK00748, PRK00748, 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase; Validated	cas3|496aa|up_6|NC_008312.1_5726338_5727826_-	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|51aa|up_5|NC_008312.1_5727948_5728101_+	NA	NA|442aa|up_4|NC_008312.1_5730127_5731453_-	TIGR01125, Ribosomal_protein_S12_methylthiotransferase_RimO, ribosomal protein S12 methylthiotransferase RimO	NA|303aa|up_3|NC_008312.1_5732788_5733697_+	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|146aa|up_2|NC_008312.1_5733995_5734433_+	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|125aa|up_1|NC_008312.1_5736616_5736991_-	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|61aa|up_0|NC_008312.1_5737923_5738106_+	cd13989, STKc_IKK, Catalytic domain of the Serine/Threonine kinase, Inhibitor of Nuclear Factor-KappaB Kinase (IKK)	NA|46aa|down_0|NC_008312.1_5740029_5740167_+	pfam13592, HTH_33, Winged helix-turn helix	NA|35aa|down_1|NC_008312.1_5740483_5740588_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|317aa|down_2|NC_008312.1_5743981_5744932_+	NA	NA|69aa|down_3|NC_008312.1_5745007_5745214_+	NA	NA|82aa|down_4|NC_008312.1_5745245_5745491_-	NA	NA|230aa|down_5|NC_008312.1_5745693_5746383_+	NA	NA|209aa|down_6|NC_008312.1_5747002_5747629_-	PRK05953, PRK05953, Precorrin-8X methylmutase	NA|46aa|down_7|NC_008312.1_5747935_5748073_+	NA	NA|218aa|down_8|NC_008312.1_5748620_5749274_-	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|337aa|down_9|NC_008312.1_5749798_5750809_+	sd00006, TPR, Tetratricopeptide repeat
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	39	5774650-5774836	2	PILER-CR	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GGTGCTTCAGAAAATGAATGAATTAACTATTGCGCAATAGTTGGAA	46	0	0	NA	NA	NA	2	2	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|72aa|up_5|NC_008312.1_5770498_5770714_+,NA|552aa|down_0|NC_008312.1_5775406_5777062_-,NA|48aa|down_4|NC_008312.1_5785913_5786057_+	NA|2491aa|up_9|NC_008312.1_5754716_5762189_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|570aa|up_8|NC_008312.1_5762329_5764039_-	sd00006, TPR, Tetratricopeptide repeat	NA|614aa|up_7|NC_008312.1_5764285_5766127_+	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|47aa|up_6|NC_008312.1_5766298_5766439_+	pfam07282, OrfB_Zn_ribbon, Putative transposase DNA-binding domain	NA|72aa|up_5|NC_008312.1_5770498_5770714_+	NA	NA|296aa|up_4|NC_008312.1_5770703_5771591_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|51aa|up_3|NC_008312.1_5771979_5772132_-	pfam05145, AbrB, Transition state regulatory protein AbrB	NA|463aa|up_2|NC_008312.1_5772258_5773647_-	COG0119, LeuA, Isopropylmalate/homocitrate/citramalate synthases [Amino acid transport and metabolism]	NA|83aa|up_1|NC_008312.1_5773772_5774021_-	pfam13561, adh_short_C2, Enoyl-(Acyl carrier protein) reductase	NA|112aa|up_0|NC_008312.1_5774036_5774372_-	PRK12825, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|552aa|down_0|NC_008312.1_5775406_5777062_-	NA	NA|39aa|down_1|NC_008312.1_5781862_5781979_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|346aa|down_2|NC_008312.1_5782719_5783757_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|129aa|down_3|NC_008312.1_5784150_5784537_+	pfam08870, DndE, DNA sulphur modification protein DndE	NA|48aa|down_4|NC_008312.1_5785913_5786057_+	NA	NA|279aa|down_5|NC_008312.1_5787327_5788164_-	PRK06427, PRK06427, bifunctional hydroxy-methylpyrimidine kinase/ hydroxy-phosphomethylpyrimidine kinase; Reviewed	NA|424aa|down_6|NC_008312.1_5788221_5789493_-	PRK09330, PRK09330, cell division protein FtsZ; Validated	NA|279aa|down_7|NC_008312.1_5789682_5790519_-	COG1589, FtsQ, Cell division septal protein [Cell envelope biogenesis, outer membrane]	NA|456aa|down_8|NC_008312.1_5792468_5793836_+	pfam13191, AAA_16, AAA ATPase domain	NA|1859aa|down_9|NC_008312.1_5793962_5799539_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	40	5910066-5910395	3	PILER-CR	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CTCAGGAAATGGAGGAGCAATTAATTTGGATGCTGGTGGCAATATCACTACACAATCCCTCAAT	64	0	0	NA	NA	NA	3	3	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|68aa|up_2|NC_008312.1_5899521_5899725_+,NA|442aa|up_0|NC_008312.1_5904038_5905364_-,NA|229aa|down_7|NC_008312.1_5926432_5927119_+	NA|1454aa|up_9|NC_008312.1_5876967_5881329_+	cd05926, FACL_fum10p_like, Subfamily of fatty acid CoA ligase (FACL) similar to Fum10p of Gibberella moniliformis	NA|1910aa|up_8|NC_008312.1_5881442_5887172_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|1355aa|up_7|NC_008312.1_5887336_5891401_+	COG3321, COG3321, Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism]	NA|163aa|up_6|NC_008312.1_5891767_5892256_+	pfam13924, Lipocalin_5, Lipocalin-like domain	NA|494aa|up_5|NC_008312.1_5892399_5893881_+	COG2072, TrkA, Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism]	NA|562aa|up_4|NC_008312.1_5894130_5895816_+	pfam01764, Lipase_3, Lipase (class 3)	NA|640aa|up_3|NC_008312.1_5897085_5899005_+	pfam00743, FMO-like, Flavin-binding monooxygenase-like	NA|68aa|up_2|NC_008312.1_5899521_5899725_+	NA	NA|592aa|up_1|NC_008312.1_5900217_5901993_-	COG0661, AarF, Predicted unusual protein kinase [General function prediction only]	NA|442aa|up_0|NC_008312.1_5904038_5905364_-	NA	NA|253aa|down_0|NC_008312.1_5917265_5918024_-	cd16443, LplA, lipoate-protein ligase	NA|165aa|down_1|NC_008312.1_5918585_5919080_-	cd17036, T3SC_YbjN-like_1, T110839 is structurally similar to type III secretion system chaperones and YbjN family proteins	NA|364aa|down_2|NC_008312.1_5919176_5920268_-	COG2226, UbiE, Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism]	NA|474aa|down_3|NC_008312.1_5920769_5922191_-	TIGR00653, Glutamine_synthetase, glutamine synthetase, type I	NA|170aa|down_4|NC_008312.1_5922959_5923469_+	cd12126, APC_beta, Allophycocyanin beta subunit of the phycobilisome core	NA|285aa|down_5|NC_008312.1_5923743_5924598_+	COG1189, COG1189, Predicted rRNA methylase [Translation, ribosomal structure and biogenesis]	NA|139aa|down_6|NC_008312.1_5925253_5925670_+	pfam02668, TauD, Taurine catabolism dioxygenase TauD, TfdA family	NA|229aa|down_7|NC_008312.1_5926432_5927119_+	NA	NA|482aa|down_8|NC_008312.1_5927247_5928693_-	cd00880, Era_like, E	NA|493aa|down_9|NC_008312.1_5931919_5933398_-	pfam05128, DUF697, Domain of unknown function (DUF697)
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	41	6337044-6337430	4	PILER-CR	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	CAAGTAGCTTCTATGGTGTCCCAAGAA	27	0	0	NA	NA	NA	4	4	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|105aa|up_5|NC_008312.1_6328588_6328903_+,NA|102aa|up_0|NC_008312.1_6335522_6335828_+,NA	NA|711aa|up_9|NC_008312.1_6319090_6321223_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|360aa|up_8|NC_008312.1_6321623_6322703_-	PRK09196, PRK09196, fructose-bisphosphate aldolase class II	NA|216aa|up_7|NC_008312.1_6323890_6324538_+	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|589aa|up_6|NC_008312.1_6325133_6326900_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|105aa|up_5|NC_008312.1_6328588_6328903_+	NA	NA|539aa|up_4|NC_008312.1_6329322_6330939_-	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	NA|290aa|up_3|NC_008312.1_6331606_6332476_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|254aa|up_2|NC_008312.1_6332732_6333494_+	pfam00427, PBS_linker_poly, Phycobilisome Linker polypeptide	NA|174aa|up_1|NC_008312.1_6333691_6334213_+	pfam09367, CpeS, CpeS-like protein	NA|102aa|up_0|NC_008312.1_6335522_6335828_+	NA	NA|408aa|down_0|NC_008312.1_6338711_6339935_+	COG4175, ProV, ABC-type proline/glycine betaine transport system, ATPase component [Amino acid transport and metabolism]	NA|660aa|down_1|NC_008312.1_6340308_6342288_+	cd13638, PBP2_EcProx_like, Substrate binding domain of Escherichia coli betaine transport system-like; the type 2 periplasmic binding protein fold	NA|194aa|down_2|NC_008312.1_6342796_6343378_-	pfam09367, CpeS, CpeS-like protein	NA|279aa|down_3|NC_008312.1_6343753_6344590_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|290aa|down_4|NC_008312.1_6344999_6345869_-	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|242aa|down_5|NC_008312.1_6348238_6348964_+	pfam06967, Mo-nitro_C, Mo-dependent nitrogenase C-terminus	NA|353aa|down_6|NC_008312.1_6349314_6350373_+	PRK12299, obgE, GTPase CgtA; Reviewed	NA|170aa|down_7|NC_008312.1_6350461_6350971_-	COG4970, FimT, Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion]	NA|237aa|down_8|NC_008312.1_6351648_6352359_-	COG2165, PulG, Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion]	NA|339aa|down_9|NC_008312.1_6352421_6353438_-	COG4795, PulJ, Type II secretory pathway, component PulJ [Intracellular trafficking and secretion]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	42	6636390-6636481	33	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	AAATGAATCAATATTCTAGTTCT	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|54aa|up_6|NC_008312.1_6619053_6619215_+,NA|62aa|up_3|NC_008312.1_6623421_6623607_-,NA|571aa|down_1|NC_008312.1_6641687_6643400_-,NA|141aa|down_2|NC_008312.1_6644324_6644747_-,NA|53aa|down_5|NC_008312.1_6648830_6648989_-	NA|122aa|up_9|NC_008312.1_6615281_6615647_+	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|346aa|up_8|NC_008312.1_6615912_6616950_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|169aa|up_7|NC_008312.1_6616967_6617474_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|54aa|up_6|NC_008312.1_6619053_6619215_+	NA	NA|464aa|up_5|NC_008312.1_6619683_6621075_-	COG4250, COG4250, Predicted sensor protein/domain [Signal transduction mechanisms]	NA|443aa|up_4|NC_008312.1_6621693_6623022_+	pfam12576, DUF3754, Protein of unknown function (DUF3754)	NA|62aa|up_3|NC_008312.1_6623421_6623607_-	NA	NA|29aa|up_2|NC_008312.1_6623996_6624083_+	pfam13267, DUF4058, Protein of unknown function (DUF4058)	NA|179aa|up_1|NC_008312.1_6624978_6625515_+	TIGR02795, Uncharacterized_protein_in_oprL_3'region, tol-pal system protein YbgF	NA|1302aa|up_0|NC_008312.1_6626716_6630622_+	cd04277, ZnMc_serralysin_like, Zinc-dependent metalloprotease, serralysin_like subfamily	NA|365aa|down_0|NC_008312.1_6640563_6641658_-	COG0714, COG0714, MoxR-like ATPases [General function prediction only]	NA|571aa|down_1|NC_008312.1_6641687_6643400_-	NA	NA|141aa|down_2|NC_008312.1_6644324_6644747_-	NA	NA|357aa|down_3|NC_008312.1_6644782_6645853_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|429aa|down_4|NC_008312.1_6646718_6648005_-	COG0334, GdhA, Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]	NA|53aa|down_5|NC_008312.1_6648830_6648989_-	NA	NA|198aa|down_6|NC_008312.1_6650223_6650817_+	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|388aa|down_7|NC_008312.1_6650852_6652016_-	PLN02449, PLN02449, ferrochelatase	NA|213aa|down_8|NC_008312.1_6652209_6652848_+	pfam13649, Methyltransf_25, Methyltransferase domain	NA|379aa|down_9|NC_008312.1_6653982_6655119_+	COG1748, LYS9, Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	43	6697921-6698031	34	CRISPRCasFinder	no	WYL	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	GCGGAAGGCTCAGGAACATGCTTTGTT	27	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|73aa|up_9|NC_008312.1_6682092_6682311_+,NA|121aa|up_3|NC_008312.1_6691473_6691836_+,NA|61aa|down_0|NC_008312.1_6699115_6699298_+,NA|124aa|down_9|NC_008312.1_6712498_6712870_-	NA|73aa|up_9|NC_008312.1_6682092_6682311_+	NA	NA|332aa|up_8|NC_008312.1_6682447_6683443_-	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|174aa|up_7|NC_008312.1_6683810_6684332_-	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|506aa|up_6|NC_008312.1_6685741_6687259_+	cd02142, McbC_SagB-like_oxidoreductase, oxidase similar to the microcin B17 processing protein McbC	WYL|718aa|up_5|NC_008312.1_6688072_6690226_-	pfam13671, AAA_33, AAA domain	NA|94aa|up_4|NC_008312.1_6691170_6691452_+	COG1357, COG1357, Pentapeptide repeats containing protein [Function unknown]	NA|121aa|up_3|NC_008312.1_6691473_6691836_+	NA	NA|171aa|up_2|NC_008312.1_6692390_6692903_-	cd05379, CAP_bacterial, Bacterial CAP (cysteine-rich secretory proteins, antigen 5, and pathogenesis-related 1 proteins) domain proteins	NA|432aa|up_1|NC_008312.1_6693567_6694863_-	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|518aa|up_0|NC_008312.1_6695989_6697543_+	TIGR02733, similar_to_to_phytoene_dehydrogenase, C-3',4' desaturase CrtD	NA|61aa|down_0|NC_008312.1_6699115_6699298_+	NA	NA|466aa|down_1|NC_008312.1_6699592_6700990_-	PLN02518, PLN02518, pheophorbide a oxygenase	NA|30aa|down_2|NC_008312.1_6701529_6701619_-	pfam03742, PetN, PetN	NA|93aa|down_3|NC_008312.1_6702575_6702854_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|438aa|down_4|NC_008312.1_6703128_6704442_+	PRK07591, PRK07591, threonine synthase; Validated	NA|92aa|down_5|NC_008312.1_6704734_6705010_+	cd17074, Ubl_CysO_like, ubiquitin-like (Ubl) domain found in Mycobacterium tuberculosis CysO and similar proteins	NA|343aa|down_6|NC_008312.1_6705408_6706437_+	pfam13529, Peptidase_C39_2, Peptidase_C39 like family	NA|304aa|down_7|NC_008312.1_6707506_6708418_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|291aa|down_8|NC_008312.1_6710539_6711412_-	PRK07428, PRK07428, carboxylating nicotinate-nucleotide diphosphorylase	NA|124aa|down_9|NC_008312.1_6712498_6712870_-	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	44	6722416-6722496	35	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TTATATATGTTGGTGATATTTTG	23	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|124aa|up_7|NC_008312.1_6712498_6712870_-,NA|158aa|up_0|NC_008312.1_6721500_6721974_-,NA|119aa|down_2|NC_008312.1_6726972_6727329_-,NA|259aa|down_5|NC_008312.1_6731612_6732389_-,NA|52aa|down_6|NC_008312.1_6733853_6734009_+,NA|58aa|down_7|NC_008312.1_6734526_6734700_+,NA|38aa|down_8|NC_008312.1_6734775_6734889_+	NA|304aa|up_9|NC_008312.1_6707506_6708418_-	COG1262, COG1262, Uncharacterized conserved protein [Function unknown]	NA|291aa|up_8|NC_008312.1_6710539_6711412_-	PRK07428, PRK07428, carboxylating nicotinate-nucleotide diphosphorylase	NA|124aa|up_7|NC_008312.1_6712498_6712870_-	NA	NA|229aa|up_6|NC_008312.1_6713060_6713747_-	TIGR02702, transcriptional_regulator, iron-sulfur cluster biosynthesis transcriptional regulator SufR	NA|479aa|up_5|NC_008312.1_6713997_6715434_+	PRK11814, PRK11814, cysteine desulfurase activator complex subunit SufB; Provisional	NA|261aa|up_4|NC_008312.1_6716046_6716829_+	CHL00131, ycf16, sulfate ABC transporter protein; Validated	NA|454aa|up_3|NC_008312.1_6716834_6718196_+	COG0719, SufB, Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones]	NA|421aa|up_2|NC_008312.1_6718582_6719845_+	PLN02855, PLN02855, Bifunctional selenocysteine lyase/cysteine desulfurase	NA|126aa|up_1|NC_008312.1_6720750_6721128_+	pfam13783, DUF4177, Domain of unknown function (DUF4177)	NA|158aa|up_0|NC_008312.1_6721500_6721974_-	NA	NA|726aa|down_0|NC_008312.1_6723029_6725207_-	TIGR02917, TPR_domain_protein, putative PEP-CTERM system TPR-repeat lipoprotein	NA|535aa|down_1|NC_008312.1_6725214_6726819_-	pfam12770, CHAT, CHAT domain	NA|119aa|down_2|NC_008312.1_6726972_6727329_-	NA	NA|249aa|down_3|NC_008312.1_6727532_6728279_+	pfam04755, PAP_fibrillin, PAP_fibrillin	NA|664aa|down_4|NC_008312.1_6728534_6730526_+	cd11646, Precorrin_3B_C17_MT, Precorrin-3B C(17)-methyltransferase (also named CobJ or CbiH)	NA|259aa|down_5|NC_008312.1_6731612_6732389_-	NA	NA|52aa|down_6|NC_008312.1_6733853_6734009_+	NA	NA|58aa|down_7|NC_008312.1_6734526_6734700_+	NA	NA|38aa|down_8|NC_008312.1_6734775_6734889_+	NA	NA|332aa|down_9|NC_008312.1_6735475_6736471_+	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	45	6864476-6864889	7	CRT	no	PD-DExK	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	TGGCGTTNGGCGTGGCGT	18	1	1	6864494-6864511	NC_008312.1_6864878-6864895	NA	8	8	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|110aa|up_3|NC_008312.1_6860456_6860786_-,NA|131aa|up_2|NC_008312.1_6861232_6861625_-,PD-DExK|204aa|up_0|NC_008312.1_6863317_6863929_+,NA|368aa|down_0|NC_008312.1_6867811_6868915_+	NA|101aa|up_9|NC_008312.1_6849707_6850010_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|282aa|up_8|NC_008312.1_6850356_6851202_+	pfam02557, VanY, D-alanyl-D-alanine carboxypeptidase	NA|230aa|up_7|NC_008312.1_6851334_6852024_+	pfam01434, Peptidase_M41, Peptidase family M41	NA|1879aa|up_6|NC_008312.1_6852047_6857684_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|262aa|up_5|NC_008312.1_6858459_6859245_+	COG0811, TolQ, Biopolymer transport proteins [Intracellular trafficking and secretion]	NA|195aa|up_4|NC_008312.1_6859583_6860168_+	COG0848, ExbD, Biopolymer transport protein [Intracellular trafficking and secretion]	NA|110aa|up_3|NC_008312.1_6860456_6860786_-	NA	NA|131aa|up_2|NC_008312.1_6861232_6861625_-	NA	NA|367aa|up_1|NC_008312.1_6861867_6862968_+	pfam13401, AAA_22, AAA domain	PD-DExK|204aa|up_0|NC_008312.1_6863317_6863929_+	NA	NA|368aa|down_0|NC_008312.1_6867811_6868915_+	NA	NA|551aa|down_1|NC_008312.1_6869159_6870812_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|139aa|down_2|NC_008312.1_6871126_6871543_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|431aa|down_3|NC_008312.1_6872162_6873455_+	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|169aa|down_4|NC_008312.1_6873869_6874376_-	pfam04832, SOUL, SOUL heme-binding protein	NA|1248aa|down_5|NC_008312.1_6874975_6878719_+	pfam12770, CHAT, CHAT domain	NA|215aa|down_6|NC_008312.1_6879043_6879688_-	cd02145, BluB, 5,6-dimethylbenzimidazole synthase	NA|571aa|down_7|NC_008312.1_6880578_6882291_+	PRK09319, PRK09319, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase RibB/GTP cyclohydrolase II RibA	NA|85aa|down_8|NC_008312.1_6882817_6883072_+	pfam10985, DUF2805, Protein of unknown function (DUF2805)	NA|736aa|down_9|NC_008312.1_6883491_6885699_+	COG3211, PhoX, Predicted phosphatase [General function prediction only]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	46	7020732-7020890	36	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TATTGGTGCGTTTTCTGCTGAAGTTTGTCTTGCTCAGAGGTTTATCGGGAGG	52	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|229aa|up_9|NC_008312.1_7015728_7016415_-,NA|48aa|up_7|NC_008312.1_7016661_7016805_-,NA|79aa|up_4|NC_008312.1_7017919_7018156_+,NA|59aa|up_1|NC_008312.1_7019124_7019301_-,NA|65aa|down_7|NC_008312.1_7036096_7036291_-,NA|70aa|down_8|NC_008312.1_7036615_7036825_-	NA|229aa|up_9|NC_008312.1_7015728_7016415_-	NA	NA|98aa|up_8|NC_008312.1_7016411_7016705_-	pfam13546, DDE_5, DDE superfamily endonuclease	NA|48aa|up_7|NC_008312.1_7016661_7016805_-	NA	NA|128aa|up_6|NC_008312.1_7016955_7017339_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|156aa|up_5|NC_008312.1_7017384_7017852_-	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|79aa|up_4|NC_008312.1_7017919_7018156_+	NA	NA|79aa|up_3|NC_008312.1_7018534_7018771_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|48aa|up_2|NC_008312.1_7018842_7018986_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|59aa|up_1|NC_008312.1_7019124_7019301_-	NA	NA|128aa|up_0|NC_008312.1_7020176_7020560_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|107aa|down_0|NC_008312.1_7022660_7022981_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|333aa|down_1|NC_008312.1_7024322_7025321_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|435aa|down_2|NC_008312.1_7025684_7026989_-	PRK06349, PRK06349, homoserine dehydrogenase; Provisional	NA|404aa|down_3|NC_008312.1_7028525_7029737_-	pfam04339, FemAB_like, Peptidogalycan biosysnthesis/recognition	NA|446aa|down_4|NC_008312.1_7030420_7031758_-	PRK07583, PRK07583, cytosine deaminase	NA|253aa|down_5|NC_008312.1_7031876_7032635_+	pfam04481, DUF561, Protein of unknown function (DUF561)	NA|996aa|down_6|NC_008312.1_7033112_7036100_+	pfam01551, Peptidase_M23, Peptidase family M23	NA|65aa|down_7|NC_008312.1_7036096_7036291_-	NA	NA|70aa|down_8|NC_008312.1_7036615_7036825_-	NA	NA|274aa|down_9|NC_008312.1_7037038_7037860_-	COG4735, COG4735, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	47	7278240-7278341	37	CRISPRCasFinder	no	RT	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Unclear	TCTGCAACTATGGATAAAAAAACTTCTAAATTATTA	36	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|87aa|up_9|NC_008312.1_7263601_7263862_-,NA|124aa|up_0|NC_008312.1_7277651_7278023_+,NA|57aa|down_0|NC_008312.1_7278919_7279090_-,NA|65aa|down_3|NC_008312.1_7283270_7283465_-,NA|50aa|down_9|NC_008312.1_7307140_7307290_+	NA|87aa|up_9|NC_008312.1_7263601_7263862_-	NA	NA|126aa|up_8|NC_008312.1_7265338_7265716_+	pfam13655, RVT_N, N-terminal domain of reverse transcriptase	NA|152aa|up_7|NC_008312.1_7265800_7266256_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|168aa|up_6|NC_008312.1_7266264_7266768_-	pfam13565, HTH_32, Homeodomain-like domain	NA|168aa|up_5|NC_008312.1_7268677_7269181_+	pfam13565, HTH_32, Homeodomain-like domain	NA|152aa|up_4|NC_008312.1_7269189_7269645_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|348aa|up_3|NC_008312.1_7273366_7274410_+	COG2138, COG2138, Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism]	NA|327aa|up_2|NC_008312.1_7275613_7276594_+	PRK10717, PRK10717, cysteine synthase A; Provisional	NA|83aa|up_1|NC_008312.1_7276609_7276858_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|124aa|up_0|NC_008312.1_7277651_7278023_+	NA	NA|57aa|down_0|NC_008312.1_7278919_7279090_-	NA	NA|448aa|down_1|NC_008312.1_7279224_7280568_+	PRK06116, PRK06116, glutathione reductase; Validated	NA|388aa|down_2|NC_008312.1_7280858_7282022_-	pfam14234, DUF4336, Domain of unknown function (DUF4336)	NA|65aa|down_3|NC_008312.1_7283270_7283465_-	NA	NA|793aa|down_4|NC_008312.1_7284209_7286588_-	cd04620, CBS_two-component_sensor_histidine_kinase_repeat1, 2 tandem repeats of the CBS domain in the two-component sensor histidine kinase and related-proteins, repeat 1	NA|87aa|down_5|NC_008312.1_7288917_7289178_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|3428aa|down_6|NC_008312.1_7290401_7300685_+	cd13692, PBP2_BztA, Substrate bindng domain of ABC glutamate/glutamine/aspartate/asparagine transporter; the type 2 periplasmic binding protein fold	NA|273aa|down_7|NC_008312.1_7302433_7303252_-	cd01643, Bacterial_IMPase_like_2, Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases	NA|208aa|down_8|NC_008312.1_7305069_7305693_-	pfam06080, DUF938, Protein of unknown function (DUF938)	NA|50aa|down_9|NC_008312.1_7307140_7307290_+	NA
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	48	7295115-7295467	38	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	GGTGGAAGTGGTAACGACCAACTTTATGG	29	0	0	NA	NA	NA	5	5	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|124aa|up_6|NC_008312.1_7277651_7278023_+,NA|57aa|up_5|NC_008312.1_7278919_7279090_-,NA|65aa|up_2|NC_008312.1_7283270_7283465_-,NA|50aa|down_2|NC_008312.1_7307140_7307290_+	NA|348aa|up_9|NC_008312.1_7273366_7274410_+	COG2138, COG2138, Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism]	NA|327aa|up_8|NC_008312.1_7275613_7276594_+	PRK10717, PRK10717, cysteine synthase A; Provisional	NA|83aa|up_7|NC_008312.1_7276609_7276858_+	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|124aa|up_6|NC_008312.1_7277651_7278023_+	NA	NA|57aa|up_5|NC_008312.1_7278919_7279090_-	NA	NA|448aa|up_4|NC_008312.1_7279224_7280568_+	PRK06116, PRK06116, glutathione reductase; Validated	NA|388aa|up_3|NC_008312.1_7280858_7282022_-	pfam14234, DUF4336, Domain of unknown function (DUF4336)	NA|65aa|up_2|NC_008312.1_7283270_7283465_-	NA	NA|793aa|up_1|NC_008312.1_7284209_7286588_-	cd04620, CBS_two-component_sensor_histidine_kinase_repeat1, 2 tandem repeats of the CBS domain in the two-component sensor histidine kinase and related-proteins, repeat 1	NA|87aa|up_0|NC_008312.1_7288917_7289178_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|273aa|down_0|NC_008312.1_7302433_7303252_-	cd01643, Bacterial_IMPase_like_2, Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases	NA|208aa|down_1|NC_008312.1_7305069_7305693_-	pfam06080, DUF938, Protein of unknown function (DUF938)	NA|50aa|down_2|NC_008312.1_7307140_7307290_+	NA	NA|487aa|down_3|NC_008312.1_7307374_7308835_+	COG0654, UbiH, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases [Coenzyme metabolism / Energy production and conversion]	NA|149aa|down_4|NC_008312.1_7309944_7310391_-	pfam01050, MannoseP_isomer, Mannose-6-phosphate isomerase	NA|392aa|down_5|NC_008312.1_7311741_7312917_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|72aa|down_6|NC_008312.1_7313949_7314165_+	pfam01724, DUF29, Domain of unknown function DUF29	NA|224aa|down_7|NC_008312.1_7314885_7315557_+	COG1413, COG1413, FOG: HEAT repeat [Energy production and conversion]	NA|354aa|down_8|NC_008312.1_7315950_7317012_-	TIGR01151, Photosystem_QB_protein, photosystem II, DI subunit (also called Q(B))	NA|116aa|down_9|NC_008312.1_7317336_7317684_-	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	49	7422123-7422217	39	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	TCTCCAAAATAAAAATTAAGGTAAAATT	28	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA,NA|160aa|down_7|NC_008312.1_7428606_7429086_+,NA|55aa|down_8|NC_008312.1_7432119_7432284_+	NA|52aa|up_9|NC_008312.1_7397681_7397837_+	pfam09565, RE_NgoFVII, NgoFVII restriction endonuclease	NA|398aa|up_8|NC_008312.1_7401162_7402356_-	COG1565, COG1565, Uncharacterized conserved protein [Function unknown]	NA|249aa|up_7|NC_008312.1_7402562_7403309_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|395aa|up_6|NC_008312.1_7406253_7407438_-	PRK07415, PRK07415, NAD(P)H-quinone oxidoreductase subunit H; Validated	NA|284aa|up_5|NC_008312.1_7408134_7408986_+	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|554aa|up_4|NC_008312.1_7409628_7411290_+	cd17569, REC_HupR-like, phosphoacceptor receiver (REC) domain of hydrogen uptake protein regulator (HupR) and similar domains	NA|141aa|up_3|NC_008312.1_7411294_7411717_+	COG2172, RsbW, Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms]	NA|309aa|up_2|NC_008312.1_7413756_7414683_+	TIGR00276, iron-sulfur_cluster-binding_protein, epoxyqueuosine reductase	NA|60aa|up_1|NC_008312.1_7416578_7416758_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|346aa|up_0|NC_008312.1_7420787_7421825_+	PRK00892, lpxD, UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional	NA|78aa|down_0|NC_008312.1_7422356_7422590_+	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|102aa|down_1|NC_008312.1_7422645_7422951_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|59aa|down_2|NC_008312.1_7422952_7423129_+	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|60aa|down_3|NC_008312.1_7423516_7423696_+	COG3677, COG3677, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|34aa|down_4|NC_008312.1_7423846_7423948_+	COG1662, InsB, Transposase and inactivated derivatives, IS1 family [DNA replication, recombination, and repair]	NA|158aa|down_5|NC_008312.1_7425906_7426380_-	pfam13359, DDE_Tnp_4, DDE superfamily endonuclease	NA|125aa|down_6|NC_008312.1_7426400_7426775_-	pfam13613, HTH_Tnp_4, Helix-turn-helix of DDE superfamily endonuclease	NA|160aa|down_7|NC_008312.1_7428606_7429086_+	NA	NA|55aa|down_8|NC_008312.1_7432119_7432284_+	NA	NA|141aa|down_9|NC_008312.1_7432429_7432852_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family
GCF_000014265.1_ASM1426v1	NC_008312	Trichodesmium erythraeum IMS101, complete genome	50	7546756-7546858	40	CRISPRCasFinder	no		RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	Orphan	ATTTTTATTCTTTTATATTTACAGTTT	27	0	0	NA	NA	NA	1	1	Orphan	RT,cas14k,Cas14c_CAS-V-F,cas14j,Cas14u_CAS-V,PD-DExK,cas3,c2c9_V-U4,DinG,WYL	NA|203aa|up_5|NC_008312.1_7540272_7540881_+,NA|334aa|down_3|NC_008312.1_7552349_7553351_+,NA|316aa|down_7|NC_008312.1_7565851_7566799_-,NA|302aa|down_8|NC_008312.1_7566801_7567707_-	NA|450aa|up_9|NC_008312.1_7533288_7534638_-	PLN03192, PLN03192, Voltage-dependent potassium channel; Provisional	NA|751aa|up_8|NC_008312.1_7535045_7537298_-	COG3914, Spy, Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]	NA|226aa|up_7|NC_008312.1_7537609_7538287_+	pfam05838, Glyco_hydro_108, Glycosyl hydrolase 108	NA|346aa|up_6|NC_008312.1_7539210_7540248_+	pfam13358, DDE_3, DDE superfamily endonuclease	NA|203aa|up_5|NC_008312.1_7540272_7540881_+	NA	NA|214aa|up_4|NC_008312.1_7541453_7542095_+	PRK07455, PRK07455, bifunctional 4-hydroxy-2-oxoglutarate aldolase/2-dehydro-3-deoxy-phosphogluconate aldolase	NA|125aa|up_3|NC_008312.1_7542510_7542885_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|318aa|up_2|NC_008312.1_7543283_7544237_+	cd01017, AdcA, Metal binding protein AdcA	NA|267aa|up_1|NC_008312.1_7544233_7545034_+	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|273aa|up_0|NC_008312.1_7545482_7546301_+	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|392aa|down_0|NC_008312.1_7547129_7548305_+	TIGR03472, HpnI, hopanoid biosynthesis associated glycosyl transferase protein HpnI	NA|384aa|down_1|NC_008312.1_7548843_7549995_+	COG0399, WecE, Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis [Cell envelope biogenesis, outer membrane]	NA|509aa|down_2|NC_008312.1_7550165_7551692_+	COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]	NA|334aa|down_3|NC_008312.1_7552349_7553351_+	NA	NA|1545aa|down_4|NC_008312.1_7559013_7563648_-	pfam12770, CHAT, CHAT domain	NA|264aa|down_5|NC_008312.1_7563976_7564768_-	pfam08852, DUF1822, Protein of unknown function (DUF1822)	NA|244aa|down_6|NC_008312.1_7564767_7565499_-	PRK12519, PRK12519, RNA polymerase sigma factor; Provisional	NA|316aa|down_7|NC_008312.1_7565851_7566799_-	NA	NA|302aa|down_8|NC_008312.1_7566801_7567707_-	NA	NA|477aa|down_9|NC_008312.1_7570337_7571768_+	TIGR03471, HpnJ, hopanoid biosynthesis associated radical SAM protein HpnJ
