assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	1	602302-602401	1	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	TTGTCGGGCTGAAGCCGCGACAT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|91aa|up_4|NZ_CP041660.1_595008_595281_+,NA	NA|231aa|up_9|NZ_CP041660.1_589780_590473_-	PRK05584, PRK05584, 5'-methylthioadenosine/adenosylhomocysteine nucleosidase	NA|201aa|up_8|NZ_CP041660.1_590534_591137_-	pfam10881, DUF2726, Protein of unknown function (DUF2726)	NA|266aa|up_7|NZ_CP041660.1_591242_592040_+	COG3148, COG3148, Uncharacterized conserved protein [Function unknown]	NA|676aa|up_6|NZ_CP041660.1_592019_594047_-	COG5001, COG5001, Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain [Signal transduction mechanisms]	NA|235aa|up_5|NZ_CP041660.1_594256_594961_+	pfam04286, DUF445, Protein of unknown function (DUF445)	NA|91aa|up_4|NZ_CP041660.1_595008_595281_+	NA	NA|324aa|up_3|NZ_CP041660.1_595293_596265_+	sd00010, SLR, Sel1-like repeat	NA|155aa|up_2|NZ_CP041660.1_596307_596772_+	pfam05981, CreA, CreA protein	NA|973aa|up_1|NZ_CP041660.1_597367_600286_-	PRK04914, PRK04914, RNA polymerase-associated protein RapA	NA|402aa|up_0|NZ_CP041660.1_600757_601963_-	pfam01609, DDE_Tnp_1, Transposase DDE domain	NA|463aa|down_0|NZ_CP041660.1_602578_603967_+	COG1875, COG1875, NYN ribonuclease and ATPase of PhoH family domains [General    function prediction only]	NA|72aa|down_1|NZ_CP041660.1_604343_604559_-	pfam06945, DUF1289, Protein of unknown function (DUF1289)	NA|107aa|down_2|NZ_CP041660.1_604707_605028_+	COG2198, ArcB, FOG: HPt domain [Signal transduction mechanisms]	NA|309aa|down_3|NZ_CP041660.1_605041_605968_+	COG3706, PleD, Response regulator containing a CheY-like receiver domain and a GGDEF domain [Signal transduction mechanisms]	NA|156aa|down_4|NZ_CP041660.1_605984_606452_-	PRK05066, PRK05066, transcriptional regulator ArgR	NA|279aa|down_5|NZ_CP041660.1_606890_607727_-	PRK10987, PRK10987, beta-lactamase regulator AmpE	NA|192aa|down_6|NZ_CP041660.1_607723_608299_-	PRK11789, PRK11789, 1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD	NA|160aa|down_7|NZ_CP041660.1_608336_608816_+	cd05483, retropepsin_like_bacteria, Bacterial aspartate proteases, retropepsin-like protease family	NA|288aa|down_8|NZ_CP041660.1_609354_610218_+	PRK09125, PRK09125, DNA ligase; Provisional	NA|471aa|down_9|NZ_CP041660.1_610224_611637_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	2	619100-619323	2	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	CCTGTAGGTCGTGGCTTTAGCCCGACAAATCTCGCAACATGCAC	44	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA,NA	NA|160aa|up_9|NZ_CP041660.1_608336_608816_+	cd05483, retropepsin_like_bacteria, Bacterial aspartate proteases, retropepsin-like protease family	NA|288aa|up_8|NZ_CP041660.1_609354_610218_+	PRK09125, PRK09125, DNA ligase; Provisional	NA|471aa|up_7|NZ_CP041660.1_610224_611637_+	COG1236, YSH1, Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]	NA|286aa|up_6|NZ_CP041660.1_611878_612736_-	PRK09016, PRK09016, carboxylating nicotinate-nucleotide diphosphorylase	NA|470aa|up_5|NZ_CP041660.1_613338_614748_+	TIGR01479, Mannose-1-phosphate_guanylyltransferase, mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase	NA|455aa|up_4|NZ_CP041660.1_615000_616365_+	PRK15414, PRK15414, phosphomannomutase	NA|115aa|up_3|NZ_CP041660.1_616583_616928_+	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|389aa|up_2|NZ_CP041660.1_616989_618156_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|145aa|up_1|NZ_CP041660.1_618285_618720_-	pfam01934, DUF86, Protein of unknown function DUF86	NA|127aa|up_0|NZ_CP041660.1_618712_619093_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|735aa|down_0|NZ_CP041660.1_619409_621614_-	pfam06082, YjbH, Exopolysaccharide biosynthesis protein YbjH	NA|361aa|down_1|NZ_CP041660.1_621902_622985_-	TIGR02380, ECA_wecA, undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphatetransferase	NA|150aa|down_2|NZ_CP041660.1_623169_623619_-	PRK15434, PRK15434, GDP-mannose mannosyl hydrolase	NA|320aa|down_3|NZ_CP041660.1_623621_624581_-	cd05239, GDP_FS_SDR_e, GDP-fucose synthetase, extended (e) SDRs	NA|251aa|down_4|NZ_CP041660.1_624606_625359_-	cd06433, GT_2_WfgS_like, WfgS and WfeV are involved in O-antigen biosynthesis	NA|373aa|down_5|NZ_CP041660.1_625480_626599_-	COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]	NA|328aa|down_6|NZ_CP041660.1_626595_627579_-	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|364aa|down_7|NZ_CP041660.1_627619_628711_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|347aa|down_8|NZ_CP041660.1_628707_629748_-	cd03809, GT4_MtfB-like, glycosyltransferases MtfB, WbpX, and similar proteins	NA|383aa|down_9|NZ_CP041660.1_629734_630883_-	cd03811, GT4_GT28_WabH-like, family 4 and family 28 glycosyltransferases similar to Klebsiella WabH
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	3	844627-844773	3	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	AAGTTTCGCCAGCATTGTAGGTCGTGGCTTTAGCCCGACAATCAGGCA	48	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA,NA|312aa|down_5|NZ_CP041660.1_852711_853647_+	NA|267aa|up_9|NZ_CP041660.1_825994_826795_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|254aa|up_8|NZ_CP041660.1_827235_827997_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|494aa|up_7|NZ_CP041660.1_828079_829561_-	cd07112, ALDH_GABALDH-PuuC, Escherichia coli NADP+-dependent gamma-glutamyl-gamma-aminobutyraldehyde dehydrogenase PuuC-like	NA|972aa|up_6|NZ_CP041660.1_829817_832733_-	TIGR01782, TonB-dependent_receptor, TonB-dependent receptor	NA|1156aa|up_5|NZ_CP041660.1_832870_836338_-	cd18832, GH43_GsAbnA-like, Glycosyl hydrolase family 43 protein such as Geobacillus stearothermophilus endo-alpha-1,5-L-arabinanase AbnA	NA|298aa|up_4|NZ_CP041660.1_836956_837850_+	pfam00497, SBP_bac_3, Bacterial extracellular solute-binding proteins, family 3	NA|387aa|up_3|NZ_CP041660.1_837882_839043_+	cd09019, galactose_mutarotase_like, galactose mutarotase_like	NA|647aa|up_2|NZ_CP041660.1_839143_841084_+	pfam10566, Glyco_hydro_97, Glycoside hydrolase 97	NA|611aa|up_1|NZ_CP041660.1_841233_843066_-	pfam08787, Alginate_lyase2, Alginate lyase	NA|391aa|up_0|NZ_CP041660.1_843345_844518_-	PRK06753, PRK06753, hypothetical protein; Provisional	NA|479aa|down_0|NZ_CP041660.1_844932_846369_-	cd17359, MFS_XylE_like, D-xylose-proton symporter and similar transporters of the Major Facilitator Superfamily	NA|391aa|down_1|NZ_CP041660.1_846539_847712_+	cd01543, PBP1_XylR, ligand-binding domain of DNA transcription repressor specific for xylose (XylR)	NA|862aa|down_2|NZ_CP041660.1_847801_850387_-	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|244aa|down_3|NZ_CP041660.1_850903_851635_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|257aa|down_4|NZ_CP041660.1_851751_852522_-	pfam06283, ThuA, Trehalose utilisation	NA|312aa|down_5|NZ_CP041660.1_852711_853647_+	NA	NA|723aa|down_6|NZ_CP041660.1_853683_855852_-	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|148aa|down_7|NZ_CP041660.1_856109_856553_+	PRK11639, PRK11639, zinc uptake transcriptional repressor Zur	NA|497aa|down_8|NZ_CP041660.1_856562_858053_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|510aa|down_9|NZ_CP041660.1_858083_859613_-	COG3025, COG3025, Uncharacterized conserved protein [Function unknown]
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	4	865592-865678	4	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	AGGCTTTAGCCCGACAATCAAGC	23	1	6	865615-865655|865615-865655|865615-865655|865615-865655|865615-865655|865615-865655	NZ_CP041660.1_863998-864038|NZ_CP041660.1_864062-864102|NZ_CP041660.1_864190-864230|NZ_CP041660.1_864126-864166|NZ_CP041660.1_864254-864294|NZ_CP041660.1_863934-863974	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|312aa|up_6|NZ_CP041660.1_852711_853647_+,NA|110aa|down_3|NZ_CP041660.1_870851_871181_-,NA|281aa|down_5|NZ_CP041660.1_872572_873415_-	NA|862aa|up_9|NZ_CP041660.1_847801_850387_-	COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate transport and metabolism]	NA|244aa|up_8|NZ_CP041660.1_850903_851635_+	COG1051, COG1051, ADP-ribose pyrophosphatase [Nucleotide transport and metabolism]	NA|257aa|up_7|NZ_CP041660.1_851751_852522_-	pfam06283, ThuA, Trehalose utilisation	NA|312aa|up_6|NZ_CP041660.1_852711_853647_+	NA	NA|723aa|up_5|NZ_CP041660.1_853683_855852_-	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|148aa|up_4|NZ_CP041660.1_856109_856553_+	PRK11639, PRK11639, zinc uptake transcriptional repressor Zur	NA|497aa|up_3|NZ_CP041660.1_856562_858053_-	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|510aa|up_2|NZ_CP041660.1_858083_859613_-	COG3025, COG3025, Uncharacterized conserved protein [Function unknown]	NA|202aa|up_1|NZ_CP041660.1_859824_860430_+	TIGR04211, hypothetical_protein, SH3 domain protein	NA|1051aa|up_0|NZ_CP041660.1_860552_863705_+	PRK11904, PRK11904, bifunctional proline dehydrogenase/L-glutamate gamma-semialdehyde dehydrogenase PutA	NA|414aa|down_0|NZ_CP041660.1_866003_867245_-	TIGR01988, Ubiquinone_biosynthesis_monooxygenase_COQ6, Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family	NA|405aa|down_1|NZ_CP041660.1_867246_868461_-	PRK05732, PRK05732, 2-octaprenyl-6-methoxyphenyl hydroxylase; Validated	NA|197aa|down_2|NZ_CP041660.1_869829_870420_-	PRK01736, PRK01736, hypothetical protein; Reviewed	NA|110aa|down_3|NZ_CP041660.1_870851_871181_-	NA	NA|348aa|down_4|NZ_CP041660.1_871371_872415_-	PRK09354, recA, recombinase A; Provisional	NA|281aa|down_5|NZ_CP041660.1_872572_873415_-	NA	NA|168aa|down_6|NZ_CP041660.1_874275_874779_-	pfam02464, CinA, Competence-damaged protein	NA|863aa|down_7|NZ_CP041660.1_874814_877403_+	PRK05399, PRK05399, DNA mismatch repair protein MutS; Provisional	NA|286aa|down_8|NZ_CP041660.1_877460_878318_+	PRK10792, PRK10792, bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase FolD	NA|245aa|down_9|NZ_CP041660.1_878422_879157_-	COG0678, AHP1, Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	5	1187054-1187139	5	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	TTTGTCGGCCTAAAGGCCCGACCTACAC	28	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|47aa|up_9|NZ_CP041660.1_1175184_1175325_+,NA|267aa|up_8|NZ_CP041660.1_1175649_1176450_-,NA|124aa|up_7|NZ_CP041660.1_1176464_1176836_-,NA|281aa|up_0|NZ_CP041660.1_1186050_1186893_-,NA|320aa|down_9|NZ_CP041660.1_1198987_1199947_+	NA|47aa|up_9|NZ_CP041660.1_1175184_1175325_+	NA	NA|267aa|up_8|NZ_CP041660.1_1175649_1176450_-	NA	NA|124aa|up_7|NZ_CP041660.1_1176464_1176836_-	NA	NA|270aa|up_6|NZ_CP041660.1_1177539_1178349_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|653aa|up_5|NZ_CP041660.1_1179237_1181196_-	pfam06241, Castor_Poll_mid, Castor and Pollux, part of voltage-gated ion channel	NA|423aa|up_4|NZ_CP041660.1_1181371_1182640_-	PRK09376, rho, transcription termination factor Rho; Provisional	NA|110aa|up_3|NZ_CP041660.1_1182824_1183154_-	PRK09381, trxA, thioredoxin TrxA	NA|423aa|up_2|NZ_CP041660.1_1183261_1184530_+	PRK04837, PRK04837, ATP-dependent RNA helicase RhlB; Provisional	NA|498aa|up_1|NZ_CP041660.1_1184544_1186038_+	PRK11031, PRK11031, guanosine-5'-triphosphate,3'-diphosphate diphosphatase	NA|281aa|up_0|NZ_CP041660.1_1186050_1186893_-	NA	NA|310aa|down_0|NZ_CP041660.1_1187285_1188215_+	PRK00072, hemC, porphobilinogen deaminase; Reviewed	NA|264aa|down_1|NZ_CP041660.1_1188192_1188984_+	cd06578, HemD, Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme	NA|430aa|down_2|NZ_CP041660.1_1188980_1190270_+	pfam04375, HemX, HemX, putative uroporphyrinogen-III C-methyltransferase	NA|396aa|down_3|NZ_CP041660.1_1190266_1191454_+	COG3071, HemY, Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]	NA|698aa|down_4|NZ_CP041660.1_1191572_1193666_+	COG5001, COG5001, Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain [Signal transduction mechanisms]	NA|841aa|down_5|NZ_CP041660.1_1193675_1196198_+	cd16434, CheB-CheR_fusion, Chemotaxis response regulator protein-glutamate methylesterase, CheB, fused with CheR domain	NA|471aa|down_6|NZ_CP041660.1_1196272_1197685_-	cd01115, SLC13_permease, Permease SLC13 (solute carrier 13)	NA|133aa|down_7|NZ_CP041660.1_1197847_1198246_+	COG5331, COG5331, Uncharacterized protein conserved in bacteria [Function unknown]	NA|189aa|down_8|NZ_CP041660.1_1198298_1198865_+	pfam03831, PhnA, PhnA domain	NA|320aa|down_9|NZ_CP041660.1_1198987_1199947_+	NA
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	6	1210075-1210183	6	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	AAATTGTCGGCCTAAAGGCGCGACCTACA	29	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|320aa|up_9|NZ_CP041660.1_1198987_1199947_+,NA|195aa|up_4|NZ_CP041660.1_1204485_1205070_-,NA|305aa|down_3|NZ_CP041660.1_1213056_1213971_+	NA|320aa|up_9|NZ_CP041660.1_1198987_1199947_+	NA	NA|289aa|up_8|NZ_CP041660.1_1199926_1200793_-	cd08470, PBP2_CrgA_like_1, The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator CrgA-like, contains the type 2 periplasmic binding domain	NA|377aa|up_7|NZ_CP041660.1_1201001_1202132_+	cd08300, alcohol_DH_class_III, class III alcohol dehydrogenases	NA|278aa|up_6|NZ_CP041660.1_1202134_1202968_+	PLN02442, PLN02442, S-formylglutathione hydrolase	NA|426aa|up_5|NZ_CP041660.1_1203142_1204420_+	cd08560, GDPD_EcGlpQ_like_1, Glycerophosphodiester phosphodiesterase domain similar to Escherichia coli periplasmic phosphodiesterase (GlpQ) include uncharacterized proteins	NA|195aa|up_4|NZ_CP041660.1_1204485_1205070_-	NA	NA|401aa|up_3|NZ_CP041660.1_1205082_1206285_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|204aa|up_2|NZ_CP041660.1_1206450_1207062_-	cd17546, REC_hyHK_CKI1_RcsC-like, phosphoacceptor receiver (REC) domain of hybrid sensor histidine kinases/response regulators similar to Arabidopsis thaliana CKI1 and Escherichia coli RcsC	NA|434aa|up_1|NZ_CP041660.1_1207058_1208360_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional	NA|450aa|up_0|NZ_CP041660.1_1208668_1210018_-	PRK06116, PRK06116, glutathione reductase; Validated	NA|122aa|down_0|NZ_CP041660.1_1210376_1210742_+	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	NA|177aa|down_1|NZ_CP041660.1_1211027_1211558_-	PRK05255, PRK05255, ribosome-associated protein	NA|444aa|down_2|NZ_CP041660.1_1211655_1212987_+	PRK11040, PRK11040, peptidase PmbA; Provisional	NA|305aa|down_3|NZ_CP041660.1_1213056_1213971_+	NA	NA|362aa|down_4|NZ_CP041660.1_1216448_1217534_-	COG4299, COG4299, Uncharacterized protein conserved in bacteria [Function unknown]	NA|390aa|down_5|NZ_CP041660.1_1217602_1218772_-	COG4942, COG4942, Membrane-bound metallopeptidase [Cell division and chromosome partitioning]	NA|516aa|down_6|NZ_CP041660.1_1218780_1220328_-	PRK05434, PRK05434, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|143aa|down_7|NZ_CP041660.1_1220515_1220944_+	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|85aa|down_8|NZ_CP041660.1_1220950_1221205_+	TIGR02181, GRX_bact, Glutaredoxin, GrxC family	NA|172aa|down_9|NZ_CP041660.1_1221226_1221742_+	PRK05751, PRK05751, preprotein translocase subunit SecB; Validated
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	7	2795303-2795418	7	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	TTAGGGCGACAATTTTTTATACACTTGTCGGCATAAATACAG	42	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|124aa|up_6|NZ_CP041660.1_2788694_2789066_+,NA	NA|146aa|up_9|NZ_CP041660.1_2786332_2786770_+	pfam03646, FlaG, FlaG protein	NA|489aa|up_8|NZ_CP041660.1_2786800_2788267_+	COG1345, FliD, Flagellar capping protein [Cell motility and secretion]	NA|146aa|up_7|NZ_CP041660.1_2788300_2788738_+	pfam02561, FliS, Flagellar protein FliS	NA|124aa|up_6|NZ_CP041660.1_2788694_2789066_+	NA	NA|333aa|up_5|NZ_CP041660.1_2789077_2790076_+	TIGR03589, PseB, UDP-N-acetylglucosamine 4,6-dehydratase (inverting)	NA|390aa|up_4|NZ_CP041660.1_2790072_2791242_+	TIGR03588, PseC, UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase	NA|233aa|up_3|NZ_CP041660.1_2791238_2791937_+	TIGR03584, PseF, pseudaminic acid cytidylyltransferase	NA|345aa|up_2|NZ_CP041660.1_2791945_2792980_+	cd03789, GT9_LPS_heptosyltransferase, lipopolysaccharide heptosyltransferase and similar proteins	NA|359aa|up_1|NZ_CP041660.1_2793014_2794091_+	TIGR03586, PseI, pseudaminic acid synthase	NA|364aa|up_0|NZ_CP041660.1_2794044_2795136_+	TIGR03590, PseG, UDP-2,4-diacetamido-2,4,6-trideoxy-beta-L-altropyranose hydrolase	NA|182aa|down_0|NZ_CP041660.1_2795526_2796072_+	TIGR03585, PseH, UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine N-acetyltransferase	NA|77aa|down_1|NZ_CP041660.1_2796077_2796308_+	COG0236, AcpP, Acyl carrier protein [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|248aa|down_2|NZ_CP041660.1_2796324_2797068_+	PRK05653, fabG, 3-oxoacyl-ACP reductase FabG	NA|492aa|down_3|NZ_CP041660.1_2797067_2798543_+	cd05922, FACL_like_6, Uncharacterized subfamily of fatty acid CoA ligase (FACL)	NA|355aa|down_4|NZ_CP041660.1_2798529_2799594_+	pfam04443, LuxE, Acyl-protein synthetase, LuxE	NA|399aa|down_5|NZ_CP041660.1_2799577_2800774_+	pfam05893, LuxC, Acyl-CoA reductase (LuxC)	NA|1184aa|down_6|NZ_CP041660.1_2800799_2804351_+	pfam01973, MAF_flag10, Protein of unknown function DUF115	NA|596aa|down_7|NZ_CP041660.1_2804383_2806171_+	pfam00884, Sulfatase, Sulfatase	NA|305aa|down_8|NZ_CP041660.1_2806163_2807078_+	cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis	NA|402aa|down_9|NZ_CP041660.1_2807105_2808311_+	cd16438, beta_Kdo_transferase_KpsS_like, beta-3-deoxy-D-manno-oct-2-ulosonic acid (Kdo)-transferase KpsS like
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	8	2843833-2843942	8	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	CTTGCTTTGTTGGCCTAAAGACCAACCTACA	31	1	1	2843864-2843911	NZ_CP041660.1_2843587-2843634	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA,NA	NA|352aa|up_9|NZ_CP041660.1_2832539_2833595_-	cd06426, NTP_transferase_like_2, NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family	NA|395aa|up_8|NZ_CP041660.1_2833610_2834795_-	TIGR03568, Polysialic_acid_biosynthesis_protein_P7, UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing	NA|271aa|up_7|NZ_CP041660.1_2834787_2835600_-	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|348aa|up_6|NZ_CP041660.1_2835602_2836646_-	TIGR03569, ORF_8_similar_to_NeuB_family, N-acetylneuraminate synthase	NA|327aa|up_5|NZ_CP041660.1_2836648_2837629_-	TIGR04180, NAD-dependent_epimerase/dehydratase, NAD dependent epimerase/dehydratase, LLPSF_EDH_00030 family	NA|133aa|up_4|NZ_CP041660.1_2837905_2838304_+	cd02171, G3P_Cytidylyltransferase, glycerol-3-phosphate cytidylyltransferase	NA|266aa|up_3|NZ_CP041660.1_2838410_2839208_+	PRK12804, PRK12804, flagellin; Provisional	NA|482aa|up_2|NZ_CP041660.1_2839426_2840872_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|361aa|up_1|NZ_CP041660.1_2841029_2842112_+	PRK11360, PRK11360, two-component system sensor histidine kinase AtoS	NA|444aa|up_0|NZ_CP041660.1_2842124_2843456_+	COG2204, AtoC, Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms]	NA|113aa|down_0|NZ_CP041660.1_2844244_2844583_+	PRK00253, fliE, flagellar hook-basal body protein FliE; Reviewed	NA|562aa|down_1|NZ_CP041660.1_2844637_2846323_+	PRK06007, fliF, flagellar basal body M-ring protein FliF	NA|347aa|down_2|NZ_CP041660.1_2846315_2847356_+	PRK05686, fliG, flagellar motor switch protein G; Validated	NA|264aa|down_3|NZ_CP041660.1_2847368_2848160_+	PRK05687, fliH, flagellar assembly protein FliH	NA|444aa|down_4|NZ_CP041660.1_2848231_2849563_+	PRK08972, fliI, flagellar protein export ATPase FliI	NA|147aa|down_5|NZ_CP041660.1_2849566_2850007_+	PRK05689, fliJ, flagella biosynthesis chaperone FliJ	NA|850aa|down_6|NZ_CP041660.1_2850199_2852749_+	cd17470, T3SS_Flik_C, C-terminal domain of flagellar hook-length control protein FliK and similar domains	NA|180aa|down_7|NZ_CP041660.1_2852832_2853372_+	PRK05696, fliL, flagellar basal body-associated protein FliL; Reviewed	NA|358aa|down_8|NZ_CP041660.1_2853388_2854462_+	PRK06666, fliM, flagellar motor switch protein FliM; Validated	NA|134aa|down_9|NZ_CP041660.1_2854500_2854902_+	PRK08983, fliN, flagellar motor switch protein FliN
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	9	2904732-2904859	9	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	ATTGCAGAATTTGTCGGCGTAAACACCGACCTACCT	36	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|72aa|up_9|NZ_CP041660.1_2895172_2895388_+,NA|184aa|up_6|NZ_CP041660.1_2898350_2898902_+,NA|464aa|up_5|NZ_CP041660.1_2898963_2900355_+,NA	NA|72aa|up_9|NZ_CP041660.1_2895172_2895388_+	NA	NA|307aa|up_8|NZ_CP041660.1_2895470_2896391_+	cd07984, LPLAT_LABLAT-like, Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: LABLAT-like	NA|646aa|up_7|NZ_CP041660.1_2896371_2898309_+	COG1506, DAP2, Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism]	NA|184aa|up_6|NZ_CP041660.1_2898350_2898902_+	NA	NA|464aa|up_5|NZ_CP041660.1_2898963_2900355_+	NA	NA|196aa|up_4|NZ_CP041660.1_2900769_2901357_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|143aa|up_3|NZ_CP041660.1_2901444_2901873_+	pfam14342, DUF4396, Domain of unknown function (DUF4396)	NA|393aa|up_2|NZ_CP041660.1_2901869_2903048_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|189aa|up_1|NZ_CP041660.1_2903061_2903628_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|245aa|up_0|NZ_CP041660.1_2903795_2904530_+	PRK08264, PRK08264, SDR family oxidoreductase	NA|265aa|down_0|NZ_CP041660.1_2906686_2907481_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|286aa|down_1|NZ_CP041660.1_2907480_2908338_+	COG1305, COG1305, Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism]	NA|419aa|down_2|NZ_CP041660.1_2908473_2909730_+	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|154aa|down_3|NZ_CP041660.1_2909807_2910269_+	PRK00464, nrdR, transcriptional repressor NrdR	NA|368aa|down_4|NZ_CP041660.1_2910311_2911415_+	PRK10786, ribD, bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase RibD	NA|217aa|down_5|NZ_CP041660.1_2911418_2912069_+	PRK09289, PRK09289, riboflavin synthase	NA|371aa|down_6|NZ_CP041660.1_2912079_2913192_+	PRK14019, PRK14019, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II	NA|155aa|down_7|NZ_CP041660.1_2913261_2913726_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|140aa|down_8|NZ_CP041660.1_2913736_2914156_+	PRK00202, nusB, transcription antitermination factor NusB	NA|326aa|down_9|NZ_CP041660.1_2914159_2915137_+	PRK05731, PRK05731, thiamine monophosphate kinase; Provisional
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	10	2917075-2917167	10	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	AGCTTCGAGCTTAAATGTAGGTC	23	0	0	NA	NA	NA	1	1	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA,NA|176aa|down_4|NZ_CP041660.1_2921753_2922281_+,NA|269aa|down_7|NZ_CP041660.1_2923950_2924757_-,NA|219aa|down_9|NZ_CP041660.1_2926586_2927243_-	NA|154aa|up_9|NZ_CP041660.1_2909807_2910269_+	PRK00464, nrdR, transcriptional repressor NrdR	NA|368aa|up_8|NZ_CP041660.1_2910311_2911415_+	PRK10786, ribD, bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase RibD	NA|217aa|up_7|NZ_CP041660.1_2911418_2912069_+	PRK09289, PRK09289, riboflavin synthase	NA|371aa|up_6|NZ_CP041660.1_2912079_2913192_+	PRK14019, PRK14019, bifunctional 3,4-dihydroxy-2-butanone-4-phosphate synthase/GTP cyclohydrolase II	NA|155aa|up_5|NZ_CP041660.1_2913261_2913726_+	PRK00061, ribH, 6,7-dimethyl-8-ribityllumazine synthase; Provisional	NA|140aa|up_4|NZ_CP041660.1_2913736_2914156_+	PRK00202, nusB, transcription antitermination factor NusB	NA|326aa|up_3|NZ_CP041660.1_2914159_2915137_+	PRK05731, PRK05731, thiamine monophosphate kinase; Provisional	NA|172aa|up_2|NZ_CP041660.1_2915126_2915642_+	pfam04608, PgpA, Phosphatidylglycerophosphatase A	NA|279aa|up_1|NZ_CP041660.1_2915644_2916481_+	COG3491, PcbC, Isopenicillin N synthase and related dioxygenases [General function prediction only]	NA|142aa|up_0|NZ_CP041660.1_2916556_2916982_-	NF033429, ImuA_translesion, translesion DNA synthesis-associated protein ImuA	NA|244aa|down_0|NZ_CP041660.1_2917676_2918408_-	pfam11306, DUF3108, Protein of unknown function (DUF3108)	NA|217aa|down_1|NZ_CP041660.1_2918391_2919042_-	PRK05647, purN, phosphoribosylglycinamide formyltransferase; Reviewed	NA|347aa|down_2|NZ_CP041660.1_2919044_2920085_-	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|373aa|down_3|NZ_CP041660.1_2920642_2921761_+	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|176aa|down_4|NZ_CP041660.1_2921753_2922281_+	NA	NA|299aa|down_5|NZ_CP041660.1_2922377_2923274_+	cd09020, D-hex-6-P-epi_like, D-hexose-6-phosphate epimerase-like	NA|222aa|down_6|NZ_CP041660.1_2923273_2923939_+	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|269aa|down_7|NZ_CP041660.1_2923950_2924757_-	NA	NA|327aa|down_8|NZ_CP041660.1_2925326_2926307_+	COG3380, COG3380, Predicted NAD/FAD-dependent oxidoreductase [General function prediction only]	NA|219aa|down_9|NZ_CP041660.1_2926586_2927243_-	NA
GCF_007289895.1_ASM728989v1	NZ_CP041660	Catenovulum sediminis strain WS1-A chromosome 1, complete sequence	11	3748443-3748597	11	CRISPRCasFinder	no		cas3,WYL,csa3,DEDDh,DinG,RT	Orphan	CTGGCTGGCTTGGTTTTGCTGCT	23	0	0	NA	NA	NA	2	2	Orphan	cas3,WYL,csa3,DEDDh,DinG,RT,PD-DExK	NA|143aa|up_9|NZ_CP041660.1_3739025_3739454_-,NA|325aa|up_8|NZ_CP041660.1_3739444_3740419_-,NA|171aa|up_7|NZ_CP041660.1_3740470_3740983_-,NA|456aa|up_6|NZ_CP041660.1_3741054_3742422_-,NA|288aa|up_5|NZ_CP041660.1_3742437_3743301_-,NA|383aa|up_4|NZ_CP041660.1_3743303_3744452_-,NA|102aa|up_1|NZ_CP041660.1_3747456_3747762_-,NA|132aa|up_0|NZ_CP041660.1_3747775_3748171_-,NA|148aa|down_0|NZ_CP041660.1_3749684_3750128_-,NA|692aa|down_2|NZ_CP041660.1_3750828_3752904_-,NA|250aa|down_3|NZ_CP041660.1_3752981_3753731_-,NA|197aa|down_5|NZ_CP041660.1_3760639_3761230_-,NA|198aa|down_6|NZ_CP041660.1_3761226_3761820_-,NA|66aa|down_8|NZ_CP041660.1_3763851_3764049_-,NA|124aa|down_9|NZ_CP041660.1_3764066_3764438_-	NA|143aa|up_9|NZ_CP041660.1_3739025_3739454_-	NA	NA|325aa|up_8|NZ_CP041660.1_3739444_3740419_-	NA	NA|171aa|up_7|NZ_CP041660.1_3740470_3740983_-	NA	NA|456aa|up_6|NZ_CP041660.1_3741054_3742422_-	NA	NA|288aa|up_5|NZ_CP041660.1_3742437_3743301_-	NA	NA|383aa|up_4|NZ_CP041660.1_3743303_3744452_-	NA	NA|387aa|up_3|NZ_CP041660.1_3745318_3746479_+	pfam07885, Ion_trans_2, Ion channel	NA|144aa|up_2|NZ_CP041660.1_3746668_3747100_+	COG1525, COG1525, Micrococcal nuclease (thermonuclease) homologs [DNA replication, recombination, and repair]	NA|102aa|up_1|NZ_CP041660.1_3747456_3747762_-	NA	NA|132aa|up_0|NZ_CP041660.1_3747775_3748171_-	NA	NA|148aa|down_0|NZ_CP041660.1_3749684_3750128_-	NA	NA|230aa|down_1|NZ_CP041660.1_3750197_3750887_-	pfam14301, DUF4376, Domain of unknown function (DUF4376)	NA|692aa|down_2|NZ_CP041660.1_3750828_3752904_-	NA	NA|250aa|down_3|NZ_CP041660.1_3752981_3753731_-	NA	NA|2296aa|down_4|NZ_CP041660.1_3753748_3760636_-	PRK08581, PRK08581, amidase domain-containing protein	NA|197aa|down_5|NZ_CP041660.1_3760639_3761230_-	NA	NA|198aa|down_6|NZ_CP041660.1_3761226_3761820_-	NA	NA|674aa|down_7|NZ_CP041660.1_3761819_3763841_-	pfam00836, Stathmin, Stathmin family	NA|66aa|down_8|NZ_CP041660.1_3763851_3764049_-	NA	NA|124aa|down_9|NZ_CP041660.1_3764066_3764438_-	NA
