assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000283575.1_ASM28357v1	NC_016048	Oscillibacter valericigenes Sjm18-20, complete genome	1	599608-599723	1	CRISPRCasFinder	no	csa3	csa3,WYL,RT,DinG,DEDDh,cas3	Type I-A	TTTCCCATGACGGAGCTATCCGGCGGAAATCAGTC	35	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,RT,DinG,DEDDh,cas3	NA,NA|166aa|down_9|NC_016048.1_609449_609947_+	NA|225aa|up_9|NC_016048.1_586962_587637_+	PRK00024, PRK00024, DNA repair protein RadC	NA|656aa|up_8|NC_016048.1_587712_589680_+	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|474aa|up_7|NC_016048.1_589860_591282_+	COG1696, DltB, Predicted membrane protein involved in D-alanine export [Cell envelope biogenesis, outer membrane]	NA|396aa|up_6|NC_016048.1_591294_592482_+	cd14440, AlgX_N_like_3, Uncharacterized proteins similar to putative alginate O-acetyltransferase	NA|270aa|up_5|NC_016048.1_592649_593459_+	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|166aa|up_4|NC_016048.1_593463_593961_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|942aa|up_3|NC_016048.1_594112_596938_+	PRK00349, uvrA, excinuclease ABC subunit UvrA	NA|161aa|up_2|NC_016048.1_596993_597476_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|308aa|up_1|NC_016048.1_597530_598454_-	pfam01458, UPF0051, Uncharacterized protein family (UPF0051)	NA|247aa|up_0|NC_016048.1_598450_599191_-	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]	NA|402aa|down_0|NC_016048.1_600444_601650_+	PRK09550, mtnK, methylthioribose kinase; Reviewed	NA|351aa|down_1|NC_016048.1_601662_602715_+	PRK05720, mtnA, methylthioribose-1-phosphate isomerase; Reviewed	NA|216aa|down_2|NC_016048.1_602734_603382_+	PRK06833, PRK06833, L-fuculose-phosphate aldolase	NA|215aa|down_3|NC_016048.1_603644_604289_+	PRK05581, PRK05581, ribulose-phosphate 3-epimerase; Validated	NA|324aa|down_4|NC_016048.1_604301_605273_+	PRK03910, PRK03910, D-cysteine desulfhydrase; Validated	NA|200aa|down_5|NC_016048.1_605295_605895_+	PRK00076, recR, recombination protein RecR; Reviewed	NA|300aa|down_6|NC_016048.1_605948_606848_+	PRK00089, era, GTPase Era; Reviewed	NA|219aa|down_7|NC_016048.1_607313_607970_+	COG1045, CysE, Serine acetyltransferase [Amino acid transport and metabolism]	NA|483aa|down_8|NC_016048.1_607973_609422_+	PRK00260, cysS, cysteinyl-tRNA synthetase; Validated	NA|166aa|down_9|NC_016048.1_609449_609947_+	NA
GCF_000283575.1_ASM28357v1	NC_016048	Oscillibacter valericigenes Sjm18-20, complete genome	2	1015794-1015885	2	CRISPRCasFinder	no		csa3,WYL,RT,DinG,DEDDh,cas3	Orphan	GCGGCCTTTGCGGCTTTCTCGGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,RT,DinG,DEDDh,cas3	NA,NA	NA|161aa|up_9|NC_016048.1_1005165_1005648_-	pfam10925, DUF2680, Protein of unknown function (DUF2680)	NA|392aa|up_8|NC_016048.1_1006029_1007205_+	pfam01548, DEDD_Tnp_IS110, Transposase	NA|314aa|up_7|NC_016048.1_1007396_1008338_-	pfam00395, SLH, S-layer homology domain	NA|523aa|up_6|NC_016048.1_1008372_1009941_-	PRK07764, PRK07764, DNA polymerase III subunits gamma and tau; Validated	NA|142aa|up_5|NC_016048.1_1009937_1010363_-	PRK09415, PRK09415, RNA polymerase factor sigma C; Reviewed	NA|480aa|up_4|NC_016048.1_1010641_1012081_-	TIGR02966, Phosphate_regulon_sensor_protein_PhoR, phosphate regulon sensor kinase PhoR	NA|232aa|up_3|NC_016048.1_1012070_1012766_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|173aa|up_2|NC_016048.1_1013112_1013631_+	pfam13353, Fer4_12, 4Fe-4S single cluster domain	NA|291aa|up_1|NC_016048.1_1013750_1014623_-	smart00729, Elp3, Elongator protein 3, MiaB family, Radical SAM	NA|266aa|up_0|NC_016048.1_1014820_1015618_-	pfam01144, CoA_trans, Coenzyme A transferase	NA|284aa|down_0|NC_016048.1_1016926_1017778_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|323aa|down_1|NC_016048.1_1018069_1019038_+	cd12173, PGDH_4, Phosphoglycerate dehydrogenases, NAD-binding and catalytic domains	NA|262aa|down_2|NC_016048.1_1019411_1020197_+	COG1924, COG1924, Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) [Lipid metabolism]	NA|428aa|down_3|NC_016048.1_1020294_1021578_+	COG1775, HgdB, Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Amino acid transport and metabolism]	NA|381aa|down_4|NC_016048.1_1021574_1022717_+	pfam06050, HGD-D, 2-hydroxyglutaryl-CoA dehydratase, D-component	NA|590aa|down_5|NC_016048.1_1022808_1024578_+	COG4799, COG4799, Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism]	NA|98aa|down_6|NC_016048.1_1024615_1024909_+	pfam04277, OAD_gamma, Oxaloacetate decarboxylase, gamma chain	NA|137aa|down_7|NC_016048.1_1024968_1025379_+	PRK09282, PRK09282, pyruvate carboxylase subunit B; Validated	NA|389aa|down_8|NC_016048.1_1025398_1026565_+	pfam03977, OAD_beta, Na+-transporting oxaloacetate decarboxylase beta subunit	NA|268aa|down_9|NC_016048.1_1026834_1027638_+	COG0714, COG0714, MoxR-like ATPases [General function prediction only]
GCF_000283575.1_ASM28357v1	NC_016048	Oscillibacter valericigenes Sjm18-20, complete genome	3	2444388-2444503	3	CRISPRCasFinder	no		csa3,WYL,RT,DinG,DEDDh,cas3	Orphan	GCGGTGACATTGCCGGAGAGGCTGACGCTGTTCAT	35	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,RT,DinG,DEDDh,cas3	NA,NA|153aa|down_2|NC_016048.1_2449370_2449829_+	NA|347aa|up_9|NC_016048.1_2433772_2434813_-	TIGR00476, selenide_water_dikinase, selenium donor protein	NA|528aa|up_8|NC_016048.1_2435013_2436597_-	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|120aa|up_7|NC_016048.1_2437244_2437604_-	cd00851, MTH1175, This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme	NA|277aa|up_6|NC_016048.1_2437624_2438455_-	pfam10609, ParA, NUBPL iron-transfer P-loop NTPase	NA|276aa|up_5|NC_016048.1_2438471_2439299_-	cd07713, DHPS-like_MBL-fold, Methanocaldococcus jannaschii dihydropteroate synthase, Thermoanaerobacter tengcongensis Tflp, and related proteins; MBL-fold metallo hydrolase domain	NA|293aa|up_4|NC_016048.1_2439295_2440174_-	cd03110, SIMIBI_bact_arch, bacterial and archaeal subfamily of SIMIBI	NA|294aa|up_3|NC_016048.1_2440145_2441027_-	COG1149, COG1149, MinD superfamily P-loop ATPase containing an inserted ferredoxin domain [Energy production and conversion]	NA|122aa|up_2|NC_016048.1_2441016_2441382_-	COG1433, COG1433, Uncharacterized conserved protein [Function unknown]	NA|135aa|up_1|NC_016048.1_2441356_2441761_-	COG1342, COG1342, Predicted DNA-binding proteins [General function prediction only]	NA|86aa|up_0|NC_016048.1_2441821_2442079_-	pfam17253, DUF5320, Family of unknown function (DUF5320)	NA|211aa|down_0|NC_016048.1_2446436_2447069_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|623aa|down_1|NC_016048.1_2447487_2449356_+	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|153aa|down_2|NC_016048.1_2449370_2449829_+	NA	NA|513aa|down_3|NC_016048.1_2449915_2451454_-	pfam13699, DUF4157, Domain of unknown function (DUF4157)	NA|140aa|down_4|NC_016048.1_2451455_2451875_-	pfam13581, HATPase_c_2, Histidine kinase-like ATPase domain	NA|553aa|down_5|NC_016048.1_2451867_2453526_-	COG2208, RsbU, Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]	NA|680aa|down_6|NC_016048.1_2453522_2455562_-	smart00331, PP2C_SIG, Sigma factor PP2C-like phosphatases	NA|583aa|down_7|NC_016048.1_2455667_2457416_-	cd13143, MATE_MepA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Streptococcus aureus MepA	NA|98aa|down_8|NC_016048.1_2457431_2457725_-	cd07043, STAS_anti-anti-sigma_factors, Sulphate Transporter and Anti-Sigma factor antagonist) domain of anti-anti-sigma factors, key regulators of anti-sigma factors by phosphorylation	NA|716aa|down_9|NC_016048.1_2457749_2459897_-	COG0464, SpoVK, ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones]
GCF_000283575.1_ASM28357v1	NC_016048	Oscillibacter valericigenes Sjm18-20, complete genome	4	3592647-3592752	4	CRISPRCasFinder	no		csa3,WYL,RT,DinG,DEDDh,cas3	Orphan	CGTAGCTCAGCCGGATAGAGCGTCC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,WYL,RT,DinG,DEDDh,cas3	NA|174aa|up_8|NC_016048.1_3584677_3585199_-,NA	NA|585aa|up_9|NC_016048.1_3582906_3584661_-	COG1966, CstA, Carbon starvation protein, predicted membrane protein [Signal transduction mechanisms]	NA|174aa|up_8|NC_016048.1_3584677_3585199_-	NA	NA|228aa|up_7|NC_016048.1_3585579_3586263_-	COG2186, FadR, Transcriptional regulators [Transcription]	NA|229aa|up_6|NC_016048.1_3586468_3587155_+	TIGR01007, Tyrosine-protein_kinase_CpsD, capsular exopolysaccharide family	NA|71aa|up_5|NC_016048.1_3587194_3587407_+	TIGR02512, Periplasmic_hydrogenase_large_subunit, [FeFe] hydrogenase, group A	NA|357aa|up_4|NC_016048.1_3587426_3588497_+	PRK07119, PRK07119, 2-ketoisovalerate ferredoxin reductase; Validated	NA|250aa|up_3|NC_016048.1_3588498_3589248_+	cd03375, TPP_OGFOR, Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor	NA|179aa|up_2|NC_016048.1_3589249_3589786_+	pfam01558, POR, Pyruvate ferredoxin/flavodoxin oxidoreductase	NA|362aa|up_1|NC_016048.1_3590230_3591316_+	PRK13357, PRK13357, branched-chain amino acid aminotransferase; Provisional	NA|246aa|up_0|NC_016048.1_3591573_3592311_-	PRK05565, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional	NA|423aa|down_0|NC_016048.1_3593454_3594723_-	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]	NA|163aa|down_1|NC_016048.1_3594719_3595208_-	pfam04290, DctQ, Tripartite ATP-independent periplasmic transporters, DctQ component	NA|350aa|down_2|NC_016048.1_3595286_3596336_-	cd13676, PBP2_TRAP_DctP2_like, Substrate-binding component of Tripartite ATP-independent Periplasmic transporter DctP2 and related proteins;  the type 2 periplasmic-binding protein fold	NA|228aa|down_3|NC_016048.1_3596338_3597022_-	pfam11010, DUF2848, Protein of unknown function (DUF2848)	NA|263aa|down_4|NC_016048.1_3597101_3597890_-	cd07583, nitrilase_5, Uncharacterized subgroup of the nitrilase superfamily (putative class 13 nitrilases)	NA|222aa|down_5|NC_016048.1_3598206_3598872_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|168aa|down_6|NC_016048.1_3599070_3599574_-	cd01189, INT_ICEBs1_C_like, C-terminal catalytic domain of integrases from bacterial phages and conjugate transposons	NA|144aa|down_7|NC_016048.1_3599678_3600110_+	pfam05598, DUF772, Transposase domain (DUF772)	NA|61aa|down_8|NC_016048.1_3600186_3600369_+	pfam03432, Relaxase, Relaxase/Mobilisation nuclease domain	NA|291aa|down_9|NC_016048.1_3600414_3601287_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed
