assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_010119915.1_ASM1011991v1	NZ_CP048210	Cellulomonas sp. H30R-01 chromosome, complete genome	1	347304-347532	1	CRISPRCasFinder	no		cas3,DEDDh,DinG,csa3,cas4,WYL	Orphan	CGTCGTCGGCGGGGACGCCGTCGTG	25	0	0	NA	NA	NA	3	3	Orphan	cas3,DEDDh,DinG,csa3,cas4,WYL	NA|235aa|up_5|NZ_CP048210.1_341389_342094_-,NA|231aa|up_4|NZ_CP048210.1_342090_342783_-,NA|162aa|up_3|NZ_CP048210.1_342939_343425_+,NA|77aa|up_2|NZ_CP048210.1_343503_343734_+,NA	NA|415aa|up_9|NZ_CP048210.1_336976_338221_+	COG4603, COG4603, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|399aa|up_8|NZ_CP048210.1_338319_339516_+	COG1079, COG1079, Uncharacterized ABC-type transport system, permease component [General function prediction only]	NA|138aa|up_7|NZ_CP048210.1_339565_339979_+	PRK05578, PRK05578, cytidine deaminase; Validated	NA|434aa|up_6|NZ_CP048210.1_340013_341315_+	PRK05820, deoA, thymidine phosphorylase; Reviewed	NA|235aa|up_5|NZ_CP048210.1_341389_342094_-	NA	NA|231aa|up_4|NZ_CP048210.1_342090_342783_-	NA	NA|162aa|up_3|NZ_CP048210.1_342939_343425_+	NA	NA|77aa|up_2|NZ_CP048210.1_343503_343734_+	NA	NA|393aa|up_1|NZ_CP048210.1_343789_344968_+	PRK09358, PRK09358, adenosine deaminase; Provisional	NA|228aa|up_0|NZ_CP048210.1_344964_345648_+	PRK00507, PRK00507, deoxyribose-phosphate aldolase; Provisional	NA|617aa|down_0|NZ_CP048210.1_348307_350158_-	cd05799, PGM2, This CD includes PGM2 (phosphoglucomutase 2) and PGM2L1 (phosphoglucomutase 2-like 1)	NA|284aa|down_1|NZ_CP048210.1_350304_351156_-	PRK08202, PRK08202, purine nucleoside phosphorylase; Provisional	NA|487aa|down_2|NZ_CP048210.1_351287_352748_+	PRK07845, PRK07845, flavoprotein disulfide reductase; Reviewed	NA|618aa|down_3|NZ_CP048210.1_352812_354666_-	COG3568, ElsH, Metal-dependent hydrolase [General function prediction only]	NA|469aa|down_4|NZ_CP048210.1_355009_356416_-	TIGR01412, Probable_deferrochelatase/peroxidase_EfeN, Tat-translocated enzyme	NA|400aa|down_5|NZ_CP048210.1_356423_357623_-	cd14656, Imelysin-like_EfeO, EfeO is a component of the EfeUOB operon	NA|301aa|down_6|NZ_CP048210.1_357689_358592_-	pfam03239, FTR1, Iron permease FTR1 family	NA|207aa|down_7|NZ_CP048210.1_358705_359326_-	cd02883, Nudix_Hydrolase, Nudix hydrolase is a superfamily of enzymes found in all three kingdoms of life, and it catalyzes the hydrolysis of NUcleoside DIphosphates linked to other moieties, X	NA|605aa|down_8|NZ_CP048210.1_359329_361144_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|226aa|down_9|NZ_CP048210.1_361333_362011_-	PRK00148, PRK00148, Maf-like protein; Reviewed
GCF_010119915.1_ASM1011991v1	NZ_CP048210	Cellulomonas sp. H30R-01 chromosome, complete genome	2	1006946-1007079	2	CRISPRCasFinder	no		cas3,DEDDh,DinG,csa3,cas4,WYL	Orphan	GGCAACAACCCGTTCGCGCCCTCGCAGGGCATGCCGCG	38	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,csa3,cas4,WYL	NA,NA|275aa|down_9|NZ_CP048210.1_1022209_1023034_+	NA|441aa|up_9|NZ_CP048210.1_994973_996296_-	TIGR03329, Phn_aa_oxid, putative aminophosphonate oxidoreductase	NA|437aa|up_8|NZ_CP048210.1_996507_997818_+	cd06163, S2P-M50_PDZ_RseP-like, RseP-like Site-2 proteases (S2P), zinc metalloproteases (MEROPS family M50A), cleave transmembrane domains of substrate proteins, regulating intramembrane proteolysis (RIP) of diverse signal transduction mechanisms	NA|387aa|up_7|NZ_CP048210.1_997926_999087_+	PRK00366, ispG, flavodoxin-dependent (E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase	NA|303aa|up_6|NZ_CP048210.1_999160_1000069_+	pfam13312, DUF4081, Domain of unknown function (DUF4081)	NA|297aa|up_5|NZ_CP048210.1_1000173_1001064_-	COG0266, Nei, Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]	NA|598aa|up_4|NZ_CP048210.1_1001144_1002938_+	PRK09194, PRK09194, prolyl-tRNA synthetase; Provisional	NA|370aa|up_3|NZ_CP048210.1_1003010_1004120_-	pfam14530, DUF4439, Domain of unknown function (DUF4439)	NA|194aa|up_2|NZ_CP048210.1_1004260_1004842_+	PRK00092, PRK00092, ribosome maturation protein RimP; Reviewed	NA|358aa|up_1|NZ_CP048210.1_1004843_1005917_+	PRK12327, nusA, transcription elongation factor NusA; Provisional	NA|122aa|up_0|NZ_CP048210.1_1005913_1006279_+	pfam04296, DUF448, Protein of unknown function (DUF448)	NA|148aa|down_0|NZ_CP048210.1_1009487_1009931_+	PRK00521, rbfA, 30S ribosome-binding factor RbfA	NA|491aa|down_1|NZ_CP048210.1_1010046_1011519_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|179aa|down_2|NZ_CP048210.1_1011636_1012173_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	NA|316aa|down_3|NZ_CP048210.1_1012169_1013117_+	PRK03287, truB, tRNA pseudouridine synthase B; Provisional	NA|1103aa|down_4|NZ_CP048210.1_1013347_1016656_+	pfam00759, Glyco_hydro_9, Glycosyl hydrolase family 9	NA|347aa|down_5|NZ_CP048210.1_1016831_1017872_+	PRK05627, PRK05627, bifunctional riboflavin kinase/FAD synthetase	NA|90aa|down_6|NZ_CP048210.1_1018008_1018278_+	PRK05626, rpsO, 30S ribosomal protein S15; Reviewed	NA|746aa|down_7|NZ_CP048210.1_1018532_1020770_+	TIGR02696, polyribonucleotide_nucleotidyltransferase, guanosine pentaphosphate synthetase I/polynucleotide phosphorylase	NA|449aa|down_8|NZ_CP048210.1_1020771_1022118_+	COG0612, PqqL, Predicted Zn-dependent peptidases [General function prediction only]	NA|275aa|down_9|NZ_CP048210.1_1022209_1023034_+	NA
GCF_010119915.1_ASM1011991v1	NZ_CP048210	Cellulomonas sp. H30R-01 chromosome, complete genome	3	1899709-1899789	3	CRISPRCasFinder	no		cas3,DEDDh,DinG,csa3,cas4,WYL	Orphan	GACGCCGCTGTTGGTCTCCTCGAGCTC	27	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,DinG,csa3,cas4,WYL	NA,NA	NA|712aa|up_9|NZ_CP048210.1_1883506_1885642_-	PRK05667, dnaG, DNA primase; Validated	NA|429aa|up_8|NZ_CP048210.1_1885717_1887004_-	PRK03007, PRK03007, deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional	NA|260aa|up_7|NZ_CP048210.1_1887072_1887852_-	pfam02522, Antibiotic_NAT, Aminoglycoside 3-N-acetyltransferase	NA|615aa|up_6|NZ_CP048210.1_1887848_1889693_-	cd11338, AmyAc_CMD, Alpha amylase catalytic domain found in cyclomaltodextrinases and related proteins	NA|309aa|up_5|NZ_CP048210.1_1889689_1890616_-	COG3833, MalG, ABC-type maltose transport systems, permease component [Carbohydrate transport and metabolism]	NA|542aa|up_4|NZ_CP048210.1_1890615_1892241_-	PRK10999, malF, maltose ABC transporter permease MalF	NA|415aa|up_3|NZ_CP048210.1_1892407_1893652_-	cd13586, PBP2_Maltose_binding_like, The periplasmic-binding component of ABC transport systems specific for maltose and related polysaccharides; possess type 2 periplasmic binding fold	NA|369aa|up_2|NZ_CP048210.1_1893844_1894951_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|592aa|up_1|NZ_CP048210.1_1895096_1896872_+	cd11332, AmyAc_OligoGlu_TS, Alpha amylase catalytic domain found in oligo-1,6-glucosidase (also called isomaltase; sucrase-isomaltase; alpha-limit dextrinase), trehalose synthase (also called maltose alpha-D-glucosyltransferase), and related proteins	NA|531aa|up_0|NZ_CP048210.1_1897027_1898620_-	pfam07228, SpoIIE, Stage II sporulation protein E (SpoIIE)	NA|344aa|down_0|NZ_CP048210.1_1900328_1901360_-	cd16934, HATPase_RsbT-like, Histidine kinase-like ATPase domain of the anti sigma-B factor Bacillus subtilis serine/threonine-protein kinase RsbT, and related domains	NA|138aa|down_1|NZ_CP048210.1_1901347_1901761_-	cd16934, HATPase_RsbT-like, Histidine kinase-like ATPase domain of the anti sigma-B factor Bacillus subtilis serine/threonine-protein kinase RsbT, and related domains	NA|135aa|down_2|NZ_CP048210.1_1901757_1902162_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|286aa|down_3|NZ_CP048210.1_1902161_1903019_-	cd07041, STAS_RsbR_RsbS_like, Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation	NA|177aa|down_4|NZ_CP048210.1_1903158_1903689_-	smart00632, Aamy_C, Aamy_C domain	NA|302aa|down_5|NZ_CP048210.1_1903727_1904633_+	cd01942, ribokinase_group_A, Ribokinase-like subgroup A	NA|379aa|down_6|NZ_CP048210.1_1904638_1905775_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|397aa|down_7|NZ_CP048210.1_1905841_1907032_-	TIGR00737, Probable_tRNA-dihydrouridine_synthase, putative TIM-barrel protein, nifR3 family	NA|207aa|down_8|NZ_CP048210.1_1907201_1907822_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|416aa|down_9|NZ_CP048210.1_1907818_1909066_-	COG4585, COG4585, Signal transduction histidine kinase [Signal transduction mechanisms]
GCF_010119915.1_ASM1011991v1	NZ_CP048210	Cellulomonas sp. H30R-01 chromosome, complete genome	4	2813025-2813404	4	CRISPRCasFinder	no		cas3,DEDDh,DinG,csa3,cas4,WYL	Orphan	CGCCGGAGCCGTCGCCGTAGTTC	23	0	0	NA	NA	NA	7	7	Orphan	cas3,DEDDh,DinG,csa3,cas4,WYL	NA,NA|367aa|down_3|NZ_CP048210.1_2825024_2826125_+	NA|250aa|up_9|NZ_CP048210.1_2802703_2803453_-	pfam12158, DUF3592, Protein of unknown function (DUF3592)	NA|635aa|up_8|NZ_CP048210.1_2803653_2805558_-	pfam07944, Glyco_hydro_127, Beta-L-arabinofuranosidase, GH127	NA|275aa|up_7|NZ_CP048210.1_2805554_2806379_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|318aa|up_6|NZ_CP048210.1_2806392_2807346_-	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|441aa|up_5|NZ_CP048210.1_2807456_2808779_-	cd13585, PBP2_TMBP_like, The periplasmic-binding component of ABC transport systems specific for trehalose/maltose and similar oligosaccharides; possess type 2 periplasmic binding fold	NA|342aa|up_4|NZ_CP048210.1_2808987_2810013_+	COG1609, PurR, Transcriptional regulators [Transcription]	NA|322aa|up_3|NZ_CP048210.1_2810035_2811001_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|239aa|up_2|NZ_CP048210.1_2810997_2811714_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|95aa|up_1|NZ_CP048210.1_2811731_2812016_-	pfam13828, DUF4190, Domain of unknown function (DUF4190)	NA|189aa|up_0|NZ_CP048210.1_2812012_2812579_-	pfam11259, DUF3060, Protein of unknown function (DUF3060)	NA|1217aa|down_0|NZ_CP048210.1_2813919_2817570_-	cd08990, GH43_AXH_like, Glycosyl hydrolase family 43 protein, includes arabinoxylan arabinofuranohydrolase, beta-xylosidase, endo-1,4-beta-xylanase, and alpha-L-arabinofuranosidase	NA|2071aa|down_1|NZ_CP048210.1_2817572_2823785_-	pfam07532, Big_4, Bacterial Ig-like domain (group 4)	NA|271aa|down_2|NZ_CP048210.1_2824040_2824853_+	cd06193, siderophore_interacting, Siderophore interacting proteins share the domain structure of the ferredoxin reductase like family	NA|367aa|down_3|NZ_CP048210.1_2825024_2826125_+	NA	NA|370aa|down_4|NZ_CP048210.1_2826264_2827374_+	COG3214, COG3214, Uncharacterized protein conserved in bacteria [Function unknown]	NA|112aa|down_5|NZ_CP048210.1_2827423_2827759_+	PRK02237, PRK02237, YnfA family protein	NA|144aa|down_6|NZ_CP048210.1_2828031_2828463_+	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|209aa|down_7|NZ_CP048210.1_2828477_2829104_-	pfam13649, Methyltransf_25, Methyltransferase domain	NA|160aa|down_8|NZ_CP048210.1_2829272_2829752_-	COG1247, COG1247, Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane]	NA|329aa|down_9|NZ_CP048210.1_2829846_2830833_-	PRK08241, PRK08241, RNA polymerase subunit sigma-70
