assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_011300215.1_ASM1130021v1	NZ_CP049866	Nocardioides sp. HDW12A chromosome, complete genome	1	272406-272492	1	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,WYL,cas4	Orphan	TCGTCCCTCACTCGACTCTGTTGCCCGCGG	30	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,WYL,cas4	NA|122aa|up_9|NZ_CP049866.1_262817_263183_-,NA|104aa|up_8|NZ_CP049866.1_263219_263531_-,NA|114aa|up_7|NZ_CP049866.1_263682_264024_+,NA|223aa|up_6|NZ_CP049866.1_264052_264721_+,NA|413aa|up_5|NZ_CP049866.1_264846_266085_+,NA|187aa|up_2|NZ_CP049866.1_268000_268561_+,NA|557aa|down_7|NZ_CP049866.1_280813_282484_+,NA|377aa|down_8|NZ_CP049866.1_282799_283930_+,NA|310aa|down_9|NZ_CP049866.1_284116_285046_-	NA|122aa|up_9|NZ_CP049866.1_262817_263183_-	NA	NA|104aa|up_8|NZ_CP049866.1_263219_263531_-	NA	NA|114aa|up_7|NZ_CP049866.1_263682_264024_+	NA	NA|223aa|up_6|NZ_CP049866.1_264052_264721_+	NA	NA|413aa|up_5|NZ_CP049866.1_264846_266085_+	NA	NA|213aa|up_4|NZ_CP049866.1_266098_266737_+	cd05829, Sortase_F, Sortase domain found in the class F family of sortases	NA|363aa|up_3|NZ_CP049866.1_266915_268004_+	PRK08224, ligC, ATP-dependent DNA ligase; Reviewed	NA|187aa|up_2|NZ_CP049866.1_268000_268561_+	NA	NA|869aa|up_1|NZ_CP049866.1_268637_271244_+	cd03820, GT4_AmsD-like, amylovoran biosynthesis glycosyltransferase AmsD and similar proteins	NA|336aa|up_0|NZ_CP049866.1_271223_272231_-	COG3551, COG3551, Uncharacterized protein conserved in bacteria [Function unknown]	NA|338aa|down_0|NZ_CP049866.1_273574_274588_-	cd05288, PGDH, Prostaglandin dehydrogenases	NA|405aa|down_1|NZ_CP049866.1_274926_276141_-	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|358aa|down_2|NZ_CP049866.1_276153_277227_-	cd06216, FNR_iron_sulfur_binding_2, Iron-sulfur binding ferredoxin reductase (FNR) proteins combine the FAD and NAD(P) binding regions of FNR with an iron-sulfur binding cluster domain	NA|135aa|down_3|NZ_CP049866.1_277330_277735_+	pfam01641, SelR, SelR domain	NA|443aa|down_4|NZ_CP049866.1_277661_278990_+	TIGR03026, NDP-sugDHase, nucleotide sugar dehydrogenase	NA|261aa|down_5|NZ_CP049866.1_278986_279769_+	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|344aa|down_6|NZ_CP049866.1_279765_280797_+	COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|557aa|down_7|NZ_CP049866.1_280813_282484_+	NA	NA|377aa|down_8|NZ_CP049866.1_282799_283930_+	NA	NA|310aa|down_9|NZ_CP049866.1_284116_285046_-	NA
GCF_011300215.1_ASM1130021v1	NZ_CP049866	Nocardioides sp. HDW12A chromosome, complete genome	2	467802-467876	2	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,WYL,cas4	Orphan	CCTGAACTTCCACGCAGGTGCGT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,WYL,cas4	NA|125aa|up_9|NZ_CP049866.1_456039_456414_+,NA|280aa|up_1|NZ_CP049866.1_464553_465393_+,NA|238aa|down_0|NZ_CP049866.1_467943_468657_-,NA|87aa|down_1|NZ_CP049866.1_468743_469004_-,NA|49aa|down_2|NZ_CP049866.1_469152_469299_+,NA|142aa|down_9|NZ_CP049866.1_477360_477786_-	NA|125aa|up_9|NZ_CP049866.1_456039_456414_+	NA	NA|238aa|up_8|NZ_CP049866.1_456394_457108_-	pfam04172, LrgB, LrgB-like family	NA|113aa|up_7|NZ_CP049866.1_457104_457443_-	pfam03788, LrgA, LrgA family	NA|190aa|up_6|NZ_CP049866.1_458143_458713_+	pfam02342, TerD, TerD domain	NA|332aa|up_5|NZ_CP049866.1_458767_459763_-	cd05240, UDP_G4E_3_SDR_e, UDP-glucose 4 epimerase (G4E), subgroup 3, extended (e) SDRs	NA|329aa|up_4|NZ_CP049866.1_459759_460746_-	COG1741, COG1741, Pirin-related protein [General function prediction only]	NA|375aa|up_3|NZ_CP049866.1_460847_461972_+	cd03814, GT4-like, glycosyltransferase family 4 proteins	NA|363aa|up_2|NZ_CP049866.1_462010_463099_+	PRK10605, PRK10605, N-ethylmaleimide reductase; Provisional	NA|280aa|up_1|NZ_CP049866.1_464553_465393_+	NA	NA|403aa|up_0|NZ_CP049866.1_465392_466601_+	TIGR01266, fum_ac_acetase, fumarylacetoacetase	NA|238aa|down_0|NZ_CP049866.1_467943_468657_-	NA	NA|87aa|down_1|NZ_CP049866.1_468743_469004_-	NA	NA|49aa|down_2|NZ_CP049866.1_469152_469299_+	NA	NA|164aa|down_3|NZ_CP049866.1_469810_470302_+	pfam04461, DUF520, Protein of unknown function (DUF520)	NA|328aa|down_4|NZ_CP049866.1_470316_471300_-	cd03296, ABC_CysA_sulfate_importer, ATP-binding cassette domain of the sulfate transporter	NA|293aa|down_5|NZ_CP049866.1_472114_472993_-	TIGR02139, permease_CysT, sulfate ABC transporter, permease protein CysT	NA|352aa|down_6|NZ_CP049866.1_472996_474052_-	cd01005, PBP2_CysP, Substrate binding domain of an active sulfate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|642aa|down_7|NZ_CP049866.1_474327_476253_+	TIGR03710, OAFO_sf, 2-oxoacid:acceptor oxidoreductase, alpha subunit	NA|366aa|down_8|NZ_CP049866.1_476249_477347_+	PRK11867, PRK11867, 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed	NA|142aa|down_9|NZ_CP049866.1_477360_477786_-	NA
GCF_011300215.1_ASM1130021v1	NZ_CP049866	Nocardioides sp. HDW12A chromosome, complete genome	3	1247536-1247685	1	CRT	no		cas3,DinG,DEDDh,csa3,WYL,cas4	Orphan	CGTCGGCACCGCCGTCCG	18	1	2	1247602-1247619|1247602-1247619	NZ_CP049866.1_173689-173706|NZ_CP049866.1_637570-637587	NA	3	3	Orphan	cas3,DinG,DEDDh,csa3,WYL,cas4	NA|209aa|up_2|NZ_CP049866.1_1244755_1245382_-,NA|88aa|down_0|NZ_CP049866.1_1249402_1249666_-,NA|545aa|down_4|NZ_CP049866.1_1254094_1255729_+	NA|252aa|up_9|NZ_CP049866.1_1236590_1237346_-	cd05328, 3alpha_HSD_SDR_c, alpha hydroxysteroid dehydrogenase (3alpha_HSD), classical (c) SDRs	NA|494aa|up_8|NZ_CP049866.1_1237677_1239159_-	COG0777, AccD, Acetyl-CoA carboxylase beta subunit [Lipid metabolism]	NA|504aa|up_7|NZ_CP049866.1_1239160_1240672_-	cd09603, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|183aa|up_6|NZ_CP049866.1_1240668_1241217_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|493aa|up_5|NZ_CP049866.1_1241237_1242716_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|331aa|up_4|NZ_CP049866.1_1242712_1243705_+	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|346aa|up_3|NZ_CP049866.1_1243714_1244752_+	pfam13704, Glyco_tranf_2_4, Glycosyl transferase family 2	NA|209aa|up_2|NZ_CP049866.1_1244755_1245382_-	NA	NA|292aa|up_1|NZ_CP049866.1_1245387_1246263_-	COG4759, COG4759, Uncharacterized protein conserved in bacteria containing thioredoxin-like domain [Posttranslational modification, protein turnover, chaperones]	NA|414aa|up_0|NZ_CP049866.1_1246259_1247501_-	pfam08007, Cupin_4, Cupin superfamily protein	NA|88aa|down_0|NZ_CP049866.1_1249402_1249666_-	NA	NA|429aa|down_1|NZ_CP049866.1_1249749_1251036_-	pfam13556, HTH_30, PucR C-terminal helix-turn-helix domain	NA|377aa|down_2|NZ_CP049866.1_1251034_1252165_+	cd06216, FNR_iron_sulfur_binding_2, Iron-sulfur binding ferredoxin reductase (FNR) proteins combine the FAD and NAD(P) binding regions of FNR with an iron-sulfur binding cluster domain	NA|415aa|down_3|NZ_CP049866.1_1252190_1253435_+	cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria	NA|545aa|down_4|NZ_CP049866.1_1254094_1255729_+	NA	NA|118aa|down_5|NZ_CP049866.1_1255833_1256187_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|362aa|down_6|NZ_CP049866.1_1256201_1257287_+	PRK10856, PRK10856, cytoskeleton protein RodZ	NA|308aa|down_7|NZ_CP049866.1_1257292_1258216_-	PRK06197, PRK06197, short chain dehydrogenase; Provisional	NA|273aa|down_8|NZ_CP049866.1_1258212_1259031_-	TIGR03083, TIGR03083, uncharacterized Actinobacterial protein TIGR03083	NA|202aa|down_9|NZ_CP049866.1_1259084_1259690_-	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]
GCF_011300215.1_ASM1130021v1	NZ_CP049866	Nocardioides sp. HDW12A chromosome, complete genome	4	1356440-1356684	3	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,WYL,cas4	Orphan	GTCCTCGACTCCGCCGGCAACGT	23	0	0	NA	NA	NA	4	4	Orphan	cas3,DinG,DEDDh,csa3,WYL,cas4	NA|183aa|up_7|NZ_CP049866.1_1345973_1346522_-,NA|106aa|down_5|NZ_CP049866.1_1365762_1366080_-	NA|223aa|up_9|NZ_CP049866.1_1344576_1345245_+	cd18096, SpoU-like, SAM-dependent rRNA or tRNA methylase related to SpoU	NA|166aa|up_8|NZ_CP049866.1_1345479_1345977_-	TIGR02983, putative_RNA_polymerase_ECF-subfamily_sigma_factor, RNA polymerase sigma-70 factor, sigma-E family	NA|183aa|up_7|NZ_CP049866.1_1345973_1346522_-	NA	NA|702aa|up_6|NZ_CP049866.1_1346985_1349091_-	cd03859, M14_CPT, Peptidase M14 Carboxypeptidase T subfamily	NA|184aa|up_5|NZ_CP049866.1_1349181_1349733_-	TIGR02983, putative_RNA_polymerase_ECF-subfamily_sigma_factor, RNA polymerase sigma-70 factor, sigma-E family	NA|345aa|up_4|NZ_CP049866.1_1349820_1350855_+	PRK09197, PRK09197, fructose-bisphosphate aldolase; Provisional	NA|138aa|up_3|NZ_CP049866.1_1350977_1351391_-	pfam11349, DUF3151, Protein of unknown function (DUF3151)	NA|344aa|up_2|NZ_CP049866.1_1351607_1352639_+	pfam13338, AbiEi_4, Transcriptional regulator, AbiEi antitoxin	NA|568aa|up_1|NZ_CP049866.1_1352699_1354403_-	PRK00750, lysK, lysyl-tRNA synthetase; Reviewed	NA|310aa|up_0|NZ_CP049866.1_1354651_1355581_+	cd06170, LuxR_C_like, C-terminal DNA-binding domain of LuxR-like proteins	NA|302aa|down_0|NZ_CP049866.1_1359782_1360688_-	COG1597, LCB5, Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only]	NA|428aa|down_1|NZ_CP049866.1_1360831_1362115_+	PRK01117, PRK01117, adenylosuccinate synthetase; Provisional	NA|269aa|down_2|NZ_CP049866.1_1362166_1362973_+	PRK07825, PRK07825, short chain dehydrogenase; Provisional	NA|428aa|down_3|NZ_CP049866.1_1363046_1364330_+	PRK00885, PRK00885, phosphoribosylamine--glycine ligase; Provisional	NA|473aa|down_4|NZ_CP049866.1_1364331_1365750_+	cd03302, Adenylsuccinate_lyase_2, Adenylsuccinate lyase (ASL)_subgroup 2	NA|106aa|down_5|NZ_CP049866.1_1365762_1366080_-	NA	NA|472aa|down_6|NZ_CP049866.1_1366120_1367536_-	TIGR00711, Uncharacterized_MFS-type_transporter_YhcA, drug resistance transporter, EmrB/QacA subfamily	NA|189aa|down_7|NZ_CP049866.1_1367612_1368179_+	COG3224, COG3224, Uncharacterized protein conserved in bacteria [Function unknown]	NA|299aa|down_8|NZ_CP049866.1_1368200_1369097_+	PRK13961, PRK13961, phosphoribosylaminoimidazole-succinocarboxamide synthase; Provisional	NA|145aa|down_9|NZ_CP049866.1_1369096_1369531_+	cd08865, SRPBCC_10, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins
