assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000024125.1_ASM2412v1	NC_013222	Robiginitalea biformata HTCC2501, complete sequence	1	307767-307882	1	CRISPRCasFinder	no		DEDDh,csa3,WYL,PrimPol	Orphan	ATTGCTTGCCTGGCGGCAAGCTTCCCTACTCGCTC	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,PrimPol	NA,NA	NA|222aa|up_9|NC_013222.1_297339_298005_+	cd06196, FNR_like_1, Ferredoxin reductase-like proteins catalyze electron transfer between an NAD(P)-binding domain of the alpha/beta class and a discrete (usually N-terminal) domain which varies in orientation with respect to the NAD(P) binding domain	NA|229aa|up_8|NC_013222.1_298006_298693_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|175aa|up_7|NC_013222.1_298743_299268_+	pfam13628, DUF4142, Domain of unknown function (DUF4142)	NA|114aa|up_6|NC_013222.1_299267_299609_+	cd13921, Amicyanin, Amicyanin is a type I blue copper protein that plays an essential role in electron transfer	NA|451aa|up_5|NC_013222.1_299682_301035_+	TIGR03791, TTQ_mauG, tryptophan tryptophylquinone biosynthesis enzyme MauG	NA|266aa|up_4|NC_013222.1_301242_302040_+	pfam02517, Abi, CAAX protease self-immunity	NA|1057aa|up_3|NC_013222.1_302036_305207_+	TIGR03144, cytochrome_c_biogenesis_protein_chloroplast, cytochrome c-type biogenesis protein CcsB	NA|239aa|up_2|NC_013222.1_305250_305967_-	cd06196, FNR_like_1, Ferredoxin reductase-like proteins catalyze electron transfer between an NAD(P)-binding domain of the alpha/beta class and a discrete (usually N-terminal) domain which varies in orientation with respect to the NAD(P) binding domain	NA|311aa|up_1|NC_013222.1_306391_307324_+	pfam14321, DUF4382, Domain of unknown function (DUF4382)	NA|62aa|up_0|NC_013222.1_307519_307705_+	PRK10428, PRK10428, hypothetical protein; Provisional	NA|379aa|down_0|NC_013222.1_308061_309198_+	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|604aa|down_1|NC_013222.1_310238_312050_-	pfam16576, HlyD_D23, Barrel-sandwich domain of CusB or HlyD membrane-fusion	NA|163aa|down_2|NC_013222.1_312078_312567_-	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|236aa|down_3|NC_013222.1_312591_313299_-	COG3182, PiuB, Uncharacterized iron-regulated membrane protein [Function unknown]	NA|136aa|down_4|NC_013222.1_313300_313708_-	pfam11138, DUF2911, Protein of unknown function (DUF2911)	NA|251aa|down_5|NC_013222.1_313734_314487_-	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|409aa|down_6|NC_013222.1_314489_315716_-	pfam03773, ArsP_1, Predicted permease	NA|157aa|down_7|NC_013222.1_315729_316200_-	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|186aa|down_8|NC_013222.1_316204_316762_-	pfam07885, Ion_trans_2, Ion channel	NA|502aa|down_9|NC_013222.1_316758_318264_-	cd02808, GltS_FMN, Glutamate synthase (GltS) FMN-binding domain
GCF_000024125.1_ASM2412v1	NC_013222	Robiginitalea biformata HTCC2501, complete sequence	2	440758-440871	2	CRISPRCasFinder	no		DEDDh,csa3,WYL,PrimPol	Orphan	TGTATTTTCAGAATGGAGCCGCCAGGAACTGCGGAGC	37	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,PrimPol	NA|165aa|up_9|NC_013222.1_431451_431946_-,NA|103aa|up_6|NC_013222.1_434400_434709_-,NA|236aa|up_2|NC_013222.1_438568_439276_+,NA|148aa|down_1|NC_013222.1_443303_443747_+	NA|165aa|up_9|NC_013222.1_431451_431946_-	NA	NA|317aa|up_8|NC_013222.1_432008_432959_+	cd05272, TDH_SDR_e, L-threonine dehydrogenase, extended (e) SDRs	NA|458aa|up_7|NC_013222.1_432946_434320_-	PRK10446, PRK10446, 30S ribosomal protein S6--L-glutamate ligase	NA|103aa|up_6|NC_013222.1_434400_434709_-	NA	NA|325aa|up_5|NC_013222.1_434711_435686_-	pfam01916, DS, Deoxyhypusine synthase	NA|291aa|up_4|NC_013222.1_435742_436615_-	cd11593, Agmatinase-like_2, Agmatinase and related proteins	NA|488aa|up_3|NC_013222.1_436668_438132_-	cd06830, PLPDE_III_ADC, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Arginine Decarboxylase	NA|236aa|up_2|NC_013222.1_438568_439276_+	NA	NA|145aa|up_1|NC_013222.1_439382_439817_-	PRK03604, moaC, bifunctional molybdenum cofactor biosynthesis protein MoaC/MogA; Provisional	NA|156aa|up_0|NC_013222.1_439836_440304_-	cd00756, MoaE, MoaE family	NA|222aa|down_0|NC_013222.1_442098_442764_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|148aa|down_1|NC_013222.1_443303_443747_+	NA	NA|183aa|down_2|NC_013222.1_443743_444292_+	pfam11611, DUF4352, Domain of unknown function (DUF4352)	NA|177aa|down_3|NC_013222.1_444539_445070_+	pfam18735, HEPN_RiboL-PSP, RiboL-PSP-HEPN	NA|107aa|down_4|NC_013222.1_445062_445383_+	pfam07411, DUF1508, Domain of unknown function (DUF1508)	NA|325aa|down_5|NC_013222.1_445945_446920_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|700aa|down_6|NC_013222.1_446955_449055_-	COG5616, COG5616, Predicted integral membrane protein [Function unknown]	NA|190aa|down_7|NC_013222.1_449236_449806_+	sd00045, ANK, ankyrin repeats	NA|368aa|down_8|NC_013222.1_449826_450930_+	PRK03854, opgC, glucans biosynthesis protein MdoC	NA|208aa|down_9|NC_013222.1_450956_451580_-	cd02199, YjgF_YER057c_UK114_like_1, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function
GCF_000024125.1_ASM2412v1	NC_013222	Robiginitalea biformata HTCC2501, complete sequence	3	921305-921681	3	CRISPRCasFinder	no		DEDDh,csa3,WYL,PrimPol	Orphan	CATCGTCCTTGTCGGCAACGCCGTCTCCGTCAGCATCGGGGCATCCG	47	0	0	NA	NA	NA	4	4	Orphan	DEDDh,csa3,WYL,PrimPol	NA|143aa|up_2|NC_013222.1_916562_916991_+,NA	NA|253aa|up_9|NC_013222.1_909748_910507_-	cd07398, MPP_YbbF-LpxH, Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain	NA|152aa|up_8|NC_013222.1_910576_911032_-	pfam01242, PTPS, 6-pyruvoyl tetrahydropterin synthase	NA|620aa|up_7|NC_013222.1_911104_912964_+	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|258aa|up_6|NC_013222.1_913048_913822_-	COG1024, CaiD, Enoyl-CoA hydratase/carnithine racemase [Lipid metabolism]	NA|445aa|up_5|NC_013222.1_913837_915172_-	cd13136, MATE_DinF_like, DinF and similar proteins, a subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins	NA|171aa|up_4|NC_013222.1_915168_915681_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|266aa|up_3|NC_013222.1_915735_916533_+	pfam00582, Usp, Universal stress protein family	NA|143aa|up_2|NC_013222.1_916562_916991_+	NA	NA|237aa|up_1|NC_013222.1_917375_918086_+	PRK07326, PRK07326, SDR family oxidoreductase	NA|912aa|up_0|NC_013222.1_918089_920825_-	pfam12705, PDDEXK_1, PD-(D/E)XK nuclease superfamily	NA|398aa|down_0|NC_013222.1_922365_923559_-	PRK06939, PRK06939, 2-amino-3-ketobutyrate coenzyme A ligase; Provisional	NA|1040aa|down_1|NC_013222.1_923561_926681_-	COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair]	NA|203aa|down_2|NC_013222.1_926794_927403_+	COG0605, SodA, Superoxide dismutase [Inorganic ion transport and metabolism]	NA|633aa|down_3|NC_013222.1_928179_930078_-	COG0034, PurF, Glutamine phosphoribosylpyrophosphate amidotransferase [Nucleotide transport and metabolism]	NA|309aa|down_4|NC_013222.1_930116_931043_-	cd01946, ribokinase_group_C, Ribokinase-like subgroup C	NA|212aa|down_5|NC_013222.1_931148_931784_-	cd13935, RNase_H_bacteria_like, RNase H is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a sequence non-specific manner	NA|193aa|down_6|NC_013222.1_931776_932355_-	cd08645, FMT_core_GART, Phosphoribosylglycinamide formyltransferase (GAR transformylase, GART)	NA|78aa|down_7|NC_013222.1_932507_932741_+	PRK00982, acpP, acyl carrier protein; Provisional	NA|417aa|down_8|NC_013222.1_932853_934104_+	TIGR03150, fabF, beta-ketoacyl-acyl-carrier-protein synthase II	NA|246aa|down_9|NC_013222.1_934111_934849_+	TIGR02191, Ribonuclease_3, ribonuclease III, bacterial
GCF_000024125.1_ASM2412v1	NC_013222	Robiginitalea biformata HTCC2501, complete sequence	4	2963072-2963154	4	CRISPRCasFinder	no		DEDDh,csa3,WYL,PrimPol	Orphan	ATCGACGAGAACGGGGTGGACAT	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,WYL,PrimPol	NA|66aa|up_9|NC_013222.1_2953954_2954152_-,NA|126aa|up_4|NC_013222.1_2958970_2959348_-,NA|147aa|down_3|NC_013222.1_2966397_2966838_+,NA|242aa|down_6|NC_013222.1_2969862_2970588_-	NA|66aa|up_9|NC_013222.1_2953954_2954152_-	NA	NA|419aa|up_8|NC_013222.1_2954313_2955570_+	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|203aa|up_7|NC_013222.1_2955563_2956172_-	PRK15009, PRK15009, GDP-mannose pyrophosphatase NudK; Provisional	NA|529aa|up_6|NC_013222.1_2956345_2957932_+	pfam07364, DUF1485, Metallopeptidase family M81	NA|269aa|up_5|NC_013222.1_2958037_2958844_-	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|126aa|up_4|NC_013222.1_2958970_2959348_-	NA	NA|152aa|up_3|NC_013222.1_2959419_2959875_-	pfam14539, DUF4442, Domain of unknown function (DUF4442)	NA|161aa|up_2|NC_013222.1_2960009_2960492_+	pfam09685, DUF4870, Domain of unknown function (DUF4870)	NA|116aa|up_1|NC_013222.1_2960635_2960983_+	pfam14534, DUF4440, Domain of unknown function (DUF4440)	NA|112aa|up_0|NC_013222.1_2961093_2961429_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|241aa|down_0|NC_013222.1_2963321_2964044_+	pfam10988, DUF2807, Putative auto-transporter adhesin, head GIN domain	NA|340aa|down_1|NC_013222.1_2964158_2965178_+	cd03319, L-Ala-DL-Glu_epimerase, L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides	NA|356aa|down_2|NC_013222.1_2965219_2966287_+	COG0156, BioF, 7-keto-8-aminopelargonate synthetase and related enzymes [Coenzyme metabolism]	NA|147aa|down_3|NC_013222.1_2966397_2966838_+	NA	NA|326aa|down_4|NC_013222.1_2966932_2967910_-	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|580aa|down_5|NC_013222.1_2968126_2969866_-	pfam16576, HlyD_D23, Barrel-sandwich domain of CusB or HlyD membrane-fusion	NA|242aa|down_6|NC_013222.1_2969862_2970588_-	NA	NA|836aa|down_7|NC_013222.1_2970592_2973100_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|179aa|down_8|NC_013222.1_2973176_2973713_-	pfam12833, HTH_18, Helix-turn-helix domain	NA|186aa|down_9|NC_013222.1_2973764_2974322_-	pfam11827, DUF3347, Protein of unknown function (DUF3347)
