assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_008693785.1_ASM869378v1	NZ_CP044115	Roseomonas mucosa strain FDAARGOS_658 chromosome 2, complete sequence	1	52996-56075	1,1,1,2	PILER-CR,CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	Type I-E	GTGTTCCCCGCGAGCGCGGGGATGAACCG,GTGTTCCCCGCGAGCGCGGGGATGAACCG,GTGTTCCCCGCGAGCGCGGGGATGAACCG,GTGTTCCCCGCGAGCGCGGGGATGAACCG	29,29,29,29	1	1	54672-54703	NZ_CP044117.1_2369297-2369266	I-E:I-E:I-E:I-E	48,50,50,48	50	TypeI-E	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,cas8c,cas4,DinG,DEDDh	NA,NA	NA|75aa|up_9|NZ_CP044115.1_43516_43741_+	COG4456, VagC, Virulence-associated protein and related proteins [Function unknown]	NA|137aa|up_8|NZ_CP044115.1_43737_44148_+	cd18745, PIN_VapC4-5_FitB-like, uncharacterized subgroup of the PIN_VapC4-5_FitB-like subfamily of the PIN domain superfamily	cas3|892aa|up_7|NZ_CP044115.1_44205_46881_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|539aa|up_6|NZ_CP044115.1_46957_48574_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|187aa|up_5|NZ_CP044115.1_48570_49131_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|360aa|up_4|NZ_CP044115.1_49127_50207_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|256aa|up_3|NZ_CP044115.1_50211_50979_+	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|230aa|up_2|NZ_CP044115.1_50975_51665_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas1|323aa|up_1|NZ_CP044115.1_51664_52633_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|111aa|up_0|NZ_CP044115.1_52613_52946_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|500aa|down_0|NZ_CP044115.1_56207_57707_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|243aa|down_1|NZ_CP044115.1_57706_58435_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|84aa|down_2|NZ_CP044115.1_58589_58841_+	pfam13610, DDE_Tnp_IS240, DDE domain	NA|151aa|down_3|NZ_CP044115.1_58905_59358_-	cd09873, PIN_Pae0151-like, VapC-like PIN domain of the Pyrobaculum aerophilum Pae0151 and Pae2754 proteins and homologs	NA|332aa|down_4|NZ_CP044115.1_60056_61052_+	pfam04796, RepA_C, Plasmid encoded RepA protein	NA|269aa|down_5|NZ_CP044115.1_61093_61900_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|381aa|down_6|NZ_CP044115.1_62582_63724_+	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|86aa|down_7|NZ_CP044115.1_63711_63969_+	pfam13545, HTH_Crp_2, Crp-like helix-turn-helix domain	NA|533aa|down_8|NZ_CP044115.1_64221_65820_-	cd08498, PBP2_NikA_DppA_OppA_like_2, The substrate-binding component of an uncharacterized ABC-type nickel/dipeptide/oligopeptide-like import system contains the type 2 periplasmic binding fold	NA|362aa|down_9|NZ_CP044115.1_66537_67623_-	PRK11650, ugpC, sn-glycerol-3-phosphate ABC transporter ATP-binding protein UgpC
GCF_008693785.1_ASM869378v1	NZ_CP044117	Roseomonas mucosa strain FDAARGOS_658 chromosome 4, complete sequence	1	421160-423228	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,DEDDh	Type I-C,Type I-U, Type I-U?	GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,GTCGCTCCCCGTGCGGGAGCGTGGATCGAAAC,TCGCTCCCCGTGCGGGAGCGTGGATCGAAAC	32,32,31	0	0	NA	NA	I-C:I-C:NA	31,31,31	31	TypeI-C,TypeI-U,TypeI-U?	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,cas8c,cas4,DinG,DEDDh	NA,NA|92aa|down_4|NZ_CP044117.1_429207_429483_-	NA|411aa|up_9|NZ_CP044117.1_409834_411067_+	TIGR00937, Chromate_transport_protein, chromate transporter, chromate ion transporter (CHR) family	NA|98aa|up_8|NZ_CP044117.1_411129_411423_+	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	cas3|752aa|up_7|NZ_CP044117.1_411994_414250_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|226aa|up_6|NZ_CP044117.1_414446_415124_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|597aa|up_5|NZ_CP044117.1_415120_416911_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	NA|346aa|up_4|NZ_CP044117.1_417394_418432_+	pfam13358, DDE_3, DDE superfamily endonuclease	cas7|170aa|up_3|NZ_CP044117.1_418459_418969_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|215aa|up_2|NZ_CP044117.1_418979_419624_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|347aa|up_1|NZ_CP044117.1_419628_420669_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_CP044117.1_420678_420969_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|500aa|down_0|NZ_CP044117.1_424133_425633_+	COG4584, COG4584, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|243aa|down_1|NZ_CP044117.1_425632_426361_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|169aa|down_2|NZ_CP044117.1_427447_427954_-	cd07909, YciF, YciF bacterial stress response protein, ferritin-like iron-binding domain	NA|170aa|down_3|NZ_CP044117.1_428328_428838_-	pfam05974, DUF892, Domain of unknown function (DUF892)	NA|92aa|down_4|NZ_CP044117.1_429207_429483_-	NA	NA|703aa|down_5|NZ_CP044117.1_429909_432018_-	PRK14507, PRK14507, malto-oligosyltrehalose synthase	NA|607aa|down_6|NZ_CP044117.1_432014_433835_-	TIGR02402, Malto-oligosyltrehalose_trehalohydrolase, malto-oligosyltrehalose trehalohydrolase	NA|703aa|down_7|NZ_CP044117.1_433831_435940_-	TIGR02100, Glycogen_operon_protein_GlgX_homolog, glycogen debranching enzyme GlgX	NA|737aa|down_8|NZ_CP044117.1_435923_438134_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|1083aa|down_9|NZ_CP044117.1_438123_441372_-	TIGR02456, Trehalose_synthase, trehalose synthase
GCF_008693785.1_ASM869378v1	NZ_CP044117	Roseomonas mucosa strain FDAARGOS_658 chromosome 4, complete sequence	2	1145959-1146054	2	CRISPRCasFinder	no	csa3	csa3,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,DinG,DEDDh	Type I-A	CTGACAGTGCTGCATCATCTGCTGCAT	27	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,cas8c,cas4,DinG,DEDDh	NA|78aa|up_3|NZ_CP044117.1_1141945_1142179_+,NA|121aa|up_0|NZ_CP044117.1_1145412_1145775_-,NA	NA|500aa|up_9|NZ_CP044117.1_1136170_1137670_+	pfam00665, rve, Integrase core domain	NA|243aa|up_8|NZ_CP044117.1_1137669_1138398_+	pfam01695, IstB_IS21, IstB-like ATP binding protein	NA|119aa|up_7|NZ_CP044117.1_1138523_1138880_+	cd00887, MoeA, MoeA family	NA|162aa|up_6|NZ_CP044117.1_1139189_1139675_-	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|475aa|up_5|NZ_CP044117.1_1139812_1141237_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|111aa|up_4|NZ_CP044117.1_1141410_1141743_-	cd08026, DUF326, Cysteine-rich 4 helical bundle widely conserved in bacteria	NA|78aa|up_3|NZ_CP044117.1_1141945_1142179_+	NA	NA|789aa|up_2|NZ_CP044117.1_1142587_1144954_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|68aa|up_1|NZ_CP044117.1_1145118_1145322_+	cd00371, HMA, Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones	NA|121aa|up_0|NZ_CP044117.1_1145412_1145775_-	NA	NA|158aa|down_0|NZ_CP044117.1_1146451_1146925_-	COG3019, COG3019, Predicted metal-binding protein [General function prediction only]	NA|412aa|down_1|NZ_CP044117.1_1146935_1148171_-	PRK09467, envZ, osmolarity sensor protein; Provisional	NA|256aa|down_2|NZ_CP044117.1_1148167_1148935_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|152aa|down_3|NZ_CP044117.1_1148915_1149371_-	cd04218, Pseudoazurin, Pseudoazurin (Paz) is a type I blue copper electron-transfer protein	NA|252aa|down_4|NZ_CP044117.1_1149385_1150141_-	pfam05275, CopB, Copper resistance protein B precursor (CopB)	NA|652aa|down_5|NZ_CP044117.1_1150146_1152102_-	TIGR01480, unnamed_protein_product, copper-resistance protein, CopA family	NA|190aa|down_6|NZ_CP044117.1_1152147_1152717_-	cd04211, Cupredoxin_like_2, Uncharacterized Cupredoxin-like subfamily	csa3|118aa|down_7|NZ_CP044117.1_1154433_1154787_+	pfam12840, HTH_20, Helix-turn-helix domain	NA|142aa|down_8|NZ_CP044117.1_1154783_1155209_+	PRK10026, PRK10026, arsenate reductase (glutaredoxin)	NA|438aa|down_9|NZ_CP044117.1_1155317_1156631_+	PRK15445, PRK15445, arsenical efflux pump membrane protein ArsB
