assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001563285.1_ASM156328v1	NZ_CP005188	Sphingobium sp. MI1205 chromosome 1, complete sequence	1	552110-552198	1	CRISPRCasFinder	no	DinG	DEDDh,cas3,csa3,WYL,DinG	Type IV-A	CGCGAATATCGCAAGGATGTGCG	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,WYL,DinG,RT	NA|205aa|up_0|NZ_CP005188.1_551175_551790_+,NA|102aa|down_0|NZ_CP005188.1_552712_553018_-	NA|243aa|up_9|NZ_CP005188.1_541729_542458_-	COG4117, COG4117, Thiosulfate reductase cytochrome B subunit (membrane anchoring protein) [Energy production and conversion]	NA|169aa|up_8|NZ_CP005188.1_542872_543379_+	pfam13505, OMP_b-brl, Outer membrane protein beta-barrel domain	NA|306aa|up_7|NZ_CP005188.1_543591_544509_-	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|135aa|up_6|NZ_CP005188.1_544511_544916_-	cd01038, Endonuclease_DUF559, Domain of unknown function, appears to be related to a diverse group of endonucleases	NA|507aa|up_5|NZ_CP005188.1_544989_546510_-	COG1951, TtdA, Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain [Energy production and conversion]	NA|215aa|up_4|NZ_CP005188.1_546572_547217_-	COG0625, Gst, Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]	NA|645aa|up_3|NZ_CP005188.1_547744_549679_+	PRK09102, PRK09102, ribonucleoside-diphosphate reductase subunit alpha	NA|85aa|up_2|NZ_CP005188.1_549743_549998_-	pfam09939, DUF2171, Uncharacterized protein conserved in bacteria (DUF2171)	NA|352aa|up_1|NZ_CP005188.1_550086_551142_+	PRK09614, nrdF, ribonucleotide-diphosphate reductase subunit beta; Reviewed	NA|205aa|up_0|NZ_CP005188.1_551175_551790_+	NA	NA|102aa|down_0|NZ_CP005188.1_552712_553018_-	NA	NA|127aa|down_1|NZ_CP005188.1_553153_553534_+	cd06154, YjgF_YER057c_UK114_like_6, This group of proteins belong to a large family of YjgF/YER057c/UK114-like proteins present in bacteria, archaea, and eukaryotes with no definitive function	NA|121aa|down_2|NZ_CP005188.1_553539_553902_+	cd03035, ArsC_Yffb, Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC	NA|142aa|down_3|NZ_CP005188.1_553954_554380_+	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|219aa|down_4|NZ_CP005188.1_554419_555076_+	cd07737, YcbL-like_MBL-fold, Salmonella enterica serovar typhimurium YcbL and related proteins; MBL-fold metallo hydrolase domain	NA|143aa|down_5|NZ_CP005188.1_555080_555509_-	COG3788, COG3788, Uncharacterized relative of glutathione S-transferase, MAPEG superfamily [General function prediction only]	NA|60aa|down_6|NZ_CP005188.1_555721_555901_+	PRK12286, rpmF, 50S ribosomal protein L32; Reviewed	NA|353aa|down_7|NZ_CP005188.1_555933_556992_+	PRK05331, PRK05331, phosphate acyltransferase PlsX	NA|323aa|down_8|NZ_CP005188.1_556988_557957_+	PRK09352, PRK09352, beta-ketoacyl-ACP synthase 3	NA|100aa|down_9|NZ_CP005188.1_558081_558381_+	PRK00285, ihfA, integration host factor subunit alpha; Reviewed
GCF_001563285.1_ASM156328v1	NZ_CP005188	Sphingobium sp. MI1205 chromosome 1, complete sequence	2	2380574-2380762	2	CRISPRCasFinder	no		DEDDh,cas3,csa3,WYL,DinG	Orphan	GGAAGCCACTTGCAAGGTCGCCTTGGA	27	0	0	NA	NA	NA	3	3	Orphan	DEDDh,cas3,csa3,WYL,DinG,RT	NA|102aa|up_9|NZ_CP005188.1_2375030_2375336_-,NA|60aa|up_8|NZ_CP005188.1_2375446_2375626_-,NA|52aa|up_6|NZ_CP005188.1_2376748_2376904_-,NA|161aa|up_4|NZ_CP005188.1_2378154_2378637_-,NA|107aa|up_3|NZ_CP005188.1_2378636_2378957_-,NA|129aa|up_2|NZ_CP005188.1_2378956_2379343_-,NA|147aa|up_0|NZ_CP005188.1_2380014_2380455_-,NA|324aa|down_0|NZ_CP005188.1_2380773_2381745_-,NA|65aa|down_6|NZ_CP005188.1_2390547_2390742_-,NA|209aa|down_7|NZ_CP005188.1_2390867_2391494_-,NA|388aa|down_8|NZ_CP005188.1_2391501_2392665_-,NA|77aa|down_9|NZ_CP005188.1_2392693_2392924_-	NA|102aa|up_9|NZ_CP005188.1_2375030_2375336_-	NA	NA|60aa|up_8|NZ_CP005188.1_2375446_2375626_-	NA	NA|240aa|up_7|NZ_CP005188.1_2375880_2376600_+	PRK14716, PRK14716, glycosyl transferase family protein	NA|52aa|up_6|NZ_CP005188.1_2376748_2376904_-	NA	NA|284aa|up_5|NZ_CP005188.1_2376946_2377798_-	COG0338, Dam, Site-specific DNA methylase [DNA replication, recombination, and repair]	NA|161aa|up_4|NZ_CP005188.1_2378154_2378637_-	NA	NA|107aa|up_3|NZ_CP005188.1_2378636_2378957_-	NA	NA|129aa|up_2|NZ_CP005188.1_2378956_2379343_-	NA	NA|162aa|up_1|NZ_CP005188.1_2379339_2379825_-	cd00737, lyz_endolysin_autolysin, endolysin and autolysin	NA|147aa|up_0|NZ_CP005188.1_2380014_2380455_-	NA	NA|324aa|down_0|NZ_CP005188.1_2380773_2381745_-	NA	NA|224aa|down_1|NZ_CP005188.1_2381741_2382413_-	pfam10983, DUF2793, Protein of unknown function (DUF2793)	NA|1080aa|down_2|NZ_CP005188.1_2382421_2385661_-	pfam13550, Phage-tail_3, Putative phage tail protein	NA|357aa|down_3|NZ_CP005188.1_2386102_2387173_-	pfam09931, DUF2163, Uncharacterized conserved protein (DUF2163)	NA|209aa|down_4|NZ_CP005188.1_2387169_2387796_-	pfam09343, DUF2460, Conserved hypothetical protein 2217 (DUF2460)	NA|917aa|down_5|NZ_CP005188.1_2387792_2390543_-	pfam06791, TMP_2, Prophage tail length tape measure protein	NA|65aa|down_6|NZ_CP005188.1_2390547_2390742_-	NA	NA|209aa|down_7|NZ_CP005188.1_2390867_2391494_-	NA	NA|388aa|down_8|NZ_CP005188.1_2391501_2392665_-	NA	NA|77aa|down_9|NZ_CP005188.1_2392693_2392924_-	NA
GCF_001563285.1_ASM156328v1	NZ_CP005192	Sphingobium sp. MI1205 plasmid pMI3, complete sequence	1	31123-31212	1	CRISPRCasFinder	no			Orphan	GACTCGACCCCAAAACTCACTTTT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,WYL,DinG,RT	NA|190aa|up_8|NZ_CP005192.1_23379_23949_-,NA|231aa|up_4|NZ_CP005192.1_27433_28126_+,NA|151aa|up_3|NZ_CP005192.1_28154_28607_+,NA|94aa|up_1|NZ_CP005192.1_29756_30038_+,NA|241aa|down_3|NZ_CP005192.1_36833_37556_+,NA|85aa|down_5|NZ_CP005192.1_38758_39013_+,NA|85aa|down_8|NZ_CP005192.1_40941_41196_+	NA|225aa|up_9|NZ_CP005192.1_22545_23220_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|190aa|up_8|NZ_CP005192.1_23379_23949_-	NA	NA|326aa|up_7|NZ_CP005192.1_24049_25027_-	COG3267, ExeA, Type II secretory pathway, component ExeA (predicted ATPase) [Intracellular trafficking and secretion]	NA|551aa|up_6|NZ_CP005192.1_25026_26679_-	pfam00665, rve, Integrase core domain	NA|199aa|up_5|NZ_CP005192.1_26671_27268_-	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|231aa|up_4|NZ_CP005192.1_27433_28126_+	NA	NA|151aa|up_3|NZ_CP005192.1_28154_28607_+	NA	NA|212aa|up_2|NZ_CP005192.1_29095_29731_+	PHA02518, PHA02518, ParA-like protein; Provisional	NA|94aa|up_1|NZ_CP005192.1_29756_30038_+	NA	NA|304aa|up_0|NZ_CP005192.1_30094_31006_-	pfam01051, Rep_3, Initiator Replication protein	NA|75aa|down_0|NZ_CP005192.1_32951_33176_-	pfam06412, TraD, Conjugal transfer protein TraD	NA|102aa|down_1|NZ_CP005192.1_33213_33519_-	pfam06412, TraD, Conjugal transfer protein TraD	NA|1045aa|down_2|NZ_CP005192.1_33691_36826_+	PRK13889, PRK13889, conjugal transfer relaxase TraA; Provisional	NA|241aa|down_3|NZ_CP005192.1_36833_37556_+	NA	NA|255aa|down_4|NZ_CP005192.1_37965_38730_-	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|85aa|down_5|NZ_CP005192.1_38758_39013_+	NA	NA|297aa|down_6|NZ_CP005192.1_39098_39989_+	PRK03592, PRK03592, haloalkane dehalogenase; Provisional	NA|255aa|down_7|NZ_CP005192.1_40148_40913_-	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|85aa|down_8|NZ_CP005192.1_40941_41196_+	NA	NA|297aa|down_9|NZ_CP005192.1_41281_42172_+	PRK03592, PRK03592, haloalkane dehalogenase; Provisional
GCF_001563285.1_ASM156328v1	NZ_CP005193	Sphingobium sp. MI1205 plasmid pMI4, complete sequence	1	18214-18303	1	CRISPRCasFinder	no			Orphan	AAAAGTGAGTTTTGGGGTCGAGTC	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,csa3,WYL,DinG,RT	NA|80aa|up_9|NZ_CP005193.1_8557_8797_-,NA|71aa|up_5|NZ_CP005193.1_10290_10503_-,NA|241aa|up_3|NZ_CP005193.1_11869_12592_-,NA|94aa|down_1|NZ_CP005193.1_19387_19669_-	NA|80aa|up_9|NZ_CP005193.1_8557_8797_-	NA	NA|83aa|up_8|NZ_CP005193.1_8994_9243_+	pfam13711, DUF4160, Domain of unknown function (DUF4160)	NA|138aa|up_7|NZ_CP005193.1_9229_9643_+	pfam10387, DUF2442, Protein of unknown function (DUF2442)	NA|128aa|up_6|NZ_CP005193.1_9910_10294_-	pfam00072, Response_reg, Response regulator receiver domain	NA|71aa|up_5|NZ_CP005193.1_10290_10503_-	NA	NA|255aa|up_4|NZ_CP005193.1_10695_11460_+	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|241aa|up_3|NZ_CP005193.1_11869_12592_-	NA	NA|1045aa|up_2|NZ_CP005193.1_12599_15734_-	PRK13889, PRK13889, conjugal transfer relaxase TraA; Provisional	NA|102aa|up_1|NZ_CP005193.1_15906_16212_+	pfam06412, TraD, Conjugal transfer protein TraD	NA|75aa|up_0|NZ_CP005193.1_16249_16474_+	pfam06412, TraD, Conjugal transfer protein TraD	NA|304aa|down_0|NZ_CP005193.1_18419_19331_+	pfam01051, Rep_3, Initiator Replication protein	NA|94aa|down_1|NZ_CP005193.1_19387_19669_-	NA	NA|212aa|down_2|NZ_CP005193.1_19694_20330_-	PHA02518, PHA02518, ParA-like protein; Provisional	NA|219aa|down_3|NZ_CP005193.1_20584_21241_-	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|370aa|down_4|NZ_CP005193.1_21496_22606_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|255aa|down_5|NZ_CP005193.1_22571_23336_+	COG3316, COG3316, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|304aa|down_6|NZ_CP005193.1_24381_25293_-	cd08459, PBP2_DntR_NahR_LinR_like, The C-terminal substrate binding domain of LysR-type transcriptional regulators that are involved in the catabolism of dinitrotoluene, naphthalene and gamma-hexachlorohexane; contains the type 2 periplasmic binding fold	NA|322aa|down_7|NZ_CP005193.1_25424_26390_+	cd08346, PcpA_N_like, N-terminal domain of Sphingobium chlorophenolicum 2,6-dichloro-p-hydroquinone 1,2-dioxygenase (PcpA), and similar proteins	NA|270aa|down_8|NZ_CP005193.1_26414_27224_+	TIGR02427, b-ketoadipate_enol-lactone_hydrolase, 3-oxoadipate enol-lactonase	NA|239aa|down_9|NZ_CP005193.1_27220_27937_+	COG0400, COG0400, Predicted esterase [General function prediction only]
