assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000018145.1_ASM1814v1	NC_009952	Dinoroseobacter shibae DFL 12 = DSM 16493, complete sequence	1	17751-17907	1	CRISPRCasFinder	no		csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh	Orphan	AAGGGCATCGCGCCGCGTGCGGTAAAGCGGTTTTCCTCGAAAACCTGCGACCG	53	0	0	NA	NA	NA	1	1	Orphan	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh,RT,PrimPol	NA|158aa|up_8|NC_009952.1_10219_10693_+,NA|156aa|up_7|NC_009952.1_10787_11255_+,NA|116aa|up_4|NC_009952.1_12776_13124_+,NA|98aa|down_5|NC_009952.1_25972_26266_-,NA|185aa|down_6|NC_009952.1_26399_26954_+	NA|276aa|up_9|NC_009952.1_9027_9855_+	PRK11830, dapD, 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional	NA|158aa|up_8|NC_009952.1_10219_10693_+	NA	NA|156aa|up_7|NC_009952.1_10787_11255_+	NA	NA|69aa|up_6|NC_009952.1_11746_11953_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|149aa|up_5|NC_009952.1_12120_12567_-	pfam03713, DUF305, Domain of unknown function (DUF305)	NA|116aa|up_4|NC_009952.1_12776_13124_+	NA	NA|380aa|up_3|NC_009952.1_13128_14268_+	PRK13009, PRK13009, succinyl-diaminopimelate desuccinylase; Reviewed	NA|143aa|up_2|NC_009952.1_14264_14693_+	COG2153, ElaA, Predicted acyltransferase [General function prediction only]	NA|193aa|up_1|NC_009952.1_14692_15271_+	TIGR04282, hypothetical_protein, transferase 1, rSAM/selenodomain-associated	NA|752aa|up_0|NC_009952.1_15471_17727_+	TIGR02063, Ribonuclease_R, ribonuclease R	NA|416aa|down_0|NC_009952.1_18178_19426_+	pfam13406, SLT_2, Transglycosylase SLT domain	NA|217aa|down_1|NC_009952.1_19515_20166_-	pfam06197, DUF998, Protein of unknown function (DUF998)	NA|768aa|down_2|NC_009952.1_20325_22629_-	cd13853, CuRO_1_Tth-MCO_like, The first cupredoxin domain of the bacterial laccases similar to Tth-MCO from Thermus Thermophilus	NA|606aa|down_3|NC_009952.1_22865_24683_+	COG1217, TypA, Predicted membrane GTPase involved in stress response [Signal transduction mechanisms]	NA|235aa|down_4|NC_009952.1_25296_26001_-	COG3710, CadC, DNA-binding winged-HTH domains [Transcription]	NA|98aa|down_5|NC_009952.1_25972_26266_-	NA	NA|185aa|down_6|NC_009952.1_26399_26954_+	NA	NA|207aa|down_7|NC_009952.1_26997_27618_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|369aa|down_8|NC_009952.1_27712_28819_+	COG0863, COG0863, DNA modification methylase [DNA replication, recombination, and repair]	NA|224aa|down_9|NC_009952.1_29142_29814_+	cd20303, cupin_ChrR_1, Marinobacter hydrocarbonoclasticus anti-ECFsigma factor ChrR, and similar proteins; 2 heterologous tandem repeats of cupin domain
GCF_000018145.1_ASM1814v1	NC_009952	Dinoroseobacter shibae DFL 12 = DSM 16493, complete sequence	2	396424-397710	1,2,1	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh	Type II-A, or Type II-C?, Type II-B,Type II-C,Type II-B	AGTTTAGCTGTTCAGAATTCGGGGTCCAGCCGCAAC,AGTTTAGCTGTTCAGAATTCGGGGTCCAGCCGCAAC,AGTTTAGCTGTTCAGAATTCGGGGTCCAGCCGCAAC	36,36,36	0	0	NA	NA	NA:NA:NA	18,18,19	19	TypeII-A,orTypeII-C?,TypeII-B,TypeII-C,TypeII-B	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh,RT,PrimPol	NA|261aa|up_8|NC_009952.1_383977_384760_-,NA|462aa|down_1|NC_009952.1_399226_400612_-,NA|74aa|down_2|NC_009952.1_401391_401613_+,NA|307aa|down_5|NC_009952.1_404245_405166_-,NA|283aa|down_6|NC_009952.1_405173_406022_-	NA|74aa|up_9|NC_009952.1_383696_383918_-	pfam12728, HTH_17, Helix-turn-helix domain	NA|261aa|up_8|NC_009952.1_383977_384760_-	NA	NA|398aa|up_7|NC_009952.1_384764_385958_-	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	NA|546aa|up_6|NC_009952.1_386136_387774_-	smart00857, Resolvase, Resolvase, N terminal domain	NA|359aa|up_5|NC_009952.1_387909_388986_-	TIGR02224, Tyrosine_recombinase_XerC, tyrosine recombinase XerC	NA|560aa|up_4|NC_009952.1_388978_390658_-	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|71aa|up_3|NC_009952.1_390816_391029_-	pfam12728, HTH_17, Helix-turn-helix domain	cas9|1080aa|up_2|NC_009952.1_391797_395037_+	TIGR01865, conserved_hypothetical_protein, CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1	cas1|304aa|up_1|NC_009952.1_395097_396009_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|102aa|up_0|NC_009952.1_396044_396350_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|367aa|down_0|NC_009952.1_397988_399088_+	pfam13683, rve_3, Integrase core domain	NA|462aa|down_1|NC_009952.1_399226_400612_-	NA	NA|74aa|down_2|NC_009952.1_401391_401613_+	NA	NA|109aa|down_3|NC_009952.1_402103_402430_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|241aa|down_4|NC_009952.1_402440_403163_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|307aa|down_5|NC_009952.1_404245_405166_-	NA	NA|283aa|down_6|NC_009952.1_405173_406022_-	NA	NA|511aa|down_7|NC_009952.1_406253_407786_+	COG2335, COG2335, Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]	NA|120aa|down_8|NC_009952.1_407835_408195_+	pfam13173, AAA_14, AAA domain	NA|149aa|down_9|NC_009952.1_408426_408873_-	PRK05753, PRK05753, nucleoside diphosphate kinase regulator; Provisional
GCF_000018145.1_ASM1814v1	NC_009952	Dinoroseobacter shibae DFL 12 = DSM 16493, complete sequence	3	3390219-3392259	2,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cse2gr11,cas7,cas5,cas6e,cas1,cas2	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh	Type I-E	GGCTCCCCCGCCCCCGCGGGGATAGACCC,GGCTCCCCCGCCCCCGCGGGGATAGACCC,GGCTCCCCCGCCCCCGCGGGGATAGACCC	29,29,29	0	0	NA	NA	NA:NA:NA	32,33,33	33	TypeI-E	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh,RT,PrimPol	NA,NA|257aa|down_3|NC_009952.1_3396007_3396778_-	NA|207aa|up_9|NC_009952.1_3377372_3377993_-	pfam00034, Cytochrom_C, Cytochrome c	NA|424aa|up_8|NC_009952.1_3377979_3379251_-	TIGR04555, probable_sulfite_oxidase_molybdopterin_subunit, sulfite dehydrogenase	NA|389aa|up_7|NC_009952.1_3379873_3381040_+	cd00796, INT_Rci_Hp1_C, Shufflon-specific DNA recombinase Rci and Bacteriophage Hp1_like integrase, C-terminal catalytic domain	cas3|851aa|up_6|NC_009952.1_3381531_3384084_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cse2gr11|192aa|up_5|NC_009952.1_3385789_3386365_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|374aa|up_4|NC_009952.1_3386357_3387479_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|229aa|up_3|NC_009952.1_3387475_3388162_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|264aa|up_2|NC_009952.1_3388158_3388950_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas1|292aa|up_1|NC_009952.1_3388954_3389830_+	cd09719, Cas1_I-E, CRISPR/Cas system-associated protein Cas1	cas2|98aa|up_0|NC_009952.1_3389861_3390155_+	pfam09707, Cas_Cas2CT1978, CRISPR-associated protein (Cas_Cas2CT1978)	NA|68aa|down_0|NC_009952.1_3392484_3392688_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|169aa|down_1|NC_009952.1_3392687_3393194_+	pfam12728, HTH_17, Helix-turn-helix domain	NA|353aa|down_2|NC_009952.1_3393171_3394230_+	cd00397, DNA_BRE_C, DNA breaking-rejoining enzymes, C-terminal catalytic domain	NA|257aa|down_3|NC_009952.1_3396007_3396778_-	NA	NA|283aa|down_4|NC_009952.1_3397086_3397935_+	PRK00450, dapF, diaminopimelate epimerase; Provisional	NA|417aa|down_5|NC_009952.1_3397931_3399182_+	COG0621, MiaB, 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis]	NA|553aa|down_6|NC_009952.1_3399662_3401321_+	cd03468, PolY_like, DNA Polymerase Y-family	NA|117aa|down_7|NC_009952.1_3401705_3402056_+	pfam10984, DUF2794, Protein of unknown function (DUF2794)	NA|280aa|down_8|NC_009952.1_3402059_3402899_-	pfam00877, NLPC_P60, NlpC/P60 family	NA|464aa|down_9|NC_009952.1_3402895_3404287_-	cd00433, Peptidase_M17, Cytosol aminopeptidase family, N-terminal and catalytic domains
GCF_000018145.1_ASM1814v1	NC_009956	Dinoroseobacter shibae DFL 12 = DSM 16493 plasmid pDSHI02, complete sequence	1	87657-87843	1	CRISPRCasFinder	no			Orphan	GGCGGCGCGGGCGATGACAGCCTG	24	0	0	NA	NA	NA	3	3	Orphan	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh,RT,PrimPol	NA|330aa|up_5|NC_009956.1_74803_75793_-,NA|74aa|down_6|NC_009956.1_100168_100390_+,NA|271aa|down_8|NC_009956.1_102304_103117_+,NA|175aa|down_9|NC_009956.1_103400_103925_+	NA|389aa|up_9|NC_009956.1_69591_70758_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|646aa|up_8|NC_009956.1_70762_72700_+	pfam07940, Hepar_II_III, Heparinase II/III-like protein	NA|377aa|up_7|NC_009956.1_72754_73885_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|260aa|up_6|NC_009956.1_73901_74681_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|330aa|up_5|NC_009956.1_74803_75793_-	NA	NA|1303aa|up_4|NC_009956.1_75904_79813_-	cd03799, GT4_AmsK-like, Erwinia amylovora AmsK and similar proteins	NA|297aa|up_3|NC_009956.1_79817_80708_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|360aa|up_2|NC_009956.1_80803_81883_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|392aa|up_1|NC_009956.1_81945_83121_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|268aa|up_0|NC_009956.1_83555_84359_-	cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms	NA|1883aa|down_0|NC_009956.1_88392_94041_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|195aa|down_1|NC_009956.1_94096_94681_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|275aa|down_2|NC_009956.1_94736_95561_-	COG2842, COG2842, Uncharacterized ATPase, putative transposase [General function prediction only]	NA|472aa|down_3|NC_009956.1_95557_96973_-	pfam00665, rve, Integrase core domain	NA|200aa|down_4|NC_009956.1_97189_97789_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|603aa|down_5|NC_009956.1_98309_100118_+	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|74aa|down_6|NC_009956.1_100168_100390_+	NA	NA|461aa|down_7|NC_009956.1_100521_101904_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|271aa|down_8|NC_009956.1_102304_103117_+	NA	NA|175aa|down_9|NC_009956.1_103400_103925_+	NA
GCF_000018145.1_ASM1814v1	NC_009956	Dinoroseobacter shibae DFL 12 = DSM 16493 plasmid pDSHI02, complete sequence	2	87927-88004	2	CRISPRCasFinder	no			Orphan	GGCGGCGCGGGCGATGACAGCCTG	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas9,cas1,cas2,cas3,WYL,cse2gr11,cas7,cas5,cas6e,DEDDh,RT,PrimPol	NA|330aa|up_5|NC_009956.1_74803_75793_-,NA|74aa|down_6|NC_009956.1_100168_100390_+,NA|271aa|down_8|NC_009956.1_102304_103117_+,NA|175aa|down_9|NC_009956.1_103400_103925_+	NA|389aa|up_9|NC_009956.1_69591_70758_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|646aa|up_8|NC_009956.1_70762_72700_+	pfam07940, Hepar_II_III, Heparinase II/III-like protein	NA|377aa|up_7|NC_009956.1_72754_73885_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|260aa|up_6|NC_009956.1_73901_74681_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|330aa|up_5|NC_009956.1_74803_75793_-	NA	NA|1303aa|up_4|NC_009956.1_75904_79813_-	cd03799, GT4_AmsK-like, Erwinia amylovora AmsK and similar proteins	NA|297aa|up_3|NC_009956.1_79817_80708_-	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|360aa|up_2|NC_009956.1_80803_81883_-	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|392aa|up_1|NC_009956.1_81945_83121_-	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|268aa|up_0|NC_009956.1_83555_84359_-	cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms	NA|1883aa|down_0|NC_009956.1_88392_94041_+	NF033203, entero_EhxA, enterohemolysin EhxA	NA|195aa|down_1|NC_009956.1_94096_94681_-	pfam13469, Sulfotransfer_3, Sulfotransferase family	NA|275aa|down_2|NC_009956.1_94736_95561_-	COG2842, COG2842, Uncharacterized ATPase, putative transposase [General function prediction only]	NA|472aa|down_3|NC_009956.1_95557_96973_-	pfam00665, rve, Integrase core domain	NA|200aa|down_4|NC_009956.1_97189_97789_+	cd03768, SR_ResInv, Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain	NA|603aa|down_5|NC_009956.1_98309_100118_+	PRK09284, PRK09284, thiamine biosynthesis protein ThiC; Provisional	NA|74aa|down_6|NC_009956.1_100168_100390_+	NA	NA|461aa|down_7|NC_009956.1_100521_101904_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|271aa|down_8|NC_009956.1_102304_103117_+	NA	NA|175aa|down_9|NC_009956.1_103400_103925_+	NA
