assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001548155.2_BV133_assembly_1.0	NZ_AP014854	Blastochloris viridis strain DSM 133	1	2557437-2560836	1,1,1,2	CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	cas2,cas1,cas4,cas7b,cas8c,cas5,cas3	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	Type I-C,Type I-U, Type I-U?	GTTTCGATCCACGCCCCTGCGCGAGGGGCGAC,GTTTCGATCCACGCCCCTGCGCGAGGGGCGAC,GTTTCGATCCACGCCCCTGCGCGAGGGGCGAC,GTTTCGATCCACGCCCCTGCGCGAGGGGCGAC	32,32,32,32	0	0	NA	NA	NA:NA:NA:NA	51,51,48,48	51	TypeI-C,TypeI-U,TypeI-U?	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	NA,NA|402aa|down_8|NZ_AP014854.2_2570142_2571348_-	NA|242aa|up_9|NZ_AP014854.2_2545193_2545919_-	pfam17036, CBP_BcsS, Cellulose biosynthesis protein BcsS	NA|242aa|up_8|NZ_AP014854.2_2546116_2546842_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|259aa|up_7|NZ_AP014854.2_2546838_2547615_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|318aa|up_6|NZ_AP014854.2_2547611_2548565_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|383aa|up_5|NZ_AP014854.2_2548740_2549889_+	pfam12698, ABC2_membrane_3, ABC-2 family transporter protein	NA|101aa|up_4|NZ_AP014854.2_2550065_2550368_-	cd03214, ABC_Iron-Siderophores_B12_Hemin, ATP-binding component of iron-siderophores, vitamin B12 and hemin transporters and related proteins	NA|519aa|up_3|NZ_AP014854.2_2550439_2551996_-	TIGR01845, Outer_membrane_protein_OprM, efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family	NA|1054aa|up_2|NZ_AP014854.2_2551992_2555154_-	TIGR00915, Probable_aminoglycoside_efflux_pump, The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family	NA|421aa|up_1|NZ_AP014854.2_2555161_2556424_-	PRK15030, PRK15030, multidrug efflux RND transporter periplasmic adaptor subunit AcrA	NA|235aa|up_0|NZ_AP014854.2_2556558_2557263_+	pfam00440, TetR_N, Bacterial regulatory proteins, tetR family	cas2|97aa|down_0|NZ_AP014854.2_2561051_2561342_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|345aa|down_1|NZ_AP014854.2_2561420_2562455_-	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas4|218aa|down_2|NZ_AP014854.2_2562451_2563105_-	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas7b|322aa|down_3|NZ_AP014854.2_2563116_2564082_-	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas8c|587aa|down_4|NZ_AP014854.2_2564078_2565839_-	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas5|222aa|down_5|NZ_AP014854.2_2565835_2566501_-	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas3|756aa|down_6|NZ_AP014854.2_2566591_2568859_-	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	NA|200aa|down_7|NZ_AP014854.2_2569433_2570033_-	cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands	NA|402aa|down_8|NZ_AP014854.2_2570142_2571348_-	NA	NA|71aa|down_9|NZ_AP014854.2_2571446_2571659_-	pfam11003, DUF2842, Protein of unknown function (DUF2842)
GCF_001548155.2_BV133_assembly_1.0	NZ_AP014854	Blastochloris viridis strain DSM 133	2	2725011-2726541	3,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8u2,cas7,csb2gr5,cas1,cas2	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	Unclear	GCCTTTCCCGAGCAGCACGCTCGGGCCTCATTGAAG,GCCTTTCCCGAGCAGCACGCTCGGGCCTCATTGAAGC,GCCTTTCCCGAGCAGCACGCTCGGGCCTCATTGAAGC	36,37,37	0	0	NA	NA	NA:NA:NA	19,19,20	20	Unclear	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	NA,NA|310aa|down_1|NZ_AP014854.2_2729580_2730510_-,NA|145aa|down_5|NZ_AP014854.2_2734749_2735184_+	NA|327aa|up_9|NZ_AP014854.2_2698486_2699467_+	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]	NA|4249aa|up_8|NZ_AP014854.2_2699820_2712567_+	pfam12545, DUF3739, Filamentous haemagglutinin family outer membrane protein	NA|231aa|up_7|NZ_AP014854.2_2713257_2713950_-	pfam13358, DDE_3, DDE superfamily endonuclease	NA|201aa|up_6|NZ_AP014854.2_2714390_2714993_-	pfam11994, DUF3489, Protein of unknown function (DUF3489)	cas3|909aa|up_5|NZ_AP014854.2_2715357_2718084_+	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	cas8u2|297aa|up_4|NZ_AP014854.2_2718080_2718971_+	TIGR04106, hypothetical_protein_GobsU_11505, CRISPR-associated protein GSU0052/csb3, Dpsyc system	cas7|367aa|up_3|NZ_AP014854.2_2718967_2720068_+	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	csb2gr5|722aa|up_2|NZ_AP014854.2_2720076_2722242_+	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas1|605aa|up_1|NZ_AP014854.2_2722276_2724091_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|102aa|up_0|NZ_AP014854.2_2724202_2724508_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|627aa|down_0|NZ_AP014854.2_2727713_2729594_-	cd06453, SufS_like, Cysteine desulfurase (SufS)-like	NA|310aa|down_1|NZ_AP014854.2_2729580_2730510_-	NA	NA|406aa|down_2|NZ_AP014854.2_2731025_2732243_-	pfam06050, HGD-D, 2-hydroxyglutaryl-CoA dehydratase, D-component	NA|422aa|down_3|NZ_AP014854.2_2732239_2733505_-	pfam06050, HGD-D, 2-hydroxyglutaryl-CoA dehydratase, D-component	NA|351aa|down_4|NZ_AP014854.2_2733521_2734574_-	COG1924, COG1924, Activator of 2-hydroxyglutaryl-CoA dehydratase (HSP70-class ATPase domain) [Lipid metabolism]	NA|145aa|down_5|NZ_AP014854.2_2734749_2735184_+	NA	NA|346aa|down_6|NZ_AP014854.2_2735210_2736248_+	COG0600, TauC, ABC-type nitrate/sulfonate/bicarbonate transport system, permease component [Inorganic ion transport and metabolism]	NA|269aa|down_7|NZ_AP014854.2_2736241_2737048_+	cd03293, ABC_NrtD_SsuB_transporters, ATP-binding cassette domain of the nitrate and sulfonate transporters	NA|150aa|down_8|NZ_AP014854.2_2737229_2737679_-	COG1959, COG1959, Predicted transcriptional regulator [Transcription]	NA|433aa|down_9|NZ_AP014854.2_2738094_2739393_-	pfam06050, HGD-D, 2-hydroxyglutaryl-CoA dehydratase, D-component
GCF_001548155.2_BV133_assembly_1.0	NZ_AP014854	Blastochloris viridis strain DSM 133	3	2953813-2953905	3	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	Orphan	CGTTTCGATCCACGCCCTCGCGTGAGG	27	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	NA|51aa|up_7|NZ_AP014854.2_2944593_2944746_-,NA|65aa|up_0|NZ_AP014854.2_2953136_2953331_+,NA|105aa|down_0|NZ_AP014854.2_2954515_2954830_+	NA|290aa|up_9|NZ_AP014854.2_2942772_2943642_-	COG1108, ZnuB, ABC-type Mn2+/Zn2+ transport systems, permease components [Inorganic ion transport and metabolism]	NA|246aa|up_8|NZ_AP014854.2_2943845_2944583_-	COG1121, ZnuC, ABC-type Mn/Zn transport systems, ATPase component [Inorganic ion transport and metabolism]	NA|51aa|up_7|NZ_AP014854.2_2944593_2944746_-	NA	NA|746aa|up_6|NZ_AP014854.2_2945018_2947256_-	cd02094, P-type_ATPase_Cu-like, P-type heavy metal-transporting ATPase, similar to human copper-transporting ATPases, ATP7A and ATP7B	NA|129aa|up_5|NZ_AP014854.2_2947658_2948045_-	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|366aa|up_4|NZ_AP014854.2_2948256_2949354_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|364aa|up_3|NZ_AP014854.2_2949571_2950663_+	COG2153, ElaA, Predicted acyltransferase [General function prediction only]	NA|200aa|up_2|NZ_AP014854.2_2950659_2951259_+	pfam00857, Isochorismatase, Isochorismatase family	NA|381aa|up_1|NZ_AP014854.2_2951852_2952995_-	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|65aa|up_0|NZ_AP014854.2_2953136_2953331_+	NA	NA|105aa|down_0|NZ_AP014854.2_2954515_2954830_+	NA	NA|1057aa|down_1|NZ_AP014854.2_2954823_2957994_+	pfam03797, Autotransporter, Autotransporter beta-domain	NA|207aa|down_2|NZ_AP014854.2_2958900_2959521_+	smart00965, STN, Secretin and TonB N terminus short domain	NA|209aa|down_3|NZ_AP014854.2_2959591_2960218_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|310aa|down_4|NZ_AP014854.2_2960282_2961212_+	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]	NA|4529aa|down_5|NZ_AP014854.2_2961741_2975328_+	pfam12545, DUF3739, Filamentous haemagglutinin family outer membrane protein	NA|221aa|down_6|NZ_AP014854.2_2975717_2976380_+	smart00965, STN, Secretin and TonB N terminus short domain	NA|203aa|down_7|NZ_AP014854.2_2976481_2977090_+	COG0810, TonB, Periplasmic protein TonB, links inner and outer membranes [Cell envelope biogenesis, outer membrane]	NA|188aa|down_8|NZ_AP014854.2_2977212_2977776_+	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|325aa|down_9|NZ_AP014854.2_2977826_2978801_+	COG3712, FecR, periplasmic ferric-dicitrate binding protein FecR, regulates iron transport through sigma-19 [Inorganic ion transport and metabolism, Signal transduction mechanisms]
GCF_001548155.2_BV133_assembly_1.0	NZ_AP014854	Blastochloris viridis strain DSM 133	4	3250558-3251790	4,4,3	PILER-CR,CRISPRCasFinder,CRT	no	csb1gr7,csb2gr5,cas3,cas8u1,cas1,cas2	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	Unclear	GTATTCCCCGAGTAATCCGCTCGGGCCTCATTGAAGC,GTATTCCCCGAGTAATCCGCTCGGGCCTCATTGAAGC,GTATTCCCCGAGTAATCCGCTCGGGCCTCATTGAAGC	37,37,37	0	0	NA	NA	NA:NA:NA	15,15,16	16	Unclear	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	cas8u1|332aa|up_2|NZ_AP014854.2_3245787_3246783_+,NA	NA|347aa|up_9|NZ_AP014854.2_3235478_3236519_-	PRK07471, PRK07471, DNA polymerase III subunit delta'; Validated	NA|228aa|up_8|NZ_AP014854.2_3236515_3237199_-	PRK13973, PRK13973, thymidylate kinase; Provisional	NA|404aa|up_7|NZ_AP014854.2_3237195_3238407_-	COG1686, DacC, D-alanyl-D-alanine carboxypeptidase [Cell envelope biogenesis, outer membrane]	NA|297aa|up_6|NZ_AP014854.2_3238509_3239400_-	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	csb1gr7|408aa|up_5|NZ_AP014854.2_3239958_3241182_+	pfam09617, Cas_GSU0053, CRISPR-associated protein GSU0053 (Cas_GSU0053)	csb2gr5|499aa|up_4|NZ_AP014854.2_3241178_3242675_+	TIGR02165, CRISPR-associated_protein_GSU0054_family, CRISPR-associated protein GSU0054/csb2, Dpsyc system	cas3|1040aa|up_3|NZ_AP014854.2_3242671_3245791_+	cd09696, Cas3_I, CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-termus of Helicase domain	cas8u1|332aa|up_2|NZ_AP014854.2_3245787_3246783_+	NA	cas1|605aa|up_1|NZ_AP014854.2_3247792_3249607_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|102aa|up_0|NZ_AP014854.2_3249718_3250024_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|260aa|down_0|NZ_AP014854.2_3251852_3252632_+	COG2128, COG2128, Uncharacterized conserved protein [Function unknown]	NA|325aa|down_1|NZ_AP014854.2_3252679_3253654_+	COG0535, COG0535, Predicted Fe-S oxidoreductases [General function prediction only]	NA|185aa|down_2|NZ_AP014854.2_3254203_3254758_+	pfam05974, DUF892, Domain of unknown function (DUF892)	NA|144aa|down_3|NZ_AP014854.2_3255085_3255517_-	cd04623, CBS_pair_bac_euk, Two tandem repeats of the cystathionine beta-synthase (CBS pair) domains present in bacteria and eukaryotes	NA|244aa|down_4|NZ_AP014854.2_3255622_3256354_-	COG0705, COG0705, Membrane associated serine protease [Amino acid transport and metabolism]	NA|182aa|down_5|NZ_AP014854.2_3256984_3257530_+	COG5388, COG5388, Uncharacterized protein conserved in bacteria [Function unknown]	NA|208aa|down_6|NZ_AP014854.2_3257623_3258247_+	pfam07238, PilZ, PilZ domain	NA|292aa|down_7|NZ_AP014854.2_3258343_3259219_+	COG0697, RhaT, Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only]	NA|225aa|down_8|NZ_AP014854.2_3259460_3260135_+	pfam06035, Peptidase_C93, Bacterial transglutaminase-like cysteine proteinase BTLCP	NA|176aa|down_9|NZ_AP014854.2_3260518_3261046_-	cd04645, LbH_gamma_CA_like, Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E
GCF_001548155.2_BV133_assembly_1.0	NZ_AP014854	Blastochloris viridis strain DSM 133	5	3471976-3472078	5	CRISPRCasFinder	no		DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	Orphan	CGCGCATTAACCTGAGCGCGTCATCC	26	0	0	NA	NA	NA	1	1	Orphan	DEDDh,csa3,cas3,cas2,cas1,cas4,cas7b,cas8c,cas5,cas8u2,cas7,csb2gr5,csb1gr7,cas8u1,WYL	NA|48aa|up_9|NZ_AP014854.2_3461531_3461675_-,NA	NA|48aa|up_9|NZ_AP014854.2_3461531_3461675_-	NA	NA|87aa|up_8|NZ_AP014854.2_3461804_3462065_+	pfam11625, DUF3253, Protein of unknown function (DUF3253)	NA|263aa|up_7|NZ_AP014854.2_3462170_3462959_-	cd06259, YdcF-like, YdcF-like	NA|188aa|up_6|NZ_AP014854.2_3463116_3463680_-	pfam02592, Vut_1, Putative vitamin uptake transporter	NA|236aa|up_5|NZ_AP014854.2_3463679_3464387_-	pfam06508, QueC, Queuosine biosynthesis protein QueC	NA|373aa|up_4|NZ_AP014854.2_3464471_3465590_-	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|231aa|up_3|NZ_AP014854.2_3465672_3466365_-	cd10917, CE4_NodB_like_6s_7s, Catalytic NodB homology domain of rhizobial NodB-like proteins	NA|471aa|up_2|NZ_AP014854.2_3466712_3468125_+	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|726aa|up_1|NZ_AP014854.2_3468321_3470499_+	COG5001, COG5001, Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain [Signal transduction mechanisms]	NA|391aa|up_0|NZ_AP014854.2_3470590_3471763_+	cd01156, IVD, Isovaleryl-CoA dehydrogenase	NA|1527aa|down_0|NZ_AP014854.2_3472761_3477342_-	cd05561, Peptidases_S8_4, Peptidase S8 family domain, uncharacterized subfamily 4	NA|568aa|down_1|NZ_AP014854.2_3477353_3479057_-	COG2831, FhaC, Hemolysin activation/secretion protein [Intracellular trafficking and secretion]	NA|81aa|down_2|NZ_AP014854.2_3479716_3479959_-	PRK06033, PRK06033, flagellar motor switch protein FliN	NA|237aa|down_3|NZ_AP014854.2_3480042_3480753_+	PRK14341, PRK14341, lipoyl(octanoyl) transferase LipB	NA|98aa|down_4|NZ_AP014854.2_3480769_3481063_-	COG1254, AcyP, Acylphosphatases [Energy production and conversion]	NA|190aa|down_5|NZ_AP014854.2_3481271_3481841_+	PRK14951, PRK14951, DNA polymerase III subunits gamma and tau; Provisional	NA|409aa|down_6|NZ_AP014854.2_3481844_3483071_-	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|679aa|down_7|NZ_AP014854.2_3483265_3485302_-	COG4770, COG4770, Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism]	NA|423aa|down_8|NZ_AP014854.2_3485381_3486650_-	COG3174, COG3174, Predicted membrane protein [Function unknown]	NA|301aa|down_9|NZ_AP014854.2_3486798_3487701_+	cd08347, PcpA_C_like, C-terminal domain of Sphingobium chlorophenolicum 2,6-dichloro-p-hydroquinone 1,2-dioxygenase (PcpA), and similar proteins
