assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	1	129389-129517	1	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Orphan	AGAGCGAGATACCGGAGCAAGTCCGGTATGACGGC	35	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA|409aa|up_8|NZ_CP021376.1_116270_117497_+,NA	NA|246aa|up_9|NZ_CP021376.1_115532_116270_+	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|409aa|up_8|NZ_CP021376.1_116270_117497_+	NA	NA|264aa|up_7|NZ_CP021376.1_117543_118335_+	pfam00535, Glycos_transf_2, Glycosyl transferase family 2	NA|400aa|up_6|NZ_CP021376.1_118357_119557_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional	NA|319aa|up_5|NZ_CP021376.1_119613_120570_+	cd05232, UDP_G4E_4_SDR_e, UDP-glucose 4 epimerase, subgroup 4, extended (e) SDRs	NA|183aa|up_4|NZ_CP021376.1_120573_121122_+	pfam02397, Bac_transf, Bacterial sugar transferase	NA|127aa|up_3|NZ_CP021376.1_121309_121690_+	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|637aa|up_2|NZ_CP021376.1_121792_123703_+	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|121aa|up_1|NZ_CP021376.1_126017_126380_-	TIGR04256, conserved_hypothetical_protein, GxxExxY protein	NA|547aa|up_0|NZ_CP021376.1_126570_128211_+	PRK00179, pgi, glucose-6-phosphate isomerase; Reviewed	NA|241aa|down_0|NZ_CP021376.1_129699_130422_-	pfam04932, Wzy_C, O-Antigen ligase	NA|247aa|down_1|NZ_CP021376.1_130937_131678_-	pfam01755, Glyco_transf_25, Glycosyltransferase family 25 (LPS biosynthesis protein)	NA|267aa|down_2|NZ_CP021376.1_131800_132601_-	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|243aa|down_3|NZ_CP021376.1_133503_134232_+	PRK05299, rpsB, 30S ribosomal protein S2; Provisional	NA|294aa|down_4|NZ_CP021376.1_134339_135221_+	PRK09377, tsf, elongation factor Ts; Provisional	NA|244aa|down_5|NZ_CP021376.1_135290_136022_+	cd04254, AAK_UMPK-PyrH-Ec, UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway)	NA|186aa|down_6|NZ_CP021376.1_136792_137350_+	PRK00083, frr, ribosome recycling factor; Reviewed	NA|253aa|down_7|NZ_CP021376.1_137441_138200_+	cd00475, Cis_IPPS, Cis (Z)-Isoprenyl Diphosphate Synthases	NA|292aa|down_8|NZ_CP021376.1_138590_139466_+	PRK11624, cdsA, phosphatidate cytidylyltransferase	NA|397aa|down_9|NZ_CP021376.1_139465_140656_+	PRK05447, PRK05447, 1-deoxy-D-xylulose 5-phosphate reductoisomerase; Provisional
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	2	163816-163992	2	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Orphan	ATCCTGATGCACATCAGGATCTGGTCTAGTTTTAGTATCTAAGAGCGAGATACCG	55	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA,NA	NA|154aa|up_9|NZ_CP021376.1_147016_147478_+	PRK00006, fabZ, 3-hydroxyacyl-ACP dehydratase FabZ	NA|264aa|up_8|NZ_CP021376.1_147480_148272_+	PRK05289, PRK05289, acyl-ACP--UDP-N-acetylglucosamine O-acyltransferase	NA|406aa|up_7|NZ_CP021376.1_148246_149464_+	PRK00025, lpxB, lipid-A-disaccharide synthase; Reviewed	NA|204aa|up_6|NZ_CP021376.1_149463_150075_+	PRK00015, rnhB, ribonuclease HII; Validated	NA|1169aa|up_5|NZ_CP021376.1_150071_153578_+	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|317aa|up_4|NZ_CP021376.1_153586_154537_+	PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated	NA|978aa|up_3|NZ_CP021376.1_154979_157913_+	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|1004aa|up_2|NZ_CP021376.1_158137_161149_+	COG3292, COG3292, Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]	NA|250aa|up_1|NZ_CP021376.1_161208_161958_+	cd07334, M48C_loiP_like, Peptidase M48C Ste24p loiP-like, integral membrane protein	NA|447aa|up_0|NZ_CP021376.1_162069_163410_+	PRK10660, tilS, tRNA(Ile)-lysidine synthetase; Provisional	NA|269aa|down_0|NZ_CP021376.1_164165_164972_-	PRK11756, PRK11756, exonuclease III; Provisional	NA|353aa|down_1|NZ_CP021376.1_165251_166310_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|1033aa|down_2|NZ_CP021376.1_166296_169395_+	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|160aa|down_3|NZ_CP021376.1_169763_170243_-	pfam11993, Ribosomal_S4Pg, Ribosomal S4P (gammaproteobacterial)	NA|362aa|down_4|NZ_CP021376.1_170795_171881_-	PRK12879, PRK12879, 3-oxoacyl-(acyl carrier protein) synthase III; Reviewed	NA|126aa|down_5|NZ_CP021376.1_171990_172368_-	pfam04134, DUF393, Protein of unknown function, DUF393	NA|493aa|down_6|NZ_CP021376.1_172469_173948_-	cd06460, M32_Taq, Peptidase family M32, which includes thermostable carboxypeptidases TaqCP, PfuCP and FisCP	NA|50aa|down_7|NZ_CP021376.1_174320_174470_-	pfam11346, DUF3149, Protein of unknown function (DUF3149)	NA|289aa|down_8|NZ_CP021376.1_174728_175595_-	cd14789, Tiki, Tiki homology domain antagonizes Wnt function via cleavage of amino-terminal residues	NA|112aa|down_9|NZ_CP021376.1_175668_176004_+	pfam08921, DUF1904, Domain of unknown function (DUF1904)
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	3	200836-200995	3	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Orphan	TTTTCTGCTGAGATTTTAAAAAGCGAGATACCGGAGCAAGCCCGATAAGACGG	53	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA,NA	NA|275aa|up_9|NZ_CP021376.1_190250_191075_+	COG0613, COG0613, Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only]	NA|207aa|up_8|NZ_CP021376.1_191093_191714_+	PRK11630, PRK11630, threonylcarbamoyl-AMP synthase	NA|311aa|up_7|NZ_CP021376.1_191736_192669_+	PRK10700, PRK10700, 23S rRNA pseudouridine(2605) synthase RluB	NA|338aa|up_6|NZ_CP021376.1_193039_194053_-	TIGR02291, conserved_protein_of_unknown_function, alpha-L-glutamate ligase-related protein	NA|510aa|up_5|NZ_CP021376.1_194045_195575_-	pfam14402, 7TM_transglut, 7 transmembrane helices usually fused to an inactive transglutaminase	NA|258aa|up_4|NZ_CP021376.1_195582_196356_-	COG4067, COG4067, Uncharacterized protein conserved in archaea [Posttranslational modification, protein turnover, chaperones]	NA|336aa|up_3|NZ_CP021376.1_196348_197356_-	PRK15068, PRK15068, tRNA 5-methoxyuridine(34)/uridine 5-oxyacetic acid(34) synthase CmoB	NA|243aa|up_2|NZ_CP021376.1_197418_198147_-	TIGR00740, tRNA_cmo5U34-methyltransferase, tRNA (cmo5U34)-methyltransferase	NA|282aa|up_1|NZ_CP021376.1_198411_199257_-	PRK10302, PRK10302, hypothetical protein; Provisional	NA|458aa|up_0|NZ_CP021376.1_199411_200785_+	cd13149, MATE_like_2, Uncharacterized subfamily of the multidrug and toxic compound extrusion (MATE) proteins	NA|601aa|down_0|NZ_CP021376.1_201227_203030_+	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|146aa|down_1|NZ_CP021376.1_203032_203470_+	PRK09438, nudB, dihydroneopterin triphosphate pyrophosphatase; Provisional	NA|247aa|down_2|NZ_CP021376.1_203535_204276_+	PRK00110, PRK00110, YebC/PmpR family DNA-binding transcriptional regulator	NA|174aa|down_3|NZ_CP021376.1_204400_204922_+	PRK00039, ruvC, Holliday junction resolvase; Reviewed	NA|202aa|down_4|NZ_CP021376.1_205078_205684_+	PRK00116, ruvA, Holliday junction branch migration protein RuvA	NA|345aa|down_5|NZ_CP021376.1_205686_206721_+	PRK00080, ruvB, Holliday junction branch migration DNA helicase RuvB	NA|137aa|down_6|NZ_CP021376.1_206900_207311_+	TIGR02799, Acyl-CoA_thioesterase_YbgC, tol-pal system-associated acyl-CoA thioesterase	NA|227aa|down_7|NZ_CP021376.1_207300_207981_+	TIGR02796, Protein_TolQ, TolQ protein	NA|142aa|down_8|NZ_CP021376.1_207980_208406_+	TIGR02801, Protein_TolR, TolR protein	NA|388aa|down_9|NZ_CP021376.1_208413_209577_+	PTZ00121, PTZ00121, MAEBL; Provisional
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	4	538869-539024	4	CRISPRCasFinder	no	DinG	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Type IV-A	CGAAAGTCAGGATCTGTTAAGGTCAGATGCTATATGTGTGAGATACCGGA	50	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA,NA|123aa|down_3|NZ_CP021376.1_544311_544680_-,NA|255aa|down_7|NZ_CP021376.1_547840_548605_-	NA|438aa|up_9|NZ_CP021376.1_526428_527742_+	PRK08963, fadI, 3-ketoacyl-CoA thiolase; Reviewed	NA|728aa|up_8|NZ_CP021376.1_527738_529922_+	PRK11154, fadJ, fatty acid oxidation complex subunit alpha FadJ	NA|639aa|up_7|NZ_CP021376.1_530289_532206_+	PRK05218, PRK05218, heat shock protein 90; Provisional	NA|215aa|up_6|NZ_CP021376.1_532484_533129_+	PRK00279, adk, adenylate kinase; Reviewed	NA|327aa|up_5|NZ_CP021376.1_533236_534217_+	PRK00035, hemH, ferrochelatase; Reviewed	NA|142aa|up_4|NZ_CP021376.1_534299_534725_-	NF033429, ImuA_translesion, translesion DNA synthesis-associated protein ImuA	NA|68aa|up_3|NZ_CP021376.1_534903_535107_+	pfam14056, DUF4250, Domain of unknown function (DUF4250)	NA|228aa|up_2|NZ_CP021376.1_535172_535856_+	TIGR03840, TMPT_Se_Te, thiopurine S-methyltransferase, Se/Te detoxification family	NA|456aa|up_1|NZ_CP021376.1_536131_537499_+	COG3314, COG3314, Uncharacterized protein conserved in bacteria [Function unknown]	NA|191aa|up_0|NZ_CP021376.1_537583_538156_+	pfam06041, DUF924, Bacterial protein of unknown function (DUF924)	NA|209aa|down_0|NZ_CP021376.1_539072_539699_-	pfam09829, DUF2057, Uncharacterized protein conserved in bacteria (DUF2057)	DinG|703aa|down_1|NZ_CP021376.1_539868_541977_-	PRK11747, dinG, ATP-dependent DNA helicase DinG; Provisional	NA|352aa|down_2|NZ_CP021376.1_542355_543411_+	COG3203, OmpC, Outer membrane protein (porin) [Cell envelope biogenesis, outer membrane]	NA|123aa|down_3|NZ_CP021376.1_544311_544680_-	NA	NA|312aa|down_4|NZ_CP021376.1_544748_545684_-	PRK10550, PRK10550, tRNA dihydrouridine(16) synthase DusC	NA|394aa|down_5|NZ_CP021376.1_545924_547106_+	cd17324, MFS_NepI_like, Purine ribonucleoside efflux pump NepI and similar transporters of the Major Facilitator Superfamily	NA|180aa|down_6|NZ_CP021376.1_547307_547847_-	pfam06185, YecM, YecM protein	NA|255aa|down_7|NZ_CP021376.1_547840_548605_-	NA	NA|325aa|down_8|NZ_CP021376.1_548813_549788_+	PRK12681, cysB, HTH-type transcriptional regulator CysB	NA|136aa|down_9|NZ_CP021376.1_549887_550295_-	TIGR00068, Lactoylglutathione_lyase, lactoylglutathione lyase
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	5	954577-956345	5,1,1	CRISPRCasFinder,PILER-CR,CRT	no	cas6f,cas7f,cas5f,cas8f,cas3f,cas1	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Type I-F	TTTCTAAGCCGCCTGTGTGGCGGTGAAG,TTTCTAAGCCGCCTGTGTGGCGGTGAAG,TTTCTAAGCCGCCTGTGTGGCGGTGAAG	28,28,28	0	0	NA	NA	I-F:I-F:I-F	29,28,28	29	TypeI-F	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA|95aa|up_0|NZ_CP021376.1_954175_954460_-,NA	NA|255aa|up_9|NZ_CP021376.1_943801_944566_-	PRK03695, PRK03695, vitamin B12-transporter ATPase; Provisional	NA|261aa|up_8|NZ_CP021376.1_945605_946388_-	PRK10621, PRK10621, hypothetical protein; Provisional	NA|205aa|up_7|NZ_CP021376.1_946593_947208_+	PRK14054, PRK14054, peptide-methionine (S)-S-oxide reductase	NA|157aa|up_6|NZ_CP021376.1_947210_947681_+	PRK00222, PRK00222, peptide-methionine (R)-S-oxide reductase MsrB	NA|496aa|up_5|NZ_CP021376.1_947717_949205_-	COG0189, RimK, Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis]	NA|373aa|up_4|NZ_CP021376.1_949339_950458_+	pfam11814, DUF3335, Peptidase_C39 like family	NA|152aa|up_3|NZ_CP021376.1_951491_951947_+	pfam05818, TraT, Enterobacterial TraT complement resistance protein	NA|200aa|up_2|NZ_CP021376.1_952292_952892_-	pfam18433, DUF5610, Domain of unknown function (DUF5610)	NA|350aa|up_1|NZ_CP021376.1_953078_954128_+	cd19094, AKR_Tas-like, Escherichia coli Tas protein and similar proteins	NA|95aa|up_0|NZ_CP021376.1_954175_954460_-	NA	cas6f|211aa|down_0|NZ_CP021376.1_956473_957106_-	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	cas7f|342aa|down_1|NZ_CP021376.1_957115_958141_-	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas5f|316aa|down_2|NZ_CP021376.1_958170_959118_-	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas8f|422aa|down_3|NZ_CP021376.1_959114_960380_-	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas3f|1136aa|down_4|NZ_CP021376.1_960652_964060_-	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	cas1|326aa|down_5|NZ_CP021376.1_964056_965034_-	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	NA|291aa|down_6|NZ_CP021376.1_965258_966131_-	cd05355, SDR_c1, classical (c) SDR, subgroup 1	NA|363aa|down_7|NZ_CP021376.1_966363_967452_+	cd11386, MCP_signal, Methyl-accepting chemotaxis protein (MCP), signaling domain	NA|290aa|down_8|NZ_CP021376.1_968031_968901_-	PRK05457, PRK05457, protease HtpX	NA|231aa|down_9|NZ_CP021376.1_968991_969684_-	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD
GCF_002157875.1_ASM215787v1	NZ_CP021376	Oceanisphaera avium strain AMac2203 chromosome, complete genome	6	2460707-2460795	6	CRISPRCasFinder	no		cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	Orphan	CTTAATCTTTACGTCGTCATCCT	23	0	0	NA	NA	NA	1	1	Orphan	cas3,DinG,DEDDh,csa3,cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL	NA,NA	NA|259aa|up_9|NZ_CP021376.1_2450472_2451249_-	cd13619, PBP2_GlnP, Glutamine-binding domain of ABC transporter, a member of the type 2 periplasmic binding fold protein superfamily	NA|153aa|up_8|NZ_CP021376.1_2451481_2451940_-	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|317aa|up_7|NZ_CP021376.1_2452116_2453067_+	COG0385, COG0385, Predicted Na+-dependent transporter [General function prediction only]	NA|163aa|up_6|NZ_CP021376.1_2453125_2453614_-	COG3762, COG3762, Predicted membrane protein [Function unknown]	NA|291aa|up_5|NZ_CP021376.1_2453663_2454536_-	COG1512, COG1512, Beta-propeller domains of methanol dehydrogenase type [General function prediction only]	NA|208aa|up_4|NZ_CP021376.1_2454551_2455175_-	COG1704, LemA, Uncharacterized conserved protein [Function unknown]	NA|189aa|up_3|NZ_CP021376.1_2455224_2455791_-	pfam04343, DUF488, Protein of unknown function, DUF488	NA|139aa|up_2|NZ_CP021376.1_2456090_2456507_+	pfam17723, RHH_8, Ribbon-Helix-Helix transcriptional regulator family	NA|427aa|up_1|NZ_CP021376.1_2456561_2457842_+	pfam10503, Esterase_phd, Esterase PHB depolymerase	NA|881aa|up_0|NZ_CP021376.1_2457916_2460559_-	PRK00009, PRK00009, phosphoenolpyruvate carboxylase; Reviewed	NA|387aa|down_0|NZ_CP021376.1_2460921_2462082_-	PRK05111, PRK05111, acetylornithine deacetylase; Provisional	NA|338aa|down_1|NZ_CP021376.1_2462325_2463339_+	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|263aa|down_2|NZ_CP021376.1_2463343_2464132_+	cd04249, AAK_NAGK-NC, AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis	NA|306aa|down_3|NZ_CP021376.1_2464159_2465077_+	PRK14805, PRK14805, ornithine carbamoyltransferase; Provisional	NA|404aa|down_4|NZ_CP021376.1_2465076_2466288_+	PRK00509, PRK00509, argininosuccinate synthase; Provisional	NA|625aa|down_5|NZ_CP021376.1_2466503_2468378_+	PRK12308, PRK12308, argininosuccinate lyase	NA|104aa|down_6|NZ_CP021376.1_2468965_2469277_+	PRK00596, rpsJ, 30S ribosomal protein S10; Reviewed	NA|213aa|down_7|NZ_CP021376.1_2469294_2469933_+	PRK00001, rplC, 50S ribosomal protein L3; Validated	NA|202aa|down_8|NZ_CP021376.1_2469950_2470556_+	PRK05319, rplD, 50S ribosomal protein L4; Provisional	NA|101aa|down_9|NZ_CP021376.1_2470552_2470855_+	PRK05738, rplW, 50S ribosomal protein L23; Reviewed
