assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000828475.1_ASM82847v1	NZ_AP014648	Methyloceanibacter caenitepidi strain Gela4	1	912851-912972	1	CRISPRCasFinder	no	cas3	DEDDh,cas3,csa3,RT	Unclear	GCCAGGTCGCCGTTATTTGTGAGATTGATTGG	32	0	0	NA	NA	NA	1	1	Unclear	DEDDh,cas3,csa3,RT	NA,NA|73aa|down_8|NZ_AP014648.1_921871_922090_-,NA|77aa|down_9|NZ_AP014648.1_922364_922595_-	NA|116aa|up_9|NZ_AP014648.1_897320_897668_+	cd03034, ArsC_ArsC, Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli	cas3|850aa|up_8|NZ_AP014648.1_898176_900726_+	TIGR04121, ATP-dependent_helicase, DEXH box helicase, DNA ligase-associated	NA|234aa|up_7|NZ_AP014648.1_900736_901438_+	TIGR04123, hypothetical_protein, metallophosphoesterase, DNA ligase-associated	NA|271aa|up_6|NZ_AP014648.1_901461_902274_-	pfam09608, Alph_Pro_TM, Putative transmembrane protein (Alph_Pro_TM)	NA|309aa|up_5|NZ_AP014648.1_902275_903202_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|972aa|up_4|NZ_AP014648.1_903346_906262_-	sd00010, SLR, Sel1-like repeat	NA|145aa|up_3|NZ_AP014648.1_906559_906994_+	pfam01894, UPF0047, Uncharacterized protein family UPF0047	NA|321aa|up_2|NZ_AP014648.1_906997_907960_-	TIGR01249, Putative_proline_iminopeptidase, proline iminopeptidase, Neisseria-type subfamily	NA|615aa|up_1|NZ_AP014648.1_907979_909824_-	TIGR03648, Na_symport_lg, probable sodium:solute symporter, VC_2705 subfamily	NA|102aa|up_0|NZ_AP014648.1_909827_910133_-	pfam13937, DUF4212, Domain of unknown function (DUF4212)	NA|120aa|down_0|NZ_AP014648.1_913706_914066_+	PLN02593, PLN02593, adrenodoxin-like ferredoxin protein	NA|482aa|down_1|NZ_AP014648.1_914065_915511_+	PRK10637, cysG, siroheme synthase CysG	NA|99aa|down_2|NZ_AP014648.1_915503_915800_+	pfam11011, DUF2849, Protein of unknown function (DUF2849)	NA|552aa|down_3|NZ_AP014648.1_915786_917442_+	COG0155, CysI, Sulfite reductase, beta subunit (hemoprotein) [Inorganic ion transport and metabolism]	NA|385aa|down_4|NZ_AP014648.1_917428_918583_+	PRK02090, PRK02090, phosphoadenylyl-sulfate reductase	NA|302aa|down_5|NZ_AP014648.1_918579_919485_+	PRK05253, PRK05253, sulfate adenylyltransferase subunit CysD	NA|495aa|down_6|NZ_AP014648.1_919484_920969_+	COG2895, CysN, GTPases - Sulfate adenylate transferase subunit 1 [Inorganic ion transport and metabolism]	NA|266aa|down_7|NZ_AP014648.1_921030_921828_+	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|73aa|down_8|NZ_AP014648.1_921871_922090_-	NA	NA|77aa|down_9|NZ_AP014648.1_922364_922595_-	NA
GCF_000828475.1_ASM82847v1	NZ_AP014648	Methyloceanibacter caenitepidi strain Gela4	2	1823741-1823971	2	CRISPRCasFinder	no		DEDDh,cas3,csa3,RT	Orphan	TCGTCGTCGGCATCGTCGTCCGAG	24	0	0	NA	NA	NA	4	4	Orphan	DEDDh,cas3,csa3,RT	NA,NA	NA|128aa|up_9|NZ_AP014648.1_1812602_1812986_+	PRK08183, PRK08183, NADH:ubiquinone oxidoreductase subunit NDUFA12	NA|251aa|up_8|NZ_AP014648.1_1813055_1813808_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|200aa|up_7|NZ_AP014648.1_1813957_1814557_+	COG4765, COG4765, Uncharacterized protein conserved in bacteria [Function unknown]	NA|241aa|up_6|NZ_AP014648.1_1814649_1815372_-	PRK00301, aat, leucyl/phenylalanyl-tRNA--protein transferase; Reviewed	NA|450aa|up_5|NZ_AP014648.1_1815456_1816806_-	PRK08591, PRK08591, acetyl-CoA carboxylase biotin carboxylase subunit; Validated	NA|154aa|up_4|NZ_AP014648.1_1816822_1817284_-	PRK06302, PRK06302, acetyl-CoA carboxylase biotin carboxyl carrier protein	NA|148aa|up_3|NZ_AP014648.1_1817311_1817755_-	PRK05395, PRK05395, type II 3-dehydroquinate dehydratase	NA|268aa|up_2|NZ_AP014648.1_1817950_1818754_-	cd03023, DsbA_Com1_like, DsbA family, Com1-like subfamily; composed of proteins similar to Com1, a 27-kDa outer membrane-associated immunoreactive protein originally found in both acute and chronic disease strains of the pathogenic bacteria Coxiella burnetti	NA|477aa|up_1|NZ_AP014648.1_1818790_1820221_-	COG4783, COG4783, Putative Zn-dependent protease, contains TPR repeats [General function prediction only]	NA|370aa|up_0|NZ_AP014648.1_1820490_1821600_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|411aa|down_0|NZ_AP014648.1_1825221_1826454_+	PRK10319, PRK10319, N-acetylmuramoyl-L-alanine amidase AmiA	NA|876aa|down_1|NZ_AP014648.1_1826601_1829229_+	COG5009, MrcA, Membrane carboxypeptidase/penicillin-binding protein [Cell envelope biogenesis, outer membrane]	NA|373aa|down_2|NZ_AP014648.1_1829350_1830470_+	PRK00578, prfB, peptide chain release factor 2; Validated	NA|135aa|down_3|NZ_AP014648.1_1830554_1830959_-	pfam04519, Bactofilin, Polymer-forming cytoskeletal	NA|1229aa|down_4|NZ_AP014648.1_1831595_1835282_+	pfam13502, AsmA_2, AsmA-like C-terminal region	NA|418aa|down_5|NZ_AP014648.1_1835278_1836532_-	PRK05912, PRK05912, tyrosyl-tRNA synthetase; Validated	NA|228aa|down_6|NZ_AP014648.1_1836676_1837360_-	COG2945, COG2945, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|384aa|down_7|NZ_AP014648.1_1837669_1838821_+	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|491aa|down_8|NZ_AP014648.1_1838875_1840348_+	PRK11814, PRK11814, cysteine desulfurase activator complex subunit SufB; Provisional	NA|253aa|down_9|NZ_AP014648.1_1840411_1841170_+	COG0396, sufC, Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones]
