assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002081995.1_ASM208199v1	NZ_CP015108	Sporosarcina ureae strain S204, complete genome	1	174071-174179	1	CRISPRCasFinder	no		cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	Orphan	CCCAACTTGCAACCGCTCATAAGGA	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	NA,NA|106aa|down_1|NZ_CP015108.1_174800_175118_+	NA|171aa|up_9|NZ_CP015108.1_165903_166416_+	pfam07301, DUF1453, Protein of unknown function (DUF1453)	NA|137aa|up_8|NZ_CP015108.1_166913_167324_-	pfam11084, DUF2621, Protein of unknown function (DUF2621)	NA|74aa|up_7|NZ_CP015108.1_167457_167679_+	pfam07872, DUF1659, Protein of unknown function (DUF1659)	NA|73aa|up_6|NZ_CP015108.1_167702_167921_+	pfam11148, DUF2922, Protein of unknown function (DUF2922)	NA|43aa|up_5|NZ_CP015108.1_167996_168125_+	pfam12841, YvrJ, YvrJ protein family	NA|197aa|up_4|NZ_CP015108.1_168244_168835_+	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|228aa|up_3|NZ_CP015108.1_168855_169539_+	cd13399, Slt35-like, Slt35-like lytic transglycosylase	NA|85aa|up_2|NZ_CP015108.1_169837_170092_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|183aa|up_1|NZ_CP015108.1_170223_170772_-	cd00431, cysteine_hydrolases, Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle	NA|905aa|up_0|NZ_CP015108.1_171148_173863_+	PRK09277, PRK09277, aconitate hydratase AcnA	NA|142aa|down_0|NZ_CP015108.1_174375_174801_+	COG0824, FcbC, Predicted thioesterase [General function prediction only]	NA|106aa|down_1|NZ_CP015108.1_174800_175118_+	NA	NA|98aa|down_2|NZ_CP015108.1_175146_175440_-	COG4841, COG4841, Uncharacterized protein conserved in bacteria [Function unknown]	NA|201aa|down_3|NZ_CP015108.1_175469_176072_-	PRK00220, PRK00220, glycerol-3-phosphate 1-O-acyltransferase PlsY	NA|658aa|down_4|NZ_CP015108.1_176345_178319_+	PRK05644, gyrB, DNA gyrase subunit B; Validated	NA|809aa|down_5|NZ_CP015108.1_178315_180742_+	PRK05561, PRK05561, DNA topoisomerase 4 subunit A	NA|715aa|down_6|NZ_CP015108.1_181159_183304_+	COG1042, COG1042, Acyl-CoA synthetase (NDP forming) [Energy production and conversion]	NA|148aa|down_7|NZ_CP015108.1_183318_183762_+	pfam13452, MaoC_dehydrat_N, N-terminal half of MaoC dehydratase	NA|142aa|down_8|NZ_CP015108.1_183777_184203_+	cd03453, SAV4209_like, SAV4209_like	NA|387aa|down_9|NZ_CP015108.1_184232_185393_+	PRK07855, PRK07855, lipid-transfer protein; Provisional
GCF_002081995.1_ASM208199v1	NZ_CP015108	Sporosarcina ureae strain S204, complete genome	2	1008656-1008731	2	CRISPRCasFinder	no		cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	Orphan	GGAAACTATTCTGAATTGCGCCA	23	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	NA|222aa|up_1|NZ_CP015108.1_1006484_1007150_-,NA|88aa|down_0|NZ_CP015108.1_1008848_1009112_-,NA|92aa|down_9|NZ_CP015108.1_1015453_1015729_-	NA|195aa|up_9|NZ_CP015108.1_996513_997098_-	pfam17932, TetR_C_24, Tetracyclin repressor-like, C-terminal domain	NA|400aa|up_8|NZ_CP015108.1_997754_998954_+	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|179aa|up_7|NZ_CP015108.1_999054_999591_-	pfam07553, Lipoprotein_Ltp, Host cell surface-exposed lipoprotein	NA|214aa|up_6|NZ_CP015108.1_999830_1000472_-	pfam04307, YdjM, LexA-binding, inner membrane-associated putative hydrolase	NA|438aa|up_5|NZ_CP015108.1_1000622_1001936_-	COG2031, AtoE, Short chain fatty acids transporter [Lipid metabolism]	NA|553aa|up_4|NZ_CP015108.1_1002147_1003806_-	PRK07843, PRK07843, 3-oxosteroid 1-dehydrogenase	NA|258aa|up_3|NZ_CP015108.1_1004243_1005017_+	PRK06172, PRK06172, SDR family oxidoreductase	NA|389aa|up_2|NZ_CP015108.1_1005086_1006252_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|222aa|up_1|NZ_CP015108.1_1006484_1007150_-	NA	NA|307aa|up_0|NZ_CP015108.1_1007617_1008538_-	cd07025, Peptidase_S66, LD-Carboxypeptidase, a serine protease, includes microcin C7 self immunity protein	NA|88aa|down_0|NZ_CP015108.1_1008848_1009112_-	NA	NA|379aa|down_1|NZ_CP015108.1_1009801_1010938_+	COG1125, OpuBA, ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism]	NA|218aa|down_2|NZ_CP015108.1_1010930_1011584_+	COG1174, OpuBB, ABC-type proline/glycine betaine transport systems, permease component [Amino acid transport and metabolism]	NA|297aa|down_3|NZ_CP015108.1_1011585_1012476_+	cd13609, PBP2_Opu_like_1, Substrate-binding domain of putative ABC-type osmoprotectant uptake system; the type 2 periplasmic-binding protein fold	NA|151aa|down_4|NZ_CP015108.1_1012651_1013104_-	cd08865, SRPBCC_10, Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins	NA|124aa|down_5|NZ_CP015108.1_1013222_1013594_-	pfam00903, Glyoxalase, Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily	NA|129aa|down_6|NZ_CP015108.1_1013654_1014041_-	pfam08818, DUF1801, Domain of unknown function (DU1801)	NA|143aa|down_7|NZ_CP015108.1_1014227_1014656_-	COG2764, PhnB, Uncharacterized protein conserved in bacteria [Function unknown]	NA|132aa|down_8|NZ_CP015108.1_1014762_1015158_-	COG3324, COG3324, Predicted enzyme related to lactoylglutathione lyase [General function prediction only]	NA|92aa|down_9|NZ_CP015108.1_1015453_1015729_-	NA
GCF_002081995.1_ASM208199v1	NZ_CP015108	Sporosarcina ureae strain S204, complete genome	3	2014300-2014408	3	CRISPRCasFinder	no		cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	Orphan	CTTGCTTAAGCGCTCATAGTTCGTC	25	0	0	NA	NA	NA	1	1	Orphan	cas3,RT,csa3,cas14j,DEDDh,PrimPol,DinG	NA|317aa|up_7|NZ_CP015108.1_2004396_2005347_+,NA	NA|199aa|up_9|NZ_CP015108.1_2002658_2003255_-	pfam13508, Acetyltransf_7, Acetyltransferase (GNAT) domain	NA|205aa|up_8|NZ_CP015108.1_2003663_2004278_-	COG2323, COG2323, Predicted membrane protein [Function unknown]	NA|317aa|up_7|NZ_CP015108.1_2004396_2005347_+	NA	NA|460aa|up_6|NZ_CP015108.1_2005353_2006733_-	TIGR00931, Uncharacterized_Na+/H+_antiporter_HI_1107, Na+/H+ antiporter NhaC	NA|420aa|up_5|NZ_CP015108.1_2006997_2008257_+	COG2124, CypX, Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism]	NA|499aa|up_4|NZ_CP015108.1_2008522_2010019_+	pfam00939, Na_sulph_symp, Sodium:sulfate symporter transmembrane region	NA|504aa|up_3|NZ_CP015108.1_2010126_2011638_-	COG0531, PotE, Amino acid transporters [Amino acid transport and metabolism]	NA|274aa|up_2|NZ_CP015108.1_2011873_2012695_+	cd13711, PBP2_Ngo0372_TcyA, Substrate binding domain of ABC transporters involved in cystine import; the type 2 periplasmic binding protein fold	NA|239aa|up_1|NZ_CP015108.1_2012684_2013401_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|246aa|up_0|NZ_CP015108.1_2013397_2014135_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|105aa|down_0|NZ_CP015108.1_2014535_2014850_-	COG3695, COG3695, Predicted methylated DNA-protein cysteine methyltransferase [DNA replication, recombination, and repair]	NA|424aa|down_1|NZ_CP015108.1_2014973_2016245_+	TIGR01814, kynureninase, kynureninase	NA|211aa|down_2|NZ_CP015108.1_2016241_2016874_+	TIGR03035, trp_arylform, arylformamidase	NA|273aa|down_3|NZ_CP015108.1_2016870_2017689_+	TIGR03036, trp_2_3_diox, tryptophan 2,3-dioxygenase	NA|547aa|down_4|NZ_CP015108.1_2017754_2019395_+	cd04506, SGNH_hydrolase_YpmR_like, Members of the SGNH-hydrolase superfamily, a diverse family of lipases and esterases	NA|410aa|down_5|NZ_CP015108.1_2019496_2020726_-	pfam00375, SDF, Sodium:dicarboxylate symporter family	NA|140aa|down_6|NZ_CP015108.1_2020915_2021335_+	smart00347, HTH_MARR, helix_turn_helix multiple antibiotic resistance protein	NA|395aa|down_7|NZ_CP015108.1_2021347_2022532_+	cd17489, MFS_YfcJ_like, Escherichia coli YfcJ, YhhS, and similar transporters of the Major Facilitator Superfamily	NA|315aa|down_8|NZ_CP015108.1_2022582_2023527_-	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|271aa|down_9|NZ_CP015108.1_2023530_2024343_-	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]
