assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000023465.1_ASM2346v1	NC_014041	Zunongwangia profunda SM-A87, complete sequence	1	1642261-1647855	1,1,1,2,3,4,5,6	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR,PILER-CR,PILER-CR	no	cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	Type I-B	CTTCCAGACCATTCCATCAAACAACTAGGATTGAAAC,CTTCCAGACCATTCCATCAAACAACTAGGATTGAAAC,CTTCCAGACCATTCCATNNAANAACTAGGATTGAAAC,GATAATCTTCCAGACCATTCCATCAAACAACTAGGATTGAAACATCGAATCAAG,TTCCAGAGCATTCCATTGAAAAACTAGGATTGAAA,CTTCCAGAACATTCCATCAAACAACTAGGATTGAAAC,TAACTTCCAGAGCATTCCATTGAAAAACTAGGATTGAAACT,TTCCAGACCATTCCATTGAAAAACTAGGATTGAAAC	37,37,37,54,35,37,41,36	0	0	NA	NA	NA:NA:NA:NA:NA:NA:NA:NA	62,76,76,62,62,62,62,62	76	TypeI-B	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	NA|575aa|up_9|NC_014041.1_1630827_1632552_+,NA|562aa|down_2|NC_014041.1_1654638_1656324_-	NA|575aa|up_9|NC_014041.1_1630827_1632552_+	NA	NA|111aa|up_8|NC_014041.1_1632572_1632905_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	cas8b1|670aa|up_7|NC_014041.1_1633046_1635056_+	pfam09484, Cas_TM1802, CRISPR-associated protein TM1802 (cas_TM1802)	cas7b|338aa|up_6|NC_014041.1_1635076_1636090_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas5|267aa|up_5|NC_014041.1_1636117_1636918_+	TIGR02592, hypothetical_protein_CTC_01466, CRISPR-associated protein Cas5, subtype I-B/HMARI	cas3|809aa|up_4|NC_014041.1_1636914_1639341_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas6|229aa|up_3|NC_014041.1_1639372_1640059_+	pfam17262, DUF5328, Family of unknown function (DUF5328)	cas1|333aa|up_2|NC_014041.1_1640058_1641057_+	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas2|97aa|up_1|NC_014041.1_1641201_1641492_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas4|192aa|up_0|NC_014041.1_1641491_1642067_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	NA|1193aa|down_0|NC_014041.1_1648040_1651619_-	PRK10340, ebgA, cryptic beta-D-galactosidase subunit alpha; Reviewed	NA|880aa|down_1|NC_014041.1_1651906_1654546_-	pfam17389, Bac_rhamnosid6H, Bacterial alpha-L-rhamnosidase 6 hairpin glycosidase domain	NA|562aa|down_2|NC_014041.1_1654638_1656324_-	NA	NA|286aa|down_3|NC_014041.1_1656360_1657218_-	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|572aa|down_4|NC_014041.1_1657234_1658950_-	pfam07980, SusD_RagB, SusD family	NA|1037aa|down_5|NC_014041.1_1658960_1662071_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|347aa|down_6|NC_014041.1_1662216_1663257_-	cd07377, WHTH_GntR, Winged helix-turn-helix (WHTH) DNA-binding domain of the GntR family of transcriptional regulators	NA|702aa|down_7|NC_014041.1_1663404_1665510_+	PRK08324, PRK08324, bifunctional aldolase/short-chain dehydrogenase	NA|427aa|down_8|NC_014041.1_1665606_1666887_+	COG4952, COG4952, Predicted sugar isomerase [Cell envelope biogenesis, outer membrane]	NA|458aa|down_9|NC_014041.1_1666905_1668279_+	cd07772, FGGY_NaCK_like, Novosphingobium aromaticivorans carbohydrate kinase-like proteins; belongs to the FGGY family of carbohydrate kinases
GCF_000023465.1_ASM2346v1	NC_014041	Zunongwangia profunda SM-A87, complete sequence	2	4599707-4599807	2	CRISPRCasFinder	no		csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	Orphan	CGTGTCTACCAATTCCACCACGCCCGCA	28	0	0	NA	NA	NA	1	1	Orphan	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	NA|211aa|up_7|NC_014041.1_4588323_4588956_-,NA|72aa|up_5|NC_014041.1_4589711_4589927_-,NA|152aa|up_2|NC_014041.1_4595805_4596261_-,NA|131aa|down_9|NC_014041.1_4610776_4611169_+	NA|83aa|up_9|NC_014041.1_4586731_4586980_-	cd02980, TRX_Fd_family, Thioredoxin (TRX)-like [2Fe-2S] Ferredoxin (Fd) family; composed of [2Fe-2S] Fds with a TRX fold (TRX-like Fds) and proteins containing domains similar to TRX-like Fd including formate dehydrogenases, NAD-reducing hydrogenases and the subunit E of NADH:ubiquinone oxidoreductase (NuoE)	NA|411aa|up_8|NC_014041.1_4586984_4588217_-	TIGR00900, multidrug_transporter, H+ Antiporter protein	NA|211aa|up_7|NC_014041.1_4588323_4588956_-	NA	NA|230aa|up_6|NC_014041.1_4588968_4589658_-	pfam13795, HupE_UreJ_2, HupE / UreJ protein	NA|72aa|up_5|NC_014041.1_4589711_4589927_-	NA	NA|400aa|up_4|NC_014041.1_4589960_4591160_-	pfam16576, HlyD_D23, Barrel-sandwich domain of CusB or HlyD membrane-fusion	NA|1467aa|up_3|NC_014041.1_4591194_4595595_-	COG3696, COG3696, Putative silver efflux pump [Inorganic ion transport and metabolism]	NA|152aa|up_2|NC_014041.1_4595805_4596261_-	NA	NA|279aa|up_1|NC_014041.1_4596370_4597207_-	pfam06067, DUF932, Domain of unknown function (DUF932)	NA|599aa|up_0|NC_014041.1_4597472_4599269_-	cd16393, SPO0J_N, Thermus thermophilus stage 0 sporulation protein J-like N-terminal domain, ParB family member	NA|463aa|down_0|NC_014041.1_4599933_4601322_+	cd05680, M20_dipept_like, uncharacterized M20 dipeptidase	NA|319aa|down_1|NC_014041.1_4601696_4602653_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|572aa|down_2|NC_014041.1_4602645_4604361_-	pfam05960, DUF885, Bacterial protein of unknown function (DUF885)	NA|335aa|down_3|NC_014041.1_4604834_4605839_-	pfam04307, YdjM, LexA-binding, inner membrane-associated putative hydrolase	NA|236aa|down_4|NC_014041.1_4605838_4606546_-	COG0428, COG0428, Predicted divalent heavy-metal cations transporter [Inorganic ion transport and metabolism]	NA|593aa|down_5|NC_014041.1_4606555_4608334_-	cd03283, ABC_MutS-like, ATP-binding cassette domain of MutS-like homolog	NA|120aa|down_6|NC_014041.1_4608333_4608693_-	COG2315, MmcQ, Uncharacterized protein conserved in bacteria [Function unknown]	NA|206aa|down_7|NC_014041.1_4608710_4609328_-	pfam14014, DUF4230, Protein of unknown function (DUF4230)	NA|249aa|down_8|NC_014041.1_4609684_4610431_-	pfam04199, Cyclase, Putative cyclase	NA|131aa|down_9|NC_014041.1_4610776_4611169_+	NA
GCF_000023465.1_ASM2346v1	NC_014041	Zunongwangia profunda SM-A87, complete sequence	3	4750658-4750743	3	CRISPRCasFinder	no	c2c10_CAS-V-U3,cas9,cas1,cas2	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	Type II-A, Type II-B,Type II-C,Type II-B, or Type II-C?	AAAACCACACTATAATGTGATTTTT	25	0	0	NA	NA	NA	1	1	TypeII-A,TypeII-B,TypeII-C,TypeII-B,orTypeII-C?	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	NA|55aa|up_5|NC_014041.1_4746280_4746445_-,NA|62aa|up_4|NC_014041.1_4746482_4746668_-,NA|342aa|down_4|NC_014041.1_4754766_4755792_-,NA|176aa|down_5|NC_014041.1_4755804_4756332_-	NA|287aa|up_9|NC_014041.1_4740150_4741011_-	pfam02517, Abi, CAAX protease self-immunity	NA|412aa|up_8|NC_014041.1_4741028_4742264_-	cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms	NA|723aa|up_7|NC_014041.1_4742287_4744456_-	COG2274, SunT, ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms]	NA|205aa|up_6|NC_014041.1_4745528_4746143_-	pfam02517, Abi, CAAX protease self-immunity	NA|55aa|up_5|NC_014041.1_4746280_4746445_-	NA	NA|62aa|up_4|NC_014041.1_4746482_4746668_-	NA	NA|114aa|up_3|NC_014041.1_4746721_4747063_-	pfam08279, HTH_11, HTH domain	NA|382aa|up_2|NC_014041.1_4747085_4748231_-	TIGR01843, Hemolysin_secretion_protein_D_plasmid, type I secretion membrane fusion protein, HlyD family	NA|206aa|up_1|NC_014041.1_4748591_4749209_-	smart00850, LytTR, LytTr DNA-binding domain	NA|370aa|up_0|NC_014041.1_4749419_4750529_-	COG3177, COG3177, Fic family protein [Function unknown]	NA|70aa|down_0|NC_014041.1_4750842_4751052_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|110aa|down_1|NC_014041.1_4751059_4751389_+	TIGR03071, couple_hipA, HipA N-terminal domain	NA|328aa|down_2|NC_014041.1_4751388_4752372_+	pfam07804, HipA_C, HipA-like C-terminal domain	NA|535aa|down_3|NC_014041.1_4753149_4754754_-	pfam12696, TraG-D_C, TraM recognition site of TraD and TraG	NA|342aa|down_4|NC_014041.1_4754766_4755792_-	NA	NA|176aa|down_5|NC_014041.1_4755804_4756332_-	NA	NA|150aa|down_6|NC_014041.1_4757442_4757892_-	cd08071, MPN_DUF2466, Mov34/MPN/PAD-1 family	NA|113aa|down_7|NC_014041.1_4758046_4758385_-	pfam00436, SSB, Single-strand binding protein family	NA|255aa|down_8|NC_014041.1_4758957_4759722_+	pfam08378, NERD, Nuclease-related domain	c2c10_CAS-V-U3|366aa|down_9|NC_014041.1_4759897_4760995_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]
GCF_000023465.1_ASM2346v1	NC_014041	Zunongwangia profunda SM-A87, complete sequence	4	4768893-4771007	7,4	PILER-CR,CRISPRCasFinder	no	c2c10_CAS-V-U3,cas9,cas1,cas2,DEDDh	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	Type II-A,Type II-C,Type II-B	CCTGTGAATCCAGTACTAAAAGTACAATTCTGAAAGCAATTCACAAC,CCTGTGAATCCAGTACTAAAAGTACAATTCTGAAAGCAATTCACAAC	47,47	0	0	NA	NA	NA:NA	27,26	27	TypeII-A,TypeII-C,TypeII-B	csa3,DEDDh,cas8b1,cas7b,cas5,cas3,cas6,cas1,cas2,cas4,DinG,WYL,RT,PD-DExK,c2c10_CAS-V-U3,cas9	NA,NA	NA|150aa|up_9|NC_014041.1_4757442_4757892_-	cd08071, MPN_DUF2466, Mov34/MPN/PAD-1 family	NA|113aa|up_8|NC_014041.1_4758046_4758385_-	pfam00436, SSB, Single-strand binding protein family	NA|255aa|up_7|NC_014041.1_4758957_4759722_+	pfam08378, NERD, Nuclease-related domain	c2c10_CAS-V-U3|366aa|up_6|NC_014041.1_4759897_4760995_-	COG0675, COG0675, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|259aa|up_5|NC_014041.1_4760997_4761774_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|80aa|up_4|NC_014041.1_4762209_4762449_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	cas9|1389aa|up_3|NC_014041.1_4762750_4766917_+	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	cas1|299aa|up_2|NC_014041.1_4766923_4767820_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	NA|100aa|up_1|NC_014041.1_4767894_4768194_+	cd10456, GIY-YIG_UPF0213, The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1	cas2|116aa|up_0|NC_014041.1_4768429_4768777_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|310aa|down_0|NC_014041.1_4771287_4772217_-	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|290aa|down_1|NC_014041.1_4772850_4773720_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|119aa|down_2|NC_014041.1_4773731_4774088_-	pfam01527, HTH_Tnp_1, Transposase	NA|106aa|down_3|NC_014041.1_4774559_4774877_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	DEDDh|1463aa|down_4|NC_014041.1_4775091_4779480_-	PRK05673, dnaE, DNA polymerase III subunit alpha; Validated	NA|623aa|down_5|NC_014041.1_4779709_4781578_+	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|253aa|down_6|NC_014041.1_4781579_4782338_+	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|150aa|down_7|NC_014041.1_4782469_4782919_-	pfam09537, DUF2383, Domain of unknown function (DUF2383)	NA|183aa|down_8|NC_014041.1_4783193_4783742_+	PRK14521, rpsP, 30S ribosomal protein S16; Provisional	NA|176aa|down_9|NC_014041.1_4783756_4784284_+	PRK00122, rimM, 16S rRNA-processing protein RimM; Provisional
