assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003966975.1_ASM396697v1	AP018437	Pelolinea submarina MO-CFX1 DNA, complete genome	1	303185-303326	1	CRISPRCasFinder	no		cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	Orphan	TGTGTGACCTTTTTTAACCACAGATAAACACGGATAAAAACA	42	0	0	NA	NA	NA	1	1	Orphan	cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	NA|245aa|up_9|AP018437.1_296544_297279_+,NA|55aa|up_6|AP018437.1_298559_298724_-,NA|215aa|up_5|AP018437.1_298712_299357_+,NA|162aa|down_1|AP018437.1_304822_305308_-	NA|245aa|up_9|AP018437.1_296544_297279_+	NA	NA|238aa|up_8|AP018437.1_297342_298056_+	pfam11716, MDMPI_N, Mycothiol maleylpyruvate isomerase N-terminal domain	NA|142aa|up_7|AP018437.1_298121_298547_-	PRK09256, PRK09256, aminoacyl-tRNA hydrolase	NA|55aa|up_6|AP018437.1_298559_298724_-	NA	NA|215aa|up_5|AP018437.1_298712_299357_+	NA	NA|101aa|up_4|AP018437.1_299353_299656_+	pfam13601, HTH_34, Winged helix DNA-binding domain	NA|313aa|up_3|AP018437.1_299662_300601_+	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|246aa|up_2|AP018437.1_300593_301331_+	COG1668, NatB, ABC-type Na+ efflux pump, permease component [Energy production and conversion / Inorganic ion transport and metabolism]	NA|216aa|up_1|AP018437.1_301378_302026_+	cd00956, Transaldolase_FSA, Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea	NA|340aa|up_0|AP018437.1_302063_303083_+	cd05247, UDP_G4E_1_SDR_e, UDP-glucose 4 epimerase, subgroup 1, extended (e) SDRs	NA|343aa|down_0|AP018437.1_303789_304818_-	TIGR01128, DNA_polymerase_III_subunit_delta, DNA polymerase III, delta subunit	NA|162aa|down_1|AP018437.1_304822_305308_-	NA	NA|472aa|down_2|AP018437.1_305367_306783_-	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|261aa|down_3|AP018437.1_306853_307636_+	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|527aa|down_4|AP018437.1_307613_309194_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|274aa|down_5|AP018437.1_309364_310186_+	pfam14196, ATC_hydrolase, L-2-amino-thiazoline-4-carboxylic acid hydrolase	NA|307aa|down_6|AP018437.1_310192_311113_+	cd03268, ABC_BcrA_bacitracin_resist, ATP-binding cassette domain of the bacitracin-resistance transporter	NA|255aa|down_7|AP018437.1_311117_311882_+	COG4200, COG4200, Uncharacterized protein conserved in bacteria [Function unknown]	NA|835aa|down_8|AP018437.1_312195_314700_-	COG0542, clpA, ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones]	NA|148aa|down_9|AP018437.1_314867_315311_-	COG0824, FcbC, Predicted thioesterase [General function prediction only]
GCA_003966975.1_ASM396697v1	AP018437	Pelolinea submarina MO-CFX1 DNA, complete genome	2	698158-698261	2	CRISPRCasFinder	no		cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	Orphan	TCCTTGACACCCACCTCGCATGCC	24	0	0	NA	NA	NA	1	1	Orphan	cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	NA,NA|419aa|down_1|AP018437.1_699122_700379_+,NA|232aa|down_2|AP018437.1_700827_701523_+,NA|680aa|down_7|AP018437.1_706005_708045_-	NA|321aa|up_9|AP018437.1_686932_687895_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|439aa|up_8|AP018437.1_687896_689213_+	COG0677, WecC, UDP-N-acetyl-D-mannosaminuronate dehydrogenase [Cell envelope biogenesis, outer membrane]	NA|286aa|up_7|AP018437.1_689221_690079_+	cd06442, DPM1_like, DPM1_like represents putative enzymes similar to eukaryotic DPM1	NA|631aa|up_6|AP018437.1_690089_691982_+	TIGR03108, eps_aminotran_1, exosortase A system-associated amidotransferase 1	NA|119aa|up_5|AP018437.1_691983_692340_-	PRK00823, phhB, pterin-4-alpha-carbinolamine dehydratase; Validated	NA|223aa|up_4|AP018437.1_693076_693745_-	pfam02810, SEC-C, SEC-C motif	NA|397aa|up_3|AP018437.1_693744_694935_-	cd05400, NT_2-5OAS_ClassI-CCAase, Nucleotidyltransferase (NT) domain of 2'5'-oligoadenylate (2-5A)synthetase (2-5OAS) and class I CCA-adding enzyme	NA|378aa|up_2|AP018437.1_694936_696070_-	pfam18145, SAVED, SMODS-associated and fused to various effectors sensor domain	NA|131aa|up_1|AP018437.1_696248_696641_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|292aa|up_0|AP018437.1_696643_697519_+	pfam06114, Peptidase_M78, IrrE N-terminal-like domain	NA|191aa|down_0|AP018437.1_698436_699009_+	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|419aa|down_1|AP018437.1_699122_700379_+	NA	NA|232aa|down_2|AP018437.1_700827_701523_+	NA	NA|213aa|down_3|AP018437.1_702170_702809_+	COG1309, AcrR, Transcriptional regulator [Transcription]	NA|328aa|down_4|AP018437.1_702798_703782_+	COG1477, ApbE, Membrane-associated lipoprotein involved in thiamine biosynthesis [Coenzyme metabolism]	NA|141aa|down_5|AP018437.1_703798_704221_+	COG3976, COG3976, Uncharacterized protein conserved in bacteria [Function unknown]	NA|509aa|down_6|AP018437.1_704257_705784_+	pfam00144, Beta-lactamase, Beta-lactamase	NA|680aa|down_7|AP018437.1_706005_708045_-	NA	NA|189aa|down_8|AP018437.1_708041_708608_-	COG1595, RpoE, DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog [Transcription]	NA|664aa|down_9|AP018437.1_709691_711683_-	TIGR02800, Protein_TolB, tol-pal system beta propeller repeat protein TolB
GCA_003966975.1_ASM396697v1	AP018437	Pelolinea submarina MO-CFX1 DNA, complete genome	3	2672716-2675778	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	 Type I-U?,Type I-U,Type I-C	GTCGCTCCCCGCATGGGGGGCGTGGATTGAAAT,GTCGCTCCCCGCATGGGGGGCGTGGATTGAAAT,GTCGCTCCCCGCATGGGGGGCGTGGATTGAAAT	33,33,33	0	0	NA	NA	NA:NA:NA	44,45,45	45	TypeI-U,TypeI-U?,TypeI-C	cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	NA,NA	NA|508aa|up_9|AP018437.1_2660949_2662473_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|419aa|up_8|AP018437.1_2662450_2663707_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|119aa|up_7|AP018437.1_2663703_2664060_-	cd01276, PKCI_related, Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides	cas3|826aa|up_6|AP018437.1_2664380_2666858_+	COG1203, COG1203, CRISPR-associated helicase Cas3 [Defense mechanisms]	cas5|240aa|up_5|AP018437.1_2666868_2667588_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|636aa|up_4|AP018437.1_2667584_2669492_+	cd09757, Cas8c_I-C, CRISPR/Cas system-associated protein Cas8c	cas7|295aa|up_3|AP018437.1_2669525_2670410_+	cd09689, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas4|140aa|up_2|AP018437.1_2670794_2671214_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|344aa|up_1|AP018437.1_2671214_2672246_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|AP018437.1_2672257_2672548_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|617aa|down_0|AP018437.1_2676072_2677923_+	COG1001, AdeC, Adenine deaminase [Nucleotide transport and metabolism]	NA|151aa|down_1|AP018437.1_2679219_2679672_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|500aa|down_2|AP018437.1_2679693_2681193_+	COG1001, AdeC, Adenine deaminase [Nucleotide transport and metabolism]	NA|375aa|down_3|AP018437.1_2682550_2683675_-	pfam13517, VCBS, Repeat domain in Vibrio, Colwellia, Bradyrhizobium and Shewanella	NA|107aa|down_4|AP018437.1_2683907_2684228_+	pfam03551, PadR, Transcriptional regulator PadR-like family	NA|84aa|down_5|AP018437.1_2684312_2684564_+	pfam17253, DUF5320, Family of unknown function (DUF5320)	NA|93aa|down_6|AP018437.1_2684717_2684996_+	pfam02579, Nitro_FeMo-Co, Dinitrogenase iron-molybdenum cofactor	NA|585aa|down_7|AP018437.1_2685184_2686939_-	cd11356, AmyAc_Sucrose_phosphorylase-like_1, Alpha amylase catalytic domain found in sucrose phosphorylase-like proteins (also called sucrose glucosyltransferase, disaccharide glucosyltransferase, and sucrose-phosphate alpha-D glucosyltransferase)	NA|423aa|down_8|AP018437.1_2686953_2688222_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|304aa|down_9|AP018437.1_2688231_2689143_-	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]
GCA_003966975.1_ASM396697v1	AP018437	Pelolinea submarina MO-CFX1 DNA, complete genome	4	3056164-3056257	4	CRISPRCasFinder	no		cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	Orphan	CGCTCTGGGCAGGCCCCACGCTTC	24	0	0	NA	NA	NA	1	1	Orphan	cas4,csa3,cas3,DinG,cas5,cas8c,cas7,cas1,cas2,WYL	NA|184aa|up_8|AP018437.1_3040341_3040893_+,NA|142aa|down_9|AP018437.1_3065062_3065488_-	NA|308aa|up_9|AP018437.1_3039404_3040328_+	PRK05481, PRK05481, lipoyl synthase; Provisional	NA|184aa|up_8|AP018437.1_3040341_3040893_+	NA	NA|160aa|up_7|AP018437.1_3040903_3041383_+	cd00002, YbaK_deacylase, This CD includes cysteinyl-tRNA(Pro) deacylases from Haemophilus influenzae and Escherichia coli and other related bacterial proteins	NA|222aa|up_6|AP018437.1_3041387_3042053_-	COG0637, COG0637, Predicted phosphatase/phosphohexomutase [General function prediction only]	NA|1153aa|up_5|AP018437.1_3042379_3045838_+	TIGR02082, Methionine_synthase, 5-methyltetrahydrofolate--homocysteine methyltransferase	NA|622aa|up_4|AP018437.1_3045845_3047711_+	PRK08645, PRK08645, bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase protein; Reviewed	NA|94aa|up_3|AP018437.1_3047916_3048198_+	cd06170, LuxR_C_like, C-terminal DNA-binding domain of LuxR-like proteins	NA|583aa|up_2|AP018437.1_3048204_3049953_-	pfam13413, HTH_25, Helix-turn-helix domain	NA|200aa|up_1|AP018437.1_3050033_3050633_-	PRK00137, rplI, 50S ribosomal protein L9; Reviewed	NA|1674aa|up_0|AP018437.1_3050884_3055906_-	pfam13229, Beta_helix, Right handed beta helix region	NA|502aa|down_0|AP018437.1_3056318_3057824_-	cd09604, M1_APN_like, Peptidase M1 family similar to aminopeptidase N catalytic domain	NA|127aa|down_1|AP018437.1_3057834_3058215_-	cd07247, SgaA_N_like, N-terminal domain of Streptomyces griseus SgaA and similar domains	NA|263aa|down_2|AP018437.1_3058529_3059318_+	pfam09858, DUF2085, Predicted membrane protein (DUF2085)	NA|190aa|down_3|AP018437.1_3059358_3059928_+	pfam13548, DUF4126, Domain of unknown function (DUF4126)	NA|169aa|down_4|AP018437.1_3059924_3060431_-	pfam04167, DUF402, Protein of unknown function (DUF402)	NA|350aa|down_5|AP018437.1_3060403_3061453_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|566aa|down_6|AP018437.1_3061445_3063143_-	COG1178, ThiP, ABC-type Fe3+ transport system, permease component [Inorganic ion transport and metabolism]	NA|345aa|down_7|AP018437.1_3063142_3064177_-	cd13545, PBP2_TbpA, Substrate binding domain of thiamin transporter, a member of the type 2 periplasmic binding fold superfamily	NA|215aa|down_8|AP018437.1_3064202_3064847_-	cd07995, TPK, Thiamine pyrophosphokinase	NA|142aa|down_9|AP018437.1_3065062_3065488_-	NA
