assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000968135.1_ASM96813v1	NZ_FO538765	Magnetospira sp. QH-2	1	2181838-2184355	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7,cas4,cas1,cas2	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	Type I-C, Type I-U?,Type I-U	GTCGCCCCCACACGGGGGCGCGGATTGAAAC,GTCGCCCCCACACGGGGGCGCGGATTGAAAC,NGTCGCCCCCACACGGGGGCGCGGATTGAAAC	31,31,32	4	4	2182264-2182297|2183638-2183670|2183702-2183737|2184226-2184259	NZ_FO538766.1_30058-30025|NZ_FO538766.1_26469-26437|NZ_FO538766.1_18563-18598|NZ_FO538766.1_21350-21317	NA:NA:NA	37,37,38	38	TypeI-C,TypeI-U?,TypeI-U	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	NA|115aa|up_9|NZ_FO538765.1_2171540_2171885_+,NA|79aa|down_2|NZ_FO538765.1_2186986_2187223_-,NA|263aa|down_9|NZ_FO538765.1_2195566_2196355_+	NA|115aa|up_9|NZ_FO538765.1_2171540_2171885_+	NA	NA|132aa|up_8|NZ_FO538765.1_2171902_2172298_+	COG0511, AccB, Biotin carboxyl carrier protein [Lipid metabolism]	NA|401aa|up_7|NZ_FO538765.1_2172311_2173514_+	TIGR03136, decarboxylase_subunit_of_malonate_decarboxylase, Na+-transporting malonate decarboxylase, carboxybiotin decarboxylase subunit	cas3|725aa|up_6|NZ_FO538765.1_2174180_2176355_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|226aa|up_5|NZ_FO538765.1_2176368_2177046_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|577aa|up_4|NZ_FO538765.1_2177042_2178773_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7|286aa|up_3|NZ_FO538765.1_2178776_2179634_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|211aa|up_2|NZ_FO538765.1_2179638_2180271_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	cas1|339aa|up_1|NZ_FO538765.1_2180267_2181284_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_FO538765.1_2181363_2181654_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|370aa|down_0|NZ_FO538765.1_2184722_2185832_+	COG0579, COG0579, Predicted dehydrogenase [General function prediction only]	NA|370aa|down_1|NZ_FO538765.1_2185876_2186986_+	COG2342, COG2342, Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase [Carbohydrate transport and metabolism]	NA|79aa|down_2|NZ_FO538765.1_2186986_2187223_-	NA	NA|165aa|down_3|NZ_FO538765.1_2187693_2188188_-	pfam01627, Hpt, Hpt domain	NA|171aa|down_4|NZ_FO538765.1_2188291_2188804_-	cd17546, REC_hyHK_CKI1_RcsC-like, phosphoacceptor receiver (REC) domain of hybrid sensor histidine kinases/response regulators similar to Arabidopsis thaliana CKI1 and Escherichia coli RcsC	NA|228aa|down_5|NZ_FO538765.1_2188933_2189617_-	cd05325, carb_red_sniffer_like_SDR_c, carbonyl reductase sniffer-like, classical (c) SDRs	NA|599aa|down_6|NZ_FO538765.1_2189689_2191486_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|466aa|down_7|NZ_FO538765.1_2191538_2192936_-	COG2239, MgtE, Mg/Co/Ni transporter MgtE (contains CBS domain) [Inorganic ion transport and metabolism]	NA|758aa|down_8|NZ_FO538765.1_2192932_2195206_-	PRK07232, PRK07232, bifunctional malic enzyme oxidoreductase/phosphotransacetylase; Reviewed	NA|263aa|down_9|NZ_FO538765.1_2195566_2196355_+	NA
GCF_000968135.1_ASM96813v1	NZ_FO538765	Magnetospira sp. QH-2	2	2425608-2425711	2	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	Orphan	AATGGTGGAGCCGACCGGGATCGAACC	27	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	NA|342aa|up_8|NZ_FO538765.1_2413924_2414950_-,NA|156aa|up_7|NZ_FO538765.1_2415004_2415472_-,NA|87aa|up_5|NZ_FO538765.1_2416367_2416628_-,NA|55aa|up_3|NZ_FO538765.1_2417949_2418114_-,NA|57aa|down_8|NZ_FO538765.1_2436695_2436866_+	NA|124aa|up_9|NZ_FO538765.1_2413219_2413591_-	pfam11171, DUF2958, Protein of unknown function (DUF2958)	NA|342aa|up_8|NZ_FO538765.1_2413924_2414950_-	NA	NA|156aa|up_7|NZ_FO538765.1_2415004_2415472_-	NA	NA|254aa|up_6|NZ_FO538765.1_2415571_2416332_-	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|87aa|up_5|NZ_FO538765.1_2416367_2416628_-	NA	NA|306aa|up_4|NZ_FO538765.1_2416697_2417615_-	COG4227, COG4227, Antirestriction protein [DNA replication, recombination, and repair]	NA|55aa|up_3|NZ_FO538765.1_2417949_2418114_-	NA	NA|299aa|up_2|NZ_FO538765.1_2418367_2419264_+	cd01126, TraG_VirD4, The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion	NA|447aa|up_1|NZ_FO538765.1_2419334_2420675_+	pfam13701, DDE_Tnp_1_4, Transposase DDE domain group 1	NA|283aa|up_0|NZ_FO538765.1_2420869_2421718_+	COG3505, VirD4, Type IV secretory pathway, VirD4 components [Intracellular trafficking and secretion]	NA|412aa|down_0|NZ_FO538765.1_2428527_2429763_+	PRK11649, PRK11649, putative peptidase; Provisional	NA|258aa|down_1|NZ_FO538765.1_2429775_2430549_+	PRK10673, PRK10673, esterase	NA|681aa|down_2|NZ_FO538765.1_2430553_2432596_+	PRK01254, PRK01254, YgiQ family radical SAM protein	NA|265aa|down_3|NZ_FO538765.1_2432600_2433395_-	PRK09543, znuB, zinc ABC transporter permease subunit ZnuB	NA|261aa|down_4|NZ_FO538765.1_2433378_2434161_-	PRK09544, znuC, high-affinity zinc transporter ATPase; Reviewed	NA|162aa|down_5|NZ_FO538765.1_2434136_2434622_-	PRK11639, PRK11639, zinc uptake transcriptional repressor Zur	NA|132aa|down_6|NZ_FO538765.1_2434902_2435298_+	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|355aa|down_7|NZ_FO538765.1_2435302_2436367_+	COG0523, COG0523, Putative GTPases (G3E family) [General function prediction only]	NA|57aa|down_8|NZ_FO538765.1_2436695_2436866_+	NA	NA|172aa|down_9|NZ_FO538765.1_2437194_2437710_-	pfam07486, Hydrolase_2, Cell Wall Hydrolase
GCF_000968135.1_ASM96813v1	NZ_FO538765	Magnetospira sp. QH-2	3	2963755-2963861	3	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	Orphan	GGGTGGATAATTTATGCACTGTCACCGCAACTTC	34	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	NA|152aa|up_0|NZ_FO538765.1_2963202_2963658_-,NA|62aa|down_0|NZ_FO538765.1_2963891_2964077_-,NA|108aa|down_2|NZ_FO538765.1_2965400_2965724_-,NA|311aa|down_6|NZ_FO538765.1_2969572_2970505_-	NA|131aa|up_9|NZ_FO538765.1_2951740_2952133_-	pfam18593, CdiI_2, CdiI immunity protein	NA|742aa|up_8|NZ_FO538765.1_2952151_2954377_-	pfam18431, RNAse_A_bac, Bacterial CdiA-CT RNAse A domain	NA|286aa|up_7|NZ_FO538765.1_2954655_2955513_+	TIGR02072, Malonyl-_O-methyltransferase, malonyl-acyl carrier protein O-methyltransferase BioC	NA|235aa|up_6|NZ_FO538765.1_2955516_2956221_+	PRK00347, PRK00347, DNA/RNA nuclease SfsA	NA|248aa|up_5|NZ_FO538765.1_2956305_2957049_+	PRK05716, PRK05716, methionine aminopeptidase; Validated	NA|267aa|up_4|NZ_FO538765.1_2957217_2958018_+	PRK00024, PRK00024, DNA repair protein RadC	NA|282aa|up_3|NZ_FO538765.1_2958087_2958933_+	cd13566, PBP2_phosphate, Substrate binding domain of putative ABC-type phosphate transporter, a member of the type 2 periplasmic binding fold superfamily	NA|539aa|up_2|NZ_FO538765.1_2958936_2960553_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|519aa|up_1|NZ_FO538765.1_2960873_2962430_+	PRK00915, PRK00915, 2-isopropylmalate synthase; Validated	NA|152aa|up_0|NZ_FO538765.1_2963202_2963658_-	NA	NA|62aa|down_0|NZ_FO538765.1_2963891_2964077_-	NA	NA|329aa|down_1|NZ_FO538765.1_2964284_2965271_+	COG0790, COG0790, FOG: TPR repeat, SEL1 subfamily [General function prediction only]	NA|108aa|down_2|NZ_FO538765.1_2965400_2965724_-	NA	NA|254aa|down_3|NZ_FO538765.1_2965790_2966551_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|330aa|down_4|NZ_FO538765.1_2966481_2967471_-	PRK12704, PRK12704, phosphodiesterase; Provisional	NA|344aa|down_5|NZ_FO538765.1_2968098_2969130_-	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|311aa|down_6|NZ_FO538765.1_2969572_2970505_-	NA	NA|274aa|down_7|NZ_FO538765.1_2970494_2971316_-	cd07572, nit, Nit1, Nit 2, and related proteins, and the Nit1-like domain of NitFhit (class 10 nitrilases)	NA|87aa|down_8|NZ_FO538765.1_2971318_2971579_-	cd03418, GRX_GRXb_1_3_like, Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain	NA|253aa|down_9|NZ_FO538765.1_2971747_2972506_-	COG1040, ComFC, Predicted amidophosphoribosyltransferases [General function prediction only]
GCF_000968135.1_ASM96813v1	NZ_FO538765	Magnetospira sp. QH-2	4	3600219-3600295	4	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	Orphan	CCGTACAGCCGGTCATTGCCGGC	23	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	NA|192aa|up_0|NZ_FO538765.1_3599292_3599868_-,NA|113aa|down_5|NZ_FO538765.1_3608087_3608426_+	NA|178aa|up_9|NZ_FO538765.1_3591430_3591964_+	TIGR02667, Molybdenum_cofactor_biosynthesis_protein_B	NA|220aa|up_8|NZ_FO538765.1_3591968_3592628_+	pfam11150, DUF2927, Protein of unknown function (DUF2927)	NA|205aa|up_7|NZ_FO538765.1_3592588_3593203_-	TIGR04282, hypothetical_protein, transferase 1, rSAM/selenodomain-associated	NA|224aa|up_6|NZ_FO538765.1_3593195_3593867_-	cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function	NA|363aa|up_5|NZ_FO538765.1_3593949_3595038_+	TIGR04470, hypothetical_protein_ALIPUT_00462, radical SAM mobile pair protein B	NA|370aa|up_4|NZ_FO538765.1_3595029_3596139_-	cd07332, M48C_Oma1_like, Peptidase M48C Ste24p, integral membrane endopeptidase	NA|407aa|up_3|NZ_FO538765.1_3596141_3597362_-	pfam05987, DUF898, Bacterial protein of unknown function (DUF898)	NA|210aa|up_2|NZ_FO538765.1_3597470_3598100_+	cd07182, RNase_HII_bacteria_HII_like, Bacterial Ribonuclease HII-like	NA|349aa|up_1|NZ_FO538765.1_3598230_3599277_+	pfam01555, N6_N4_Mtase, DNA methylase	NA|192aa|up_0|NZ_FO538765.1_3599292_3599868_-	NA	NA|333aa|down_0|NZ_FO538765.1_3601627_3602626_-	pfam17892, Cadherin_5, Cadherin-like domain	NA|295aa|down_1|NZ_FO538765.1_3602799_3603684_-	PRK01402, hslO, Hsp33-like chaperonin; Reviewed	NA|303aa|down_2|NZ_FO538765.1_3603680_3604589_-	PRK00779, PRK00779, ornithine carbamoyltransferase; Provisional	NA|397aa|down_3|NZ_FO538765.1_3604585_3605776_-	PRK01278, argD, acetylornithine transaminase protein; Provisional	NA|690aa|down_4|NZ_FO538765.1_3605983_3608053_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|113aa|down_5|NZ_FO538765.1_3608087_3608426_+	NA	NA|344aa|down_6|NZ_FO538765.1_3608679_3609711_-	COG3547, COG3547, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	NA|330aa|down_7|NZ_FO538765.1_3610032_3611022_-	pfam12146, Hydrolase_4, Serine aminopeptidase, S33	NA|313aa|down_8|NZ_FO538765.1_3611023_3611962_-	COG1131, CcmA, ABC-type multidrug transport system, ATPase component [Defense mechanisms]	NA|65aa|down_9|NZ_FO538765.1_3612048_3612243_+	pfam10276, zf-CHCC, Zinc-finger domain
GCF_000968135.1_ASM96813v1	NZ_FO538765	Magnetospira sp. QH-2	5	3669473-3669576	5	CRISPRCasFinder	no		csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	Orphan	AATGGTGGAGCCGACCGGGATCGAACC	27	0	0	NA	NA	NA	1	1	Orphan	csa3,DinG,DEDDh,cas3,cas5,cas8c,cas7,cas4,cas1,cas2,RT	NA,NA|168aa|down_0|NZ_FO538765.1_3672052_3672556_+	NA|536aa|up_9|NZ_FO538765.1_3654751_3656359_+	PRK00741, prfC, peptide chain release factor 3; Provisional	NA|137aa|up_8|NZ_FO538765.1_3656355_3656766_+	COG0537, Hit, Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only]	NA|848aa|up_7|NZ_FO538765.1_3656917_3659461_+	COG4251, COG4251, Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms]	NA|254aa|up_6|NZ_FO538765.1_3659605_3660367_+	COG4598, HisP, ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism]	NA|255aa|up_5|NZ_FO538765.1_3660421_3661186_+	cd13702, PBP2_mlr5654_like, Substrate binding domain of ABC-type histidine/lysine/arginine/ornithine transporter-like; the type 2 periplasmic-binding protein fold	NA|232aa|up_4|NZ_FO538765.1_3661254_3661950_+	COG4215, ArtQ, ABC-type arginine transport system, permease component [Amino acid transport and metabolism]	NA|240aa|up_3|NZ_FO538765.1_3661949_3662669_+	COG4160, ArtM, ABC-type arginine/histidine transport system, permease component [Amino acid transport and metabolism]	NA|202aa|up_2|NZ_FO538765.1_3662693_3663299_-	COG1279, COG1279, Lysine efflux permease [General function prediction only]	NA|201aa|up_1|NZ_FO538765.1_3663412_3664015_-	cd04745, LbH_paaY_like, paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E	NA|442aa|up_0|NZ_FO538765.1_3664136_3665462_-	cd05561, Peptidases_S8_4, Peptidase S8 family domain, uncharacterized subfamily 4	NA|168aa|down_0|NZ_FO538765.1_3672052_3672556_+	NA	NA|447aa|down_1|NZ_FO538765.1_3672569_3673910_-	sd00006, TPR, Tetratricopeptide repeat	NA|592aa|down_2|NZ_FO538765.1_3674031_3675807_-	PRK09111, PRK09111, DNA polymerase III subunits gamma and tau; Validated	NA|178aa|down_3|NZ_FO538765.1_3675993_3676527_-	cd13906, CuRO_3_CumA_like, The third cupredoxin domain of CumA like multicopper oxidase	NA|315aa|down_4|NZ_FO538765.1_3676642_3677587_+	COG2816, NPY1, NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair]	NA|173aa|down_5|NZ_FO538765.1_3677594_3678113_-	cd19923, REC_CheY_CheY3, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY3 and similar CheY family proteins	NA|292aa|down_6|NZ_FO538765.1_3678281_3679157_+	COG2301, CitE, Citrate lyase beta subunit [Carbohydrate transport and metabolism]	NA|351aa|down_7|NZ_FO538765.1_3679153_3680206_+	cd03451, FkbR2, FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes	NA|265aa|down_8|NZ_FO538765.1_3680209_3681004_-	pfam09968, DUF2202, Uncharacterized protein domain (DUF2202)	NA|143aa|down_9|NZ_FO538765.1_3681181_3681610_+	smart00318, SNc, Staphylococcal nuclease homologues
