assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001458695.1_NiCh1	NZ_LN885086	Candidatus Nitrospira inopinata isolate ENR4 chromosome 1	1	993770-1002140	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	Type III-D,Type III-B,Type III-A,Type III-C	GTCTTAATCCCTTTTTCTTCAGGTCGGAATCCCATC,GTCTTAATCCCTTTTTCTTCAGGTCGGAATCCCATC,GTCTTAATCCCTTTTTCTTCAGGTCGGAATCCCATC	36,36,36	3	3	994370-994408|994727-994765|997082-997117	NZ_LN885086.1_491897-491935|NZ_LN885086.1_491897-491935|NZ_LN885086.1_491878-491913	NA:NA:NA	117,117,117	117	TypeIII-D,TypeIII-B,TypeIII-A,TypeIII-C	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	NA|2289aa|up_7|NZ_LN885086.1_978836_985703_+,NA|212aa|up_0|NZ_LN885086.1_992291_992927_-,NA|49aa|down_3|NZ_LN885086.1_1004248_1004395_-	cas1|259aa|up_9|NZ_LN885086.1_976723_977500_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	cas1|321aa|up_8|NZ_LN885086.1_977474_978437_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|2289aa|up_7|NZ_LN885086.1_978836_985703_+	NA	NA|659aa|up_6|NZ_LN885086.1_985699_987676_+	TIGR03866, PQQ_ABC_repeats, PQQ-dependent catabolism-associated beta-propeller protein	NA|203aa|up_5|NZ_LN885086.1_987683_988292_+	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|187aa|up_4|NZ_LN885086.1_988323_988884_+	cd02968, SCO, SCO (an acronym for Synthesis of Cytochrome c Oxidase) family; composed of proteins similar to Sco1, a membrane-anchored protein possessing a soluble domain with a TRX fold	NA|473aa|up_3|NZ_LN885086.1_988876_990295_+	pfam00034, Cytochrom_C, Cytochrome c	cas2|93aa|up_2|NZ_LN885086.1_990323_990602_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	csx1|423aa|up_1|NZ_LN885086.1_990613_991882_-	pfam09670, Cas_Cas02710, CRISPR-associated protein (Cas_Cas02710)	NA|212aa|up_0|NZ_LN885086.1_992291_992927_-	NA	csx1|388aa|down_0|NZ_LN885086.1_1002243_1003407_-	pfam09002, DUF1887, Domain of unknown function (DUF1887)	NA|103aa|down_1|NZ_LN885086.1_1003459_1003768_-	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	NA|137aa|down_2|NZ_LN885086.1_1003760_1004171_-	pfam08780, NTase_sub_bind, Nucleotidyltransferase substrate binding protein like	NA|49aa|down_3|NZ_LN885086.1_1004248_1004395_-	NA	cmr6gr7|446aa|down_4|NZ_LN885086.1_1004408_1005746_-	cd09661, Cmr6_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr6	cmr5gr11|127aa|down_5|NZ_LN885086.1_1005761_1006142_-	pfam09701, Cas_Cmr5, CRISPR-associated protein (Cas_Cmr5)	cmr4gr7|323aa|down_6|NZ_LN885086.1_1006142_1007111_-	TIGR02580, putative_CRISPR-associated_protein, CRISPR type III-B/RAMP module RAMP protein Cmr4	NA|85aa|down_7|NZ_LN885086.1_1007158_1007413_+	COG4118, Phd, Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]	NA|131aa|down_8|NZ_LN885086.1_1007409_1007802_+	cd09872, PIN_Sll0205-like, VapC-like PIN domain of Sll0205 protein and homologs	cmr3gr5|400aa|down_9|NZ_LN885086.1_1007807_1009007_-	cd09748, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily protein Cmr3
GCF_001458695.1_NiCh1	NZ_LN885086	Candidatus Nitrospira inopinata isolate ENR4 chromosome 1	2	2456753-2460281	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	WYL,cas3,cas8u2,cas7,cas5u,cas1,cas2	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	Unclear	ATTTCCGCGGCTGAAACGCCGCGGCCCCATTGAAGC,ATTTCCGCGGCTGAAACGCCGCGGCCCCATTGAAGC,ATTTCCGCGGCTGAAACGCCGCGGCCCCATTGAAGC	36,36,36	0	0	NA	NA	NA:NA:NA	48,48,48	48	Unclear	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	NA|71aa|up_4|NZ_LN885086.1_2453668_2453881_+,NA|313aa|down_2|NZ_LN885086.1_2462212_2463151_-,NA|130aa|down_3|NZ_LN885086.1_2463215_2463605_-,NA|73aa|down_6|NZ_LN885086.1_2465414_2465633_-,NA|199aa|down_7|NZ_LN885086.1_2465825_2466422_+	cas7|389aa|up_9|NZ_LN885086.1_2448785_2449952_+	cd09678, Csb1_I-U, CRISPR/Cas system-associated protein Csb1	cas5u|523aa|up_8|NZ_LN885086.1_2449948_2451517_+	pfam09609, Cas_GSU0054, CRISPR-associated protein, GSU0054 family (Cas_GSU0054)	NA|177aa|up_7|NZ_LN885086.1_2451699_2452230_+	COG4185, COG4185, Uncharacterized protein conserved in bacteria [Function unknown]	NA|144aa|up_6|NZ_LN885086.1_2452508_2452940_+	PTZ00395, PTZ00395, Sec24-related protein; Provisional	NA|191aa|up_5|NZ_LN885086.1_2452977_2453550_-	pfam05685, Uma2, Putative restriction endonuclease	NA|71aa|up_4|NZ_LN885086.1_2453668_2453881_+	NA	NA|71aa|up_3|NZ_LN885086.1_2453877_2454090_-	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|68aa|up_2|NZ_LN885086.1_2454086_2454290_-	COG1598, COG1598, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	cas1|554aa|up_1|NZ_LN885086.1_2454488_2456150_+	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	cas2|97aa|up_0|NZ_LN885086.1_2456218_2456509_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|240aa|down_0|NZ_LN885086.1_2460442_2461162_+	cd05233, SDR_c, classical (c) SDRs	NA|309aa|down_1|NZ_LN885086.1_2461215_2462142_+	COG1313, PflX, Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins [General function prediction only]	NA|313aa|down_2|NZ_LN885086.1_2462212_2463151_-	NA	NA|130aa|down_3|NZ_LN885086.1_2463215_2463605_-	NA	NA|144aa|down_4|NZ_LN885086.1_2463693_2464125_-	pfam03100, CcmE, CcmE	NA|390aa|down_5|NZ_LN885086.1_2464196_2465366_-	PRK06991, PRK06991, electron transport complex subunit RsxB	NA|73aa|down_6|NZ_LN885086.1_2465414_2465633_-	NA	NA|199aa|down_7|NZ_LN885086.1_2465825_2466422_+	NA	NA|431aa|down_8|NZ_LN885086.1_2466438_2467731_+	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|209aa|down_9|NZ_LN885086.1_2467821_2468448_+	TIGR02258, UPF0097_protein_AF_2157, 2'-5' RNA ligase
GCF_001458695.1_NiCh1	NZ_LN885086	Candidatus Nitrospira inopinata isolate ENR4 chromosome 1	3	2530668-2532863	3,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas5,cas8c,cas7b,cas4,cas1,cas2	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	 Type I-U?,Type I-U,Type I-C	GTAGCGCCCGCTCGAAAGGGCGGGCGAGGATTGAAAC,GTAGCGCCCGCTCGAAAGGGCGGGCGAGGATTGAAAC,GTAGCGCCCGCTCGAAAGGGCGGGCGAGGATTGAAAC	37,37,37	0	0	NA	NA	I-C,III-B:I-C,III-B:I-C,III-B	30,30,30	30	TypeI-U?,TypeI-U,TypeI-C	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	NA|70aa|up_7|NZ_LN885086.1_2521782_2521992_+,NA|64aa|down_1|NZ_LN885086.1_2536202_2536394_+,NA|68aa|down_5|NZ_LN885086.1_2538834_2539038_-,NA|102aa|down_9|NZ_LN885086.1_2542502_2542808_-	NA|93aa|up_9|NZ_LN885086.1_2520890_2521169_+	cd17040, Ubl_MoaD_like, ubiquitin-like (Ubl) domain found in a group of small sulfide carrier proteins	NA|142aa|up_8|NZ_LN885086.1_2521358_2521784_-	cd18689, PIN_VapC-like, uncharacterized subfamily of the VapC-like nuclease family of the PIN domain superfamily	NA|70aa|up_7|NZ_LN885086.1_2521782_2521992_+	NA	cas3|774aa|up_6|NZ_LN885086.1_2522650_2524972_+	cd17930, DEXHc_cas3, DEXH/Q-box helicase domain of Cas3	cas5|257aa|up_5|NZ_LN885086.1_2524980_2525751_+	cd09752, Cas5_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas8c|583aa|up_4|NZ_LN885086.1_2525753_2527502_+	pfam09709, Cas_Csd1, CRISPR-associated protein (Cas_Csd1)	cas7b|333aa|up_3|NZ_LN885086.1_2527498_2528497_+	pfam05107, Cas_Cas7, CRISPR-associated protein Cas7	cas4|212aa|up_2|NZ_LN885086.1_2528514_2529150_+	cd09637, Cas4_I-A_I-B_I-C_I-D_II-B, CRISPR/Cas system-associated protein Cas4	cas1|344aa|up_1|NZ_LN885086.1_2529157_2530189_+	TIGR03640, cas1_DVULG, CRISPR-associated endonuclease Cas1, subtype I-C/DVULG	cas2|97aa|up_0|NZ_LN885086.1_2530198_2530489_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|844aa|down_0|NZ_LN885086.1_2533602_2536134_-	TIGR01970, ATP-dependent_RNA_helicase_HrpB, ATP-dependent helicase HrpB	NA|64aa|down_1|NZ_LN885086.1_2536202_2536394_+	NA	NA|327aa|down_2|NZ_LN885086.1_2536538_2537519_+	pfam00891, Methyltransf_2, O-methyltransferase	NA|192aa|down_3|NZ_LN885086.1_2537554_2538130_-	sd00010, SLR, Sel1-like repeat	NA|151aa|down_4|NZ_LN885086.1_2538199_2538652_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|68aa|down_5|NZ_LN885086.1_2538834_2539038_-	NA	NA|121aa|down_6|NZ_LN885086.1_2539280_2539643_-	COG1555, ComEA, DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair]	NA|454aa|down_7|NZ_LN885086.1_2539913_2541275_-	pfam03631, Virul_fac_BrkB, Virulence factor BrkB	NA|252aa|down_8|NZ_LN885086.1_2541475_2542231_-	cd14948, BACON, Bacteroidetes-Associated Carbohydrate-binding (putative) Often N-terminal (BACON) domain	NA|102aa|down_9|NZ_LN885086.1_2542502_2542808_-	NA
GCF_001458695.1_NiCh1	NZ_LN885086	Candidatus Nitrospira inopinata isolate ENR4 chromosome 1	4	2698940-2699048	4	CRISPRCasFinder	no		cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	Orphan	GACAGATCGATCGGCAGAGGCCAAAACCGCTTGCGACT	38	0	0	NA	NA	NA	1	1	Orphan	cas6,cas2,cas1,csx1,cmr6gr7,cmr5gr11,cmr4gr7,cmr3gr5,cas10,csm6,WYL,cas3,cas8u2,cas7,cas5u,cas5,cas8c,cas7b,cas4	NA|66aa|up_1|NZ_LN885086.1_2698198_2698396_-,NA|100aa|up_0|NZ_LN885086.1_2698434_2698734_-,NA|123aa|down_1|NZ_LN885086.1_2701702_2702071_-,NA|62aa|down_2|NZ_LN885086.1_2702219_2702405_+,NA|405aa|down_4|NZ_LN885086.1_2703125_2704340_-,NA|90aa|down_5|NZ_LN885086.1_2704552_2704822_-,NA|109aa|down_6|NZ_LN885086.1_2705003_2705330_-	NA|433aa|up_9|NZ_LN885086.1_2687340_2688639_-	cd04179, DPM_DPG-synthase_like, DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily	NA|408aa|up_8|NZ_LN885086.1_2688635_2689859_-	PRK13915, PRK13915, putative glucosyl-3-phosphoglycerate synthase; Provisional	NA|280aa|up_7|NZ_LN885086.1_2690132_2690972_+	PRK00192, PRK00192, mannosyl-3-phosphoglycerate phosphatase; Reviewed	NA|263aa|up_6|NZ_LN885086.1_2691128_2691917_-	cd01627, HAD_TPP, trehalose-phosphate phosphatase similar to Escherichia coli trehalose-6-phosphate phosphatase OtsB and Saccharomyces cerevisiae trehalose-phosphatase TPS2	NA|750aa|up_5|NZ_LN885086.1_2691913_2694163_-	cd03788, GT20_TPS, trehalose-6-phosphate synthase	NA|595aa|up_4|NZ_LN885086.1_2694211_2695996_+	COG3387, SGA1, Glucoamylase and related glycosyl hydrolases [Carbohydrate transport and metabolism]	NA|182aa|up_3|NZ_LN885086.1_2696121_2696667_-	pfam04981, NMD3, NMD3 family	NA|396aa|up_2|NZ_LN885086.1_2696905_2698093_+	cd06164, S2P-M50_SpoIVFB_CBS, SpoIVFB Site-2 protease (S2P), a zinc metalloprotease (MEROPS family M50B), regulates intramembrane proteolysis (RIP), and is involved in the pro-sigmaK pathway of bacterial spore formation	NA|66aa|up_1|NZ_LN885086.1_2698198_2698396_-	NA	NA|100aa|up_0|NZ_LN885086.1_2698434_2698734_-	NA	NA|432aa|down_0|NZ_LN885086.1_2699986_2701282_+	pfam00924, MS_channel, Mechanosensitive ion channel	NA|123aa|down_1|NZ_LN885086.1_2701702_2702071_-	NA	NA|62aa|down_2|NZ_LN885086.1_2702219_2702405_+	NA	NA|218aa|down_3|NZ_LN885086.1_2702441_2703095_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|405aa|down_4|NZ_LN885086.1_2703125_2704340_-	NA	NA|90aa|down_5|NZ_LN885086.1_2704552_2704822_-	NA	NA|109aa|down_6|NZ_LN885086.1_2705003_2705330_-	NA	NA|868aa|down_7|NZ_LN885086.1_2705507_2708111_-	COG4591, LolE, ABC-type transport system, involved in lipoprotein release, permease component [Cell envelope biogenesis, outer membrane]	NA|91aa|down_8|NZ_LN885086.1_2708233_2708506_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|310aa|down_9|NZ_LN885086.1_2708515_2709445_-	PRK02506, PRK02506, dihydroorotate dehydrogenase 1A; Reviewed
