assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001983935.1_ASM198393v1	NZ_CP017641	Fuerstia marisgermanicae strain NH11 chromosome, complete genome	1	997810-997907	1	CRISPRCasFinder	no		csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	Orphan	CGATGGTCGTTATTGATGTCGATTGC	26	0	0	NA	NA	NA	1	1	Orphan	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	NA|367aa|up_7|NZ_CP017641.1_983275_984376_+,NA|453aa|up_6|NZ_CP017641.1_984495_985854_-,NA|72aa|up_5|NZ_CP017641.1_986060_986276_-,NA|204aa|down_8|NZ_CP017641.1_1018235_1018847_+	NA|472aa|up_9|NZ_CP017641.1_979034_980450_-	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|778aa|up_8|NZ_CP017641.1_980800_983134_+	COG1033, COG1033, Predicted exporters of the RND superfamily [General function prediction only]	NA|367aa|up_7|NZ_CP017641.1_983275_984376_+	NA	NA|453aa|up_6|NZ_CP017641.1_984495_985854_-	NA	NA|72aa|up_5|NZ_CP017641.1_986060_986276_-	NA	NA|549aa|up_4|NZ_CP017641.1_986519_988166_+	cd03398, PAP2_haloperoxidase, PAP2, haloperoxidase_like subfamily	NA|1053aa|up_3|NZ_CP017641.1_988402_991561_-	PRK05672, dnaE2, error-prone DNA polymerase; Validated	NA|494aa|up_2|NZ_CP017641.1_991576_993058_-	cd03468, PolY_like, DNA Polymerase Y-family	NA|272aa|up_1|NZ_CP017641.1_992984_993800_-	NF033429, ImuA_translesion, translesion DNA synthesis-associated protein ImuA	NA|198aa|up_0|NZ_CP017641.1_993977_994571_-	PRK00215, PRK00215, transcriptional repressor LexA	NA|205aa|down_0|NZ_CP017641.1_1005756_1006371_+	TIGR02984, Sig-70_plancto1, RNA polymerase sigma-70 factor, Planctomycetaceae-specific subfamily 1	NA|1425aa|down_1|NZ_CP017641.1_1006367_1010642_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|228aa|down_2|NZ_CP017641.1_1010760_1011444_+	cd02522, GT_2_like_a, GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function	NA|330aa|down_3|NZ_CP017641.1_1011425_1012415_-	COG1907, COG1907, Predicted archaeal sugar kinases [General function prediction only]	NA|193aa|down_4|NZ_CP017641.1_1012411_1012990_-	pfam04289, DUF447, Protein of unknown function (DUF447)	NA|985aa|down_5|NZ_CP017641.1_1013008_1015963_-	TIGR03303, OM_YaeT, outer membrane protein assembly complex, YaeT protein	NA|232aa|down_6|NZ_CP017641.1_1016217_1016913_-	cd18092, SpoU-like_TrmH, SAM-dependent tRNA methylase related to TrmH	NA|289aa|down_7|NZ_CP017641.1_1016925_1017792_-	PRK10334, PRK10334, small-conductance mechanosensitive channel MscS	NA|204aa|down_8|NZ_CP017641.1_1018235_1018847_+	NA	NA|376aa|down_9|NZ_CP017641.1_1020306_1021434_+	cd06821, PLPDE_III_D-TA, Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme D-Threonine Aldolase
GCF_001983935.1_ASM198393v1	NZ_CP017641	Fuerstia marisgermanicae strain NH11 chromosome, complete genome	2	2610288-2610547	1	CRT	no		csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	Orphan	GTNGTGTAAGGAANNNNCTT	20	0	0	NA	NA	NA	5	5	Orphan	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	NA|128aa|up_9|NZ_CP017641.1_2593744_2594128_-,NA|365aa|up_5|NZ_CP017641.1_2604102_2605197_+,NA|48aa|down_6|NZ_CP017641.1_2620976_2621120_-,NA|205aa|down_9|NZ_CP017641.1_2624673_2625288_+	NA|128aa|up_9|NZ_CP017641.1_2593744_2594128_-	NA	NA|438aa|up_8|NZ_CP017641.1_2595599_2596913_-	PRK05749, PRK05749, 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed	NA|1184aa|up_7|NZ_CP017641.1_2597064_2600616_-	PRK12904, PRK12904, preprotein translocase subunit SecA; Reviewed	NA|710aa|up_6|NZ_CP017641.1_2601292_2603422_+	PRK12323, PRK12323, DNA polymerase III subunit gamma/tau	NA|365aa|up_5|NZ_CP017641.1_2604102_2605197_+	NA	NA|314aa|up_4|NZ_CP017641.1_2605428_2606370_-	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|204aa|up_3|NZ_CP017641.1_2606853_2607465_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|130aa|up_2|NZ_CP017641.1_2608029_2608419_+	PRK00051, hisI, phosphoribosyl-AMP cyclohydrolase; Reviewed	NA|293aa|up_1|NZ_CP017641.1_2608491_2609370_+	PRK00489, hisG, ATP phosphoribosyltransferase; Reviewed	NA|172aa|up_0|NZ_CP017641.1_2609376_2609892_-	cd03134, GATase1_PfpI_like, A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus	NA|229aa|down_0|NZ_CP017641.1_2612578_2613265_-	PRK00685, PRK00685, metal-dependent hydrolase; Provisional	NA|470aa|down_1|NZ_CP017641.1_2613271_2614681_-	PRK14097, pgi, glucose-6-phosphate isomerase; Provisional	NA|110aa|down_2|NZ_CP017641.1_2615185_2615515_+	TIGR01068, Thioredoxin-like_protein_slr0233, thioredoxin	NA|583aa|down_3|NZ_CP017641.1_2615705_2617454_+	TIGR02168, Chromosome_partition_protein_Smc, chromosome segregation protein SMC, common bacterial type	NA|70aa|down_4|NZ_CP017641.1_2617667_2617877_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|452aa|down_5|NZ_CP017641.1_2619342_2620698_+	COG2230, Cfa, Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane]	NA|48aa|down_6|NZ_CP017641.1_2620976_2621120_-	NA	NA|213aa|down_7|NZ_CP017641.1_2621342_2621981_+	pfam00072, Response_reg, Response regulator receiver domain	NA|640aa|down_8|NZ_CP017641.1_2621989_2623909_-	PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed	NA|205aa|down_9|NZ_CP017641.1_2624673_2625288_+	NA
GCF_001983935.1_ASM198393v1	NZ_CP017641	Fuerstia marisgermanicae strain NH11 chromosome, complete genome	3	2804488-2804583	2	CRISPRCasFinder	no		csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	Orphan	CCACGCCAACCGCCCGCCGAGGCGT	25	0	0	NA	NA	NA	1	1	Orphan	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	NA,NA|669aa|down_5|NZ_CP017641.1_2812601_2814608_+,NA|414aa|down_7|NZ_CP017641.1_2815226_2816468_+	NA|219aa|up_9|NZ_CP017641.1_2791537_2792194_+	pfam07638, Sigma70_ECF, ECF sigma factor	NA|207aa|up_8|NZ_CP017641.1_2792247_2792868_-	pfam05991, NYN_YacP, YacP-like NYN domain	NA|473aa|up_7|NZ_CP017641.1_2793112_2794531_+	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|199aa|up_6|NZ_CP017641.1_2794586_2795183_+	PRK01641, leuD, 3-isopropylmalate dehydratase small subunit	NA|253aa|up_5|NZ_CP017641.1_2795285_2796044_-	cd05233, SDR_c, classical (c) SDRs	NA|311aa|up_4|NZ_CP017641.1_2796100_2797033_+	cd05292, LDH_2, A subgroup of L-lactate dehydrogenases	NA|235aa|up_3|NZ_CP017641.1_2797246_2797951_+	pfam00359, PTS_EIIA_2, Phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 2	NA|1204aa|up_2|NZ_CP017641.1_2798163_2801775_-	sd00006, TPR, Tetratricopeptide repeat	NA|516aa|up_1|NZ_CP017641.1_2802083_2803631_+	TIGR01730, COG0845:_Membrane-fusion_protein, RND family efflux transporter, MFP subunit	NA|196aa|up_0|NZ_CP017641.1_2803640_2804228_-	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|352aa|down_0|NZ_CP017641.1_2804595_2805651_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|696aa|down_1|NZ_CP017641.1_2806077_2808165_-	cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like: cyclophilin-type A-like peptidylprolyl cis- trans isomerase (PPIase) domain similar to the cytosolic E	NA|388aa|down_2|NZ_CP017641.1_2808628_2809792_+	TIGR02037, Probable_periplasmic_serine_protease_do/HhoA-like, periplasmic serine protease, Do/DeqQ family	NA|105aa|down_3|NZ_CP017641.1_2809857_2810172_-	cd17551, REC_RpfG-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase response regulator RpfG and similar proteins	NA|485aa|down_4|NZ_CP017641.1_2810641_2812096_-	cd06177, MFS_NHS, Nucleoside:H(+) symporter family of the Major Facilitator Superfamily of transporters	NA|669aa|down_5|NZ_CP017641.1_2812601_2814608_+	NA	NA|132aa|down_6|NZ_CP017641.1_2814614_2815010_+	pfam13620, CarboxypepD_reg, Carboxypeptidase regulatory-like domain	NA|414aa|down_7|NZ_CP017641.1_2815226_2816468_+	NA	NA|1201aa|down_8|NZ_CP017641.1_2816594_2820197_+	pfam07587, PSD1, Protein of unknown function (DUF1553)	NA|480aa|down_9|NZ_CP017641.1_2820238_2821678_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)
GCF_001983935.1_ASM198393v1	NZ_CP017641	Fuerstia marisgermanicae strain NH11 chromosome, complete genome	4	6492017-6492776	1,3,2	PILER-CR,CRISPRCasFinder,CRT	no	cas9,cas1,cas2	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	Type II-A,Type II-C,Type II-B, or Type II-C?, Type II-B	AGTGTATCCGAGCAGCCGTTCCCGACGAACCACAGC,AGTGTATCCGAGCAGCCGTTCCCGACGAACCACAGC,AGTGTATCCGAGCAGCCGTTCCCGACGAACCACAGC	36,36,36	0	0	NA	NA	NA:NA:NA	11,11,11	11	TypeII-A,TypeII-C,TypeII-B,orTypeII-C?,TypeII-B	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	NA|267aa|up_9|NZ_CP017641.1_6476553_6477354_+,NA|478aa|up_5|NZ_CP017641.1_6482496_6483930_-,NA|77aa|down_4|NZ_CP017641.1_6497393_6497624_+	NA|267aa|up_9|NZ_CP017641.1_6476553_6477354_+	NA	NA|255aa|up_8|NZ_CP017641.1_6477554_6478319_-	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|430aa|up_7|NZ_CP017641.1_6479422_6480712_+	COG0673, MviM, Predicted dehydrogenases and related proteins [General function prediction only]	NA|366aa|up_6|NZ_CP017641.1_6481294_6482392_-	sd00034, LRR_AMN1, leucine-rich repeats, antagonist of mitotic exit network protein 1-like subfamily	NA|478aa|up_5|NZ_CP017641.1_6482496_6483930_-	NA	NA|553aa|up_4|NZ_CP017641.1_6484263_6485922_-	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	cas9|1047aa|up_3|NZ_CP017641.1_6486604_6489745_+	cd09643, Csn1, CRISPR/Cas system-associated protein Cas9	NA|202aa|up_2|NZ_CP017641.1_6489903_6490509_+	COG1943, COG1943, Transposase and inactivated derivatives [DNA replication, recombination, and repair]	cas1|305aa|up_1|NZ_CP017641.1_6490717_6491632_+	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	cas2|102aa|up_0|NZ_CP017641.1_6491660_6491966_+	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	NA|228aa|down_0|NZ_CP017641.1_6492824_6493508_-	pfam03703, bPH_2, Bacterial PH domain	NA|405aa|down_1|NZ_CP017641.1_6493650_6494865_-	COG0628, yhhT, Predicted permease, member of the PurR regulon [General function prediction only]	NA|87aa|down_2|NZ_CP017641.1_6494888_6495149_-	COG2261, COG2261, Predicted membrane protein [Function unknown]	NA|487aa|down_3|NZ_CP017641.1_6495846_6497307_-	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|77aa|down_4|NZ_CP017641.1_6497393_6497624_+	NA	NA|806aa|down_5|NZ_CP017641.1_6497837_6500255_+	pfam12228, DUF3604, Protein of unknown function (DUF3604)	NA|171aa|down_6|NZ_CP017641.1_6500476_6500989_+	cd00838, MPP_superfamily, metallophosphatase superfamily, metallophosphatase domain	NA|487aa|down_7|NZ_CP017641.1_6501008_6502469_+	pfam03050, DDE_Tnp_IS66, Transposase IS66 family	NA|520aa|down_8|NZ_CP017641.1_6502539_6504099_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|1258aa|down_9|NZ_CP017641.1_6504101_6507875_-	PRK11107, PRK11107, hybrid sensory histidine kinase BarA; Provisional
GCF_001983935.1_ASM198393v1	NZ_CP017641	Fuerstia marisgermanicae strain NH11 chromosome, complete genome	5	8358467-8358561	4	CRISPRCasFinder	no		csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	Orphan	CGCCGGCGCTATAGCCACCACAACGTCA	28	0	0	NA	NA	NA	1	1	Orphan	csa3,PrimPol,DinG,RT,DEDDh,cas3,csm5gr7,cas9,cas1,cas2	NA|440aa|up_9|NZ_CP017641.1_8345157_8346477_-,NA|168aa|up_2|NZ_CP017641.1_8354071_8354575_+,NA|225aa|down_0|NZ_CP017641.1_8358599_8359274_-,NA|19aa|down_5|NZ_CP017641.1_8363803_8363860_+,NA|76aa|down_9|NZ_CP017641.1_8367942_8368170_+	NA|440aa|up_9|NZ_CP017641.1_8345157_8346477_-	NA	NA|339aa|up_8|NZ_CP017641.1_8347132_8348149_+	PRK00436, argC, N-acetyl-gamma-glutamyl-phosphate reductase; Validated	NA|406aa|up_7|NZ_CP017641.1_8348142_8349360_+	PRK05388, argJ, bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase ArgJ	NA|271aa|up_6|NZ_CP017641.1_8349506_8350319_+	COG0580, GlpF, Glycerol uptake facilitator and related permeases (Major Intrinsic Protein Family) [Carbohydrate transport and metabolism]	NA|495aa|up_5|NZ_CP017641.1_8350432_8351917_+	cd07786, FGGY_EcGK_like, Escherichia coli glycerol kinase-like proteins; belongs to the FGGY family of carbohydrate kinases	NA|406aa|up_4|NZ_CP017641.1_8352023_8353241_+	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|190aa|up_3|NZ_CP017641.1_8353317_8353887_+	cd17569, REC_HupR-like, phosphoacceptor receiver (REC) domain of hydrogen uptake protein regulator (HupR) and similar domains	NA|168aa|up_2|NZ_CP017641.1_8354071_8354575_+	NA	NA|242aa|up_1|NZ_CP017641.1_8354581_8355307_-	TIGR02937, RNA_polymerase_sigma_factor, RNA polymerase sigma factor, sigma-70 family	NA|907aa|up_0|NZ_CP017641.1_8355455_8358176_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|225aa|down_0|NZ_CP017641.1_8358599_8359274_-	NA	NA|150aa|down_1|NZ_CP017641.1_8360170_8360620_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|410aa|down_2|NZ_CP017641.1_8360567_8361797_-	pfam01610, DDE_Tnp_ISL3, Transposase	NA|235aa|down_3|NZ_CP017641.1_8361842_8362547_+	cd00200, WD40, WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment	NA|391aa|down_4|NZ_CP017641.1_8362603_8363776_+	pfam13808, DDE_Tnp_1_assoc, DDE_Tnp_1-associated	NA|19aa|down_5|NZ_CP017641.1_8363803_8363860_+	NA	NA|210aa|down_6|NZ_CP017641.1_8363970_8364600_+	pfam13643, DUF4145, Domain of unknown function (DUF4145)	NA|288aa|down_7|NZ_CP017641.1_8364652_8365516_+	pfam13385, Laminin_G_3, Concanavalin A-like lectin/glucanases superfamily	NA|239aa|down_8|NZ_CP017641.1_8367089_8367806_+	cd16142, ARS_like, uncharacterized arylsulfatase subfamily	NA|76aa|down_9|NZ_CP017641.1_8367942_8368170_+	NA
