assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_902386745.1_UHGG_MGYG-HGUT-02398	NZ_LR698992	Laribacter hongkongensis isolate MGYG-HGUT-02398 chromosome 1	1	1176394-1178285	1,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f	csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	Type I-F	GTTCACTGCCGGACAGGCAGCTCAGAAA,GTTCACTGCCGGACAGGCAGCT,GTTCACTGCCGGACAGGCAGCTTAGAAA	28,22,28	1	1	1178225-1178257	NZ_LR698992.1_1654382-1654350	I-F:I-F:I-F	31,31,28	31	TypeI-F	csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	NA|78aa|up_4|NZ_LR698992.1_1171753_1171987_+,NA|54aa|down_4|NZ_LR698992.1_1183514_1183676_+,NA|146aa|down_7|NZ_LR698992.1_1186654_1187092_-	NA|313aa|up_9|NZ_LR698992.1_1161469_1162408_-	PRK11025, PRK11025, 23S rRNA pseudouridine(955/2504/2580) synthase RluC	NA|1071aa|up_8|NZ_LR698992.1_1162957_1166170_+	PRK10811, rne, ribonuclease E; Reviewed	NA|64aa|up_7|NZ_LR698992.1_1166712_1166904_-	pfam13683, rve_3, Integrase core domain	cas1|325aa|up_6|NZ_LR698992.1_1167417_1168392_+	TIGR03637, cas1_YPEST, CRISPR-associated endonuclease Cas1, subtype I-F/YPEST	cas3-cas2|1121aa|up_5|NZ_LR698992.1_1168388_1171751_+	TIGR02562, conserved_hypothetical_protein, CRISPR-associated helicase Cas3, subtype I-F/YPEST	NA|78aa|up_4|NZ_LR698992.1_1171753_1171987_+	NA	cas8f|462aa|up_3|NZ_LR698992.1_1172231_1173617_+	pfam09611, Cas_Csy1, CRISPR-associated protein (Cas_Csy1)	cas5f|337aa|up_2|NZ_LR698992.1_1173603_1174614_+	pfam09614, Cas_Csy2, CRISPR-associated protein (Cas_Csy2)	cas7f|341aa|up_1|NZ_LR698992.1_1174610_1175633_+	pfam09615, Cas_Csy3, CRISPR-associated protein (Cas_Csy3)	cas6f|196aa|up_0|NZ_LR698992.1_1175636_1176224_+	pfam09618, Cas_Csy4, CRISPR-associated protein (Cas_Csy4)	NA|331aa|down_0|NZ_LR698992.1_1178565_1179558_+	COG1613, Sbp, ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|270aa|down_1|NZ_LR698992.1_1180398_1181208_+	TIGR02454, Uncharacterized_protein_MJ1089, cobalt ECF transporter T component CbiQ	NA|273aa|down_2|NZ_LR698992.1_1181213_1182032_+	cd03225, ABC_cobalt_CbiO_domain1, First domain of the ATP-binding cassette component of cobalt transport system	NA|458aa|down_3|NZ_LR698992.1_1182049_1183423_+	cd01031, EriC, ClC chloride channel EriC	NA|54aa|down_4|NZ_LR698992.1_1183514_1183676_+	NA	NA|666aa|down_5|NZ_LR698992.1_1183798_1185796_+	TIGR01778, TonB-copper, TonB-dependent copper receptor	NA|234aa|down_6|NZ_LR698992.1_1185863_1186565_-	PRK10871, nlpD, murein hydrolase activator NlpD	NA|146aa|down_7|NZ_LR698992.1_1186654_1187092_-	NA	NA|273aa|down_8|NZ_LR698992.1_1187104_1187923_-	pfam13679, Methyltransf_32, Methyltransferase domain	NA|306aa|down_9|NZ_LR698992.1_1187959_1188877_-	cd08422, PBP2_CrgA_like, The C-terminal substrate binding domain of LysR-type transcriptional regulator CrgA and its related homologs, contains the type 2 periplasmic binding domain
GCF_902386745.1_UHGG_MGYG-HGUT-02398	NZ_LR698992	Laribacter hongkongensis isolate MGYG-HGUT-02398 chromosome 1	2	1523000-1523869	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no		csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	Orphan	GTTCACTGCCGGACAGGCAGCTCAGAAA,GTTCACTGCCGGACAGGCAGCTCAGAAA,GTTCACTGCCGGACAGGCAGCTCAGAAA	28,28,28	0	0	NA	NA	I-F:I-F:I-F	13,14,14	14	Orphan	csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	NA|104aa|up_8|NZ_LR698992.1_1517782_1518094_+,NA|87aa|up_6|NZ_LR698992.1_1518817_1519078_-,NA|79aa|up_5|NZ_LR698992.1_1519192_1519429_-,NA|209aa|up_2|NZ_LR698992.1_1521587_1522214_-,NA|73aa|up_1|NZ_LR698992.1_1522212_1522431_+,NA	NA|238aa|up_9|NZ_LR698992.1_1513594_1514308_+	PRK10992, PRK10992, iron-sulfur cluster repair protein YtfE	NA|104aa|up_8|NZ_LR698992.1_1517782_1518094_+	NA	NA|163aa|up_7|NZ_LR698992.1_1518261_1518750_+	cd04335, PrdX_deacylase, This CD includes bacterial (Agrobacterium tumefaciens and Caulobacter crescentus ProX, and Clostridium sticklandii PrdX) and eukaryotic (Plasmodium falciparum N-terminal ProRS editing domain) sequences	NA|87aa|up_6|NZ_LR698992.1_1518817_1519078_-	NA	NA|79aa|up_5|NZ_LR698992.1_1519192_1519429_-	NA	NA|96aa|up_4|NZ_LR698992.1_1519517_1519805_-	pfam07045, DUF1330, Domain of unknown function (DUF1330)	NA|484aa|up_3|NZ_LR698992.1_1520110_1521562_-	pfam10145, PhageMin_Tail, Phage-related minor tail protein	NA|209aa|up_2|NZ_LR698992.1_1521587_1522214_-	NA	NA|73aa|up_1|NZ_LR698992.1_1522212_1522431_+	NA	NA|99aa|up_0|NZ_LR698992.1_1522503_1522800_-	pfam10109, Phage_TAC_7, Phage tail assembly chaperone proteins, E, or 41 or 14	NA|105aa|down_0|NZ_LR698992.1_1524129_1524444_-	PHA02600, FII, major tail tube protein; Provisional	NA|736aa|down_1|NZ_LR698992.1_1524891_1527099_-	TIGR03549, TIGR03549, YcaO domain protein	NA|755aa|down_2|NZ_LR698992.1_1527595_1529860_+	PRK15061, PRK15061, catalase/peroxidase	NA|346aa|down_3|NZ_LR698992.1_1530014_1531052_-	pfam14568, SUKH_6, SMI1-KNR4 cell-wall	NA|467aa|down_4|NZ_LR698992.1_1531468_1532869_-	pfam06545, DUF1116, Protein of unknown function (DUF1116)	NA|513aa|down_5|NZ_LR698992.1_1532892_1534431_-	PRK06091, PRK06091, membrane protein FdrA; Validated	NA|520aa|down_6|NZ_LR698992.1_1534507_1536067_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|524aa|down_7|NZ_LR698992.1_1536181_1537753_-	COG2252, COG2252, Xanthine/uracil/vitamin C permease [Nucleotide transport and    metabolism]	NA|304aa|down_8|NZ_LR698992.1_1537903_1538815_-	PRK10094, PRK10094, HTH-type transcriptional activator AllS	NA|224aa|down_9|NZ_LR698992.1_1539047_1539719_+	pfam00857, Isochorismatase, Isochorismatase family
GCF_902386745.1_UHGG_MGYG-HGUT-02398	NZ_LR698992	Laribacter hongkongensis isolate MGYG-HGUT-02398 chromosome 1	3	2443477-2443563	3	CRISPRCasFinder	no		csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	Orphan	CCCGGCCCCCTCCCGTACGGACTCCTG	27	1	1	2443504-2443536	NZ_LR698992.1_1759594-1759626	NA	1	1	Orphan	csa3,DEDDh,WYL,cas1,cas3-cas2,cas8f,cas5f,cas7f,cas6f,DinG	NA,NA	NA|504aa|up_9|NZ_LR698992.1_2428760_2430272_+	PRK00484, lysS, lysyl-tRNA synthetase; Reviewed	NA|248aa|up_8|NZ_LR698992.1_2430441_2431185_+	PRK09692, PRK09692, integrase; Provisional	NA|197aa|up_7|NZ_LR698992.1_2432448_2433039_+	cd00801, INT_P4_C, Bacteriophage P4 integrase, C-terminal catalytic domain	NA|106aa|up_6|NZ_LR698992.1_2433696_2434014_+	cd14845, L-Ala-D-Glu_peptidase_like, L-Ala-D-Glu peptidase, also known as L-alanyl-D-glutamate endopeptidase	NA|374aa|up_5|NZ_LR698992.1_2433994_2435115_-	pfam13683, rve_3, Integrase core domain	NA|631aa|up_4|NZ_LR698992.1_2435555_2437448_+	COG0840, Tar, Methyl-accepting chemotaxis protein [Cell motility and secretion / Signal transduction mechanisms]	NA|314aa|up_3|NZ_LR698992.1_2437980_2438922_+	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|312aa|up_2|NZ_LR698992.1_2438954_2439890_+	COG0835, CheW, Chemotaxis signal transduction protein [Cell motility and secretion / Signal transduction mechanisms]	NA|136aa|up_1|NZ_LR698992.1_2439907_2440315_+	cd19923, REC_CheY_CheY3, phosphoacceptor receiver (REC) domain of chemotaxis response regulator CheY3 and similar CheY family proteins	NA|315aa|up_0|NZ_LR698992.1_2440341_2441286_+	pfam04344, CheZ, Chemotaxis phosphatase, CheZ	NA|725aa|down_0|NZ_LR698992.1_2443629_2445804_-	PRK15061, PRK15061, catalase/peroxidase	NA|345aa|down_1|NZ_LR698992.1_2446576_2447611_+	COG1858, MauG, Cytochrome c peroxidase [Inorganic ion transport and metabolism]	NA|987aa|down_2|NZ_LR698992.1_2447654_2450615_+	PRK10060, PRK10060, cyclic di-GMP phosphodiesterase	NA|367aa|down_3|NZ_LR698992.1_2450841_2451942_-	cd00342, gram_neg_porins, Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane	NA|256aa|down_4|NZ_LR698992.1_2452615_2453383_-	cd09086, ExoIII-like_AP-endo, Escherichia coli exonuclease III (ExoIII) and Neisseria meningitides NExo-like subfamily of the ExoIII family purinic/apyrimidinic (AP) endonucleases	NA|604aa|down_5|NZ_LR698992.1_2453456_2455268_-	PRK00476, aspS, aspartyl-tRNA synthetase; Validated	NA|208aa|down_6|NZ_LR698992.1_2455293_2455917_-	COG2928, COG2928, Uncharacterized conserved protein [Function unknown]	NA|81aa|down_7|NZ_LR698992.1_2455926_2456169_-	TIGR02605, CxxC_CxxC_SSSS, putative regulatory protein, FmdB family	NA|311aa|down_8|NZ_LR698992.1_2456726_2457659_+	PRK11890, PRK11890, phosphate acetyltransferase; Provisional	NA|396aa|down_9|NZ_LR698992.1_2457655_2458843_+	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed
