assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCA_003258705.1_ASM325870v1	CP029480	Arcticibacterium luteifluviistationis strain SM1504 chromosome, complete genome	1	1324128-1324989	1,1,1	CRT,CRISPRCasFinder,PILER-CR	no	cas2,cas1,RT,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	Type III-A,Type III-D,Type III-B,Type III-C	GTCTAAGTGGAGTATAAAATCCTTAATCAACTGAACG,GTCTAAGTGGAGTATAAAATCCTTAATCAACTGAAC,GTCTAAGTGGAGTATAAAATCCTTAATCAACTGAAC	37,36,36	0	0	NA	NA	NA:NA:NA	12,9,8	12	TypeIII-A,TypeIII-D,TypeIII-B,TypeIII-C	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	NA|306aa|up_9|CP029480.1_1306085_1307003_+,NA|555aa|up_5|CP029480.1_1314603_1316268_+,NA|317aa|up_0|CP029480.1_1323078_1324029_+,NA	NA|306aa|up_9|CP029480.1_1306085_1307003_+	NA	NA|594aa|up_8|CP029480.1_1307246_1309028_-	pfam07980, SusD_RagB, SusD family	NA|1073aa|up_7|CP029480.1_1309058_1312277_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|386aa|up_6|CP029480.1_1312971_1314129_+	pfam07470, Glyco_hydro_88, Glycosyl Hydrolase Family 88	NA|555aa|up_5|CP029480.1_1314603_1316268_+	NA	NA|464aa|up_4|CP029480.1_1317037_1318429_+	cd01185, INTN1_C_like, Integrase IntN1 of Bacteroides mobilizable transposon NBU1 and similar proteins, C-terminal catalytic domain	NA|255aa|up_3|CP029480.1_1318543_1319308_+	cd01285, nucleoside_deaminase, Nucleoside deaminases include adenosine, guanine and cytosine deaminases	NA|425aa|up_2|CP029480.1_1319352_1320627_+	pfam10137, TIR-like, Predicted nucleotide-binding protein containing TIR-like domain	NA|819aa|up_1|CP029480.1_1320608_1323065_+	TIGR03255, PhnV, 2-aminoethylphosphonate ABC transport system, membrane component PhnV	NA|317aa|up_0|CP029480.1_1323078_1324029_+	NA	cas2|98aa|down_0|CP029480.1_1325103_1325397_-	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	cas1|329aa|down_1|CP029480.1_1325393_1326380_-	cd09634, Cas1_I-II-III, CRISPR/Cas system-associated protein Cas1	RT|347aa|down_2|CP029480.1_1326381_1327422_-	cd03487, RT_Bac_retron_II, RT_Bac_retron_II: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	cas6|225aa|down_3|CP029480.1_1327431_1328106_-	pfam17262, DUF5328, Family of unknown function (DUF5328)	csm5gr7|374aa|down_4|CP029480.1_1328102_1329224_-	TIGR01899, cas_TM1807_csm5, CRISPR type III-A/MTUBE-associated RAMP protein Csm5	csm4gr5|345aa|down_5|CP029480.1_1329223_1330258_-	TIGR01903, Hypothetical_protein	csm3gr7|224aa|down_6|CP029480.1_1330239_1330911_-	cd09684, Csm3_III-A, CRISPR/Cas system-associated RAMP superfamily protein Csm3	csm2gr11|157aa|down_7|CP029480.1_1330907_1331378_-	pfam03750, Csm2_III-A, Csm2 Type III-A	cas10|493aa|down_8|CP029480.1_1331374_1332853_-	cd09680, Cas10_III, CRISPR/Cas system-associated protein Cas10	NA|905aa|down_9|CP029480.1_1333034_1335749_+	pfam12770, CHAT, CHAT domain
GCA_003258705.1_ASM325870v1	CP029480	Arcticibacterium luteifluviistationis strain SM1504 chromosome, complete genome	2	1731748-1731849	2	CRISPRCasFinder	no		RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	Orphan	GTGTCTTAGCGGTTGAAAAACTAACCG	27	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	NA|99aa|up_8|CP029480.1_1724519_1724816_-,NA|117aa|up_7|CP029480.1_1724877_1725228_-,NA|137aa|up_6|CP029480.1_1725237_1725648_-,NA|66aa|down_1|CP029480.1_1733653_1733851_+	NA|260aa|up_9|CP029480.1_1723558_1724338_-	cd05344, BKR_like_SDR_like, putative beta-ketoacyl acyl carrier protein [ACP] reductase (BKR)-like, SDR	NA|99aa|up_8|CP029480.1_1724519_1724816_-	NA	NA|117aa|up_7|CP029480.1_1724877_1725228_-	NA	NA|137aa|up_6|CP029480.1_1725237_1725648_-	NA	NA|393aa|up_5|CP029480.1_1725701_1726880_-	pfam00999, Na_H_Exchanger, Sodium/hydrogen exchanger family	NA|205aa|up_4|CP029480.1_1727047_1727662_+	COG0179, MhpD, 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]	NA|99aa|up_3|CP029480.1_1727777_1728074_-	TIGR02607, Virulence-associated_protein_I, addiction module antidote protein, HigA family	NA|303aa|up_2|CP029480.1_1728258_1729167_-	cd10447, GIY-YIG_unchar_2, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria and archaea	NA|235aa|up_1|CP029480.1_1729229_1729934_-	PRK07326, PRK07326, SDR family oxidoreductase	NA|480aa|up_0|CP029480.1_1730081_1731521_-	cd16026, GALNS_like, galactosamine-6-sulfatase; also known as N-acetylgalactosamine-6-sulfatase (GALNS)	NA|500aa|down_0|CP029480.1_1731961_1733461_+	PRK01406, gltX, glutamyl-tRNA synthetase; Reviewed	NA|66aa|down_1|CP029480.1_1733653_1733851_+	NA	NA|260aa|down_2|CP029480.1_1733847_1734627_+	pfam01987, AIM24, Mitochondrial biogenesis AIM24	NA|285aa|down_3|CP029480.1_1734773_1735628_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|201aa|down_4|CP029480.1_1735679_1736282_-	pfam00856, SET, SET domain	NA|978aa|down_5|CP029480.1_1736425_1739359_-	cd18012, DEXQc_arch_SWI2_SNF2, DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins	NA|152aa|down_6|CP029480.1_1739736_1740192_-	pfam13474, SnoaL_3, SnoaL-like domain	NA|320aa|down_7|CP029480.1_1740178_1741138_-	sd00038, Kelch, Kelch repeat	NA|471aa|down_8|CP029480.1_1741163_1742576_-	pfam07995, GSDH, Glucose / Sorbosone dehydrogenase	NA|282aa|down_9|CP029480.1_1742788_1743634_+	TIGR00571, DNA_adenine_methylase, DNA adenine methylase (dam)
GCA_003258705.1_ASM325870v1	CP029480	Arcticibacterium luteifluviistationis strain SM1504 chromosome, complete genome	3	3071347-3071483	3	CRISPRCasFinder	no	WYL	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	Unclear	GACGCGTAGTCCGTCTACCTACTCGGGCGATACGAACTATT	41	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	NA|453aa|up_9|CP029480.1_3055755_3057114_-,NA	NA|453aa|up_9|CP029480.1_3055755_3057114_-	NA	NA|806aa|up_8|CP029480.1_3057128_3059546_-	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|347aa|up_7|CP029480.1_3059617_3060658_+	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|761aa|up_6|CP029480.1_3060806_3063089_-	cd06563, GH20_chitobiase-like, The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin	WYL|230aa|up_5|CP029480.1_3063298_3063988_+	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|139aa|up_4|CP029480.1_3064063_3064480_+	pfam08570, DUF1761, Protein of unknown function (DUF1761)	NA|194aa|up_3|CP029480.1_3064480_3065062_+	COG4430, COG4430, Uncharacterized protein conserved in bacteria [Function unknown]	NA|192aa|up_2|CP029480.1_3065058_3065634_-	COG2249, MdaB, Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only]	NA|261aa|up_1|CP029480.1_3065709_3066492_-	COG2207, AraC, AraC-type DNA-binding domain-containing proteins [Transcription]	NA|1230aa|up_0|CP029480.1_3066771_3070461_+	PRK05297, PRK05297, phosphoribosylformylglycinamidine synthase; Provisional	NA|229aa|down_0|CP029480.1_3078408_3079095_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|419aa|down_1|CP029480.1_3079091_3080348_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|214aa|down_2|CP029480.1_3080495_3081137_+	cd05250, CC3_like_SDR_a, CC3(TIP30)-like, atypical (a) SDRs	NA|132aa|down_3|CP029480.1_3081137_3081533_-	pfam11680, DUF3276, Protein of unknown function (DUF3276)	NA|156aa|down_4|CP029480.1_3081767_3082235_+	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|304aa|down_5|CP029480.1_3082247_3083159_+	COG1721, COG1721, Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) [General function prediction only]	NA|182aa|down_6|CP029480.1_3083164_3083710_-	pfam12867, DinB_2, DinB superfamily	NA|517aa|down_7|CP029480.1_3083684_3085235_-	COG1649, COG1649, Uncharacterized protein conserved in bacteria [Function unknown]	NA|224aa|down_8|CP029480.1_3085381_3086053_+	pfam14123, DUF4290, Domain of unknown function (DUF4290)	NA|435aa|down_9|CP029480.1_3086075_3087380_+	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated
GCA_003258705.1_ASM325870v1	CP029480	Arcticibacterium luteifluviistationis strain SM1504 chromosome, complete genome	4	3854289-3854366	4	CRISPRCasFinder	no		RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	Orphan	ACCGATGTCATGTCCCTAAGGGAC	24	0	0	NA	NA	NA	1	1	Orphan	RT,cas3,DEDDh,cas2,cas1,cas6,csm5gr7,csm4gr5,csm3gr7,csm2gr11,cas10,WYL,csa3,PD-DExK,Cas9_archaeal	NA,NA|124aa|down_3|CP029480.1_3858603_3858975_-	NA|133aa|up_9|CP029480.1_3840986_3841385_+	pfam16231, DUF4890, Domain of unknown function (DUF4890)	NA|783aa|up_8|CP029480.1_3841442_3843791_-	cd09618, CBM9_like_2, DOMON-like type 9 carbohydrate binding module	NA|241aa|up_7|CP029480.1_3843985_3844708_-	PRK12378, PRK12378, YebC/PmpR family DNA-binding transcriptional regulator	NA|992aa|up_6|CP029480.1_3844836_3847812_+	pfam04738, Lant_dehydr_N, Lantibiotic dehydratase, C-terminus	NA|241aa|up_5|CP029480.1_3847821_3848544_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|346aa|up_4|CP029480.1_3848626_3849664_-	COG3275, LytS, Putative regulator of cell autolysis [Signal transduction mechanisms]	NA|222aa|up_3|CP029480.1_3849971_3850637_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|401aa|up_2|CP029480.1_3850949_3852152_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|229aa|up_1|CP029480.1_3852291_3852978_+	cd03024, DsbA_FrnE, DsbA family, FrnE subfamily; FrnE is a DsbA-like protein containing a CXXC motif	NA|411aa|up_0|CP029480.1_3853002_3854235_+	COG2027, DacB, D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) [Cell envelope biogenesis, outer membrane]	NA|256aa|down_0|CP029480.1_3854500_3855268_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|411aa|down_1|CP029480.1_3855273_3856506_-	cd06239, M14-like, Peptidase M14-like domain; uncharacterized subgroup	NA|698aa|down_2|CP029480.1_3856496_3858590_-	COG0768, FtsI, Cell division protein FtsI/penicillin-binding protein 2 [Cell envelope biogenesis, outer membrane]	NA|124aa|down_3|CP029480.1_3858603_3858975_-	NA	NA|306aa|down_4|CP029480.1_3859103_3860021_-	PRK00050, PRK00050, 16S rRNA (cytosine(1402)-N(4))-methyltransferase RsmH	NA|408aa|down_5|CP029480.1_3860117_3861341_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|397aa|down_6|CP029480.1_3861416_3862607_+	COG2814, AraJ, Arabinose efflux permease [Carbohydrate transport and metabolism]	NA|246aa|down_7|CP029480.1_3862603_3863341_-	PRK08193, araD, L-ribulose-5-phosphate 4-epimerase AraD	NA|477aa|down_8|CP029480.1_3863463_3864894_-	PRK02929, PRK02929, L-arabinose isomerase; Provisional	NA|327aa|down_9|CP029480.1_3864910_3865891_-	TIGR02666, Cyclic_pyranopterin_monophosphate_synthase, molybdenum cofactor biosynthesis protein A, bacterial
