assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	1	240621-241148	1	CRT	no		cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Orphan	TTANNAAAAATACAAATTTAA	21	0	0	NA	NA	NA	8	8	Orphan	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA|96aa|up_3|NZ_CP022380.1_230920_231208_+,NA	NA|137aa|up_9|NZ_CP022380.1_225049_225460_-	pfam10825, DUF2752, Protein of unknown function (DUF2752)	NA|118aa|up_8|NZ_CP022380.1_225456_225810_-	pfam05154, TM2, TM2 domain	NA|565aa|up_7|NZ_CP022380.1_225846_227541_-	PRK09376, rho, transcription termination factor Rho; Provisional	NA|203aa|up_6|NZ_CP022380.1_227789_228398_-	COG2860, COG2860, Predicted membrane protein [Function unknown]	NA|151aa|up_5|NZ_CP022380.1_228719_229172_-	cd14797, DUF302, Uncharacterized domain family DUF302	NA|490aa|up_4|NZ_CP022380.1_229275_230745_-	PRK05326, PRK05326, potassium/proton antiporter	NA|96aa|up_3|NZ_CP022380.1_230920_231208_+	NA	NA|612aa|up_2|NZ_CP022380.1_231305_233141_+	PRK00095, mutL, DNA mismatch repair endonuclease MutL	NA|252aa|up_1|NZ_CP022380.1_233156_233912_+	pfam01694, Rhomboid, Rhomboid family	NA|983aa|up_0|NZ_CP022380.1_234063_237012_-	pfam08487, VIT, Vault protein inter-alpha-trypsin domain	NA|138aa|down_0|NZ_CP022380.1_241479_241893_-	pfam01220, DHquinase_II, Dehydroquinase class II	NA|1118aa|down_1|NZ_CP022380.1_241958_245312_-	PRK12901, secA, preprotein translocase subunit SecA; Reviewed	NA|74aa|down_2|NZ_CP022380.1_245542_245764_-	pfam11387, DUF2795, Protein of unknown function (DUF2795)	NA|126aa|down_3|NZ_CP022380.1_245914_246292_-	TIGR00004, RutC_family_protein, reactive intermediate/imine deaminase	NA|529aa|down_4|NZ_CP022380.1_246610_248197_-	pfam01551, Peptidase_M23, Peptidase family M23	NA|228aa|down_5|NZ_CP022380.1_248396_249080_-	COG2884, FtsE, Predicted ATPase involved in cell division [Cell division and chromosome partitioning]	NA|305aa|down_6|NZ_CP022380.1_249086_250001_-	PRK00892, lpxD, UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional	NA|211aa|down_7|NZ_CP022380.1_250022_250655_-	pfam08889, WbqC, WbqC-like protein family	NA|512aa|down_8|NZ_CP022380.1_250667_252203_-	PRK10861, PRK10861, signal peptidase I	NA|67aa|down_9|NZ_CP022380.1_252673_252874_-	cd14792, GH27, glycosyl hydrolase family 27 (GH27)
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	2	745787-753534	1,1,2,2,3,4	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR,PILER-CR	no	cas2,cas1,cas9	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Type II-C, Type II-B,Type II-B,Type II-A, or Type II-C?	TCTGTAACTTTGTGATTAGTTACAAC,GTTGTAAAATCCTTTCAAAATCTGTAACTTTGTGATTAGTTACAAC,AANNNCNTTNNANNNTCNGTAACTTTGTGATTAGTTACAAC,GTTGTAAAATCCTTTCAA-AATCTGTAACTTTGTGATTAGTTACAAC,GTTGTAAATTGCTTTCAATTTTCCGTAACTTTGTGATTAGTTACAAC,GTTGTAAAATCCTTTCAA-AATCTGTAACTTTGTGATTAGTTACAAC	26,46,41,47,47,47	0	0	NA	NA	NA:NA:NA:NA:NA:NA	84,100,100,84,84,84	100	TypeII-C,TypeII-B,TypeII-B,TypeII-A,orTypeII-C?	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA|231aa|up_0|NZ_CP022380.1_744561_745254_+,NA	NA|248aa|up_9|NZ_CP022380.1_732073_732817_-	PRK07570, PRK07570, succinate dehydrogenase/fumarate reductase iron-sulfur subunit; Validated	NA|129aa|up_8|NZ_CP022380.1_732872_733259_-	cd16377, 23S_rRNA_IVP_like, 23S rRNA-intervening sequence protein and similar proteins	NA|677aa|up_7|NZ_CP022380.1_733329_735360_-	PRK07573, sdhA, fumarate reductase/succinate dehydrogenase flavoprotein subunit	NA|231aa|up_6|NZ_CP022380.1_735362_736055_-	cd03498, SQR_TypeB_2_TM, Succinate:quinone oxidoreductase (SQR)-like Type B subfamily 2, transmembrane subunit; composed of proteins with similarity to the SQRs of Geobacter metallireducens and Corynebacterium glutamicum	NA|592aa|up_5|NZ_CP022380.1_739436_741212_+	cd08977, SusD, starch binding outer membrane protein SusD	NA|298aa|up_4|NZ_CP022380.1_741284_742178_-	pfam02253, PLA1, Phospholipase A1	NA|232aa|up_3|NZ_CP022380.1_742384_743080_-	PRK00685, PRK00685, metal-dependent hydrolase; Provisional	NA|201aa|up_2|NZ_CP022380.1_743122_743725_-	pfam13023, HD_3, HD domain	NA|149aa|up_1|NZ_CP022380.1_743768_744215_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|231aa|up_0|NZ_CP022380.1_744561_745254_+	NA	NA|320aa|down_0|NZ_CP022380.1_753690_754650_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|401aa|down_1|NZ_CP022380.1_754673_755876_-	pfam00872, Transposase_mut, Transposase, Mutator family	cas2|113aa|down_2|NZ_CP022380.1_761000_761339_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|298aa|down_3|NZ_CP022380.1_761335_762229_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	NA|142aa|down_4|NZ_CP022380.1_762235_762661_-	cd18692, PIN_VapC-like, uncharacterized subfamily of the VapC-like nuclease family of the PIN domain superfamily	cas9|1475aa|down_5|NZ_CP022380.1_762644_767069_-	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|371aa|down_6|NZ_CP022380.1_767357_768470_+	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II	NA|841aa|down_7|NZ_CP022380.1_768487_771010_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|352aa|down_8|NZ_CP022380.1_771081_772137_-	pfam01636, APH, Phosphotransferase enzyme family	NA|289aa|down_9|NZ_CP022380.1_772353_773220_-	cd04181, NTP_transferase, NTP_transferases catalyze the transfer of nucleotides onto phosphosugars
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	3	756076-760755	2,5	CRISPRCasFinder,PILER-CR	no	cas2,cas1,cas9	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Type II-C, Type II-B,Type II-B,Type II-A, or Type II-C?	GTTGTAAAATCCTTTCAAAATCTGTAACTTTGTGATTAGTTACAAC,GTTGTAAAATCCTTTCAA-AATCTGTAACTTTGTGATTAGTTACAAC	46,47	0	0	NA	NA	NA:NA	61,33	61	TypeII-C,TypeII-B,TypeII-B,TypeII-A,orTypeII-C?	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA|231aa|up_3|NZ_CP022380.1_744561_745254_+,NA|127aa|up_2|NZ_CP022380.1_745412_745793_+,NA|218aa|down_9|NZ_CP022380.1_774639_775293_-	NA|231aa|up_9|NZ_CP022380.1_735362_736055_-	cd03498, SQR_TypeB_2_TM, Succinate:quinone oxidoreductase (SQR)-like Type B subfamily 2, transmembrane subunit; composed of proteins with similarity to the SQRs of Geobacter metallireducens and Corynebacterium glutamicum	NA|592aa|up_8|NZ_CP022380.1_739436_741212_+	cd08977, SusD, starch binding outer membrane protein SusD	NA|298aa|up_7|NZ_CP022380.1_741284_742178_-	pfam02253, PLA1, Phospholipase A1	NA|232aa|up_6|NZ_CP022380.1_742384_743080_-	PRK00685, PRK00685, metal-dependent hydrolase; Provisional	NA|201aa|up_5|NZ_CP022380.1_743122_743725_-	pfam13023, HD_3, HD domain	NA|149aa|up_4|NZ_CP022380.1_743768_744215_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|231aa|up_3|NZ_CP022380.1_744561_745254_+	NA	NA|127aa|up_2|NZ_CP022380.1_745412_745793_+	NA	NA|320aa|up_1|NZ_CP022380.1_753690_754650_+	COG2826, Tra8, Transposase and inactivated derivatives, IS30 family [DNA replication, recombination, and repair]	NA|401aa|up_0|NZ_CP022380.1_754673_755876_-	pfam00872, Transposase_mut, Transposase, Mutator family	cas2|113aa|down_0|NZ_CP022380.1_761000_761339_-	COG3512, COG3512, CRISPR-associated protein, Cas2 homolog [Defense mechanisms]	cas1|298aa|down_1|NZ_CP022380.1_761335_762229_-	TIGR03639, cas1_NMENI, CRISPR-associated endonuclease Cas1, subtype II/NMENI	NA|142aa|down_2|NZ_CP022380.1_762235_762661_-	cd18692, PIN_VapC-like, uncharacterized subfamily of the VapC-like nuclease family of the PIN domain superfamily	cas9|1475aa|down_3|NZ_CP022380.1_762644_767069_-	pfam18541, RuvC_III, RuvC endonuclease subdomain 3	NA|371aa|down_4|NZ_CP022380.1_767357_768470_+	cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP) synthase (KAS), type I and II	NA|841aa|down_5|NZ_CP022380.1_768487_771010_-	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|352aa|down_6|NZ_CP022380.1_771081_772137_-	pfam01636, APH, Phosphotransferase enzyme family	NA|289aa|down_7|NZ_CP022380.1_772353_773220_-	cd04181, NTP_transferase, NTP_transferases catalyze the transfer of nucleotides onto phosphosugars	NA|319aa|down_8|NZ_CP022380.1_773368_774325_-	pfam02562, PhoH, PhoH-like protein	NA|218aa|down_9|NZ_CP022380.1_774639_775293_-	NA
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	4	1382431-1382560	3	CRISPRCasFinder	no		cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Orphan	AATTACGAATTACGAGATTTAATTACGAATTAC	33	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA,NA|214aa|down_2|NZ_CP022380.1_1384737_1385379_-,NA|82aa|down_3|NZ_CP022380.1_1385546_1385792_+	NA|300aa|up_9|NZ_CP022380.1_1364717_1365617_-	TIGR03523, hypothetical_protein_P700755_05002, gliding motility associated protien GldN	NA|525aa|up_8|NZ_CP022380.1_1365628_1367203_-	TIGR03517, GldM_gliding, gliding motility-associated protein GldM	NA|221aa|up_7|NZ_CP022380.1_1367235_1367898_-	TIGR03513, GldL_gliding, gliding motility-associated protein GldL	NA|459aa|up_6|NZ_CP022380.1_1367952_1369329_-	TIGR03525, lipoprotein_putative, gliding motility-associated lipoprotein GldK	NA|713aa|up_5|NZ_CP022380.1_1369717_1371856_-	COG1505, COG1505, Serine proteases of the peptidase family S9A [Amino acid transport and metabolism]	NA|639aa|up_4|NZ_CP022380.1_1372179_1374096_-	pfam12741, SusD-like, Susd and RagB outer membrane lipoprotein	NA|1042aa|up_3|NZ_CP022380.1_1374115_1377241_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|145aa|up_2|NZ_CP022380.1_1377480_1377915_+	PRK00601, dut, dUTP diphosphatase	NA|259aa|up_1|NZ_CP022380.1_1377921_1378698_+	pfam14125, DUF4292, Domain of unknown function (DUF4292)	NA|380aa|up_0|NZ_CP022380.1_1378875_1380015_+	PRK09354, recA, recombinase A; Provisional	NA|121aa|down_0|NZ_CP022380.1_1382612_1382975_-	PRK12275, PRK12275, hypothetical protein; Reviewed	NA|471aa|down_1|NZ_CP022380.1_1383076_1384489_-	PRK09441, PRK09441, cytoplasmic alpha-amylase; Reviewed	NA|214aa|down_2|NZ_CP022380.1_1384737_1385379_-	NA	NA|82aa|down_3|NZ_CP022380.1_1385546_1385792_+	NA	NA|339aa|down_4|NZ_CP022380.1_1385892_1386909_-	cd12956, CBM_SusE-F_like, carbohydrate-binding modules from Bacteroides thetaiotaomicron SusE, SusF and similar proteins	NA|527aa|down_5|NZ_CP022380.1_1386921_1388502_-	NF033071, SusD, starch-binding outer membrane lipoprotein SusD	NA|979aa|down_6|NZ_CP022380.1_1388532_1391469_-	TIGR04056, OMP_RagA_SusC, TonB-linked outer membrane protein, SusC/RagA family	NA|468aa|down_7|NZ_CP022380.1_1392741_1394145_+	cd02966, TlpA_like_family, TlpA-like family; composed of  TlpA, ResA, DsbE and similar proteins	NA|281aa|down_8|NZ_CP022380.1_1394166_1395009_+	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain	NA|305aa|down_9|NZ_CP022380.1_1395014_1395929_+	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	5	1456348-1456464	4	CRISPRCasFinder	no	PD-DExK	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Unclear	TAGCACTTTGTCAAAGTCTCAAACTTTGACAAAGTAGATAA	41	0	0	NA	NA	NA	1	1	Orphan	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA|266aa|up_9|NZ_CP022380.1_1443285_1444083_+,NA|268aa|up_8|NZ_CP022380.1_1444135_1444939_+,NA|153aa|up_6|NZ_CP022380.1_1448346_1448805_+,NA|130aa|up_5|NZ_CP022380.1_1448850_1449240_+,NA|194aa|up_3|NZ_CP022380.1_1450593_1451175_+,NA|234aa|up_2|NZ_CP022380.1_1451603_1452305_+,NA|196aa|up_0|NZ_CP022380.1_1455734_1456322_+,NA|158aa|down_5|NZ_CP022380.1_1462748_1463222_-,NA|124aa|down_8|NZ_CP022380.1_1465844_1466216_-	NA|266aa|up_9|NZ_CP022380.1_1443285_1444083_+	NA	NA|268aa|up_8|NZ_CP022380.1_1444135_1444939_+	NA	NA|1095aa|up_7|NZ_CP022380.1_1445050_1448335_+	PLN02563, PLN02563, aminoacyl-tRNA ligase	NA|153aa|up_6|NZ_CP022380.1_1448346_1448805_+	NA	NA|130aa|up_5|NZ_CP022380.1_1448850_1449240_+	NA	NA|414aa|up_4|NZ_CP022380.1_1449349_1450591_+	pfam13304, AAA_21, AAA domain, putative AbiEii toxin, Type IV TA system	NA|194aa|up_3|NZ_CP022380.1_1450593_1451175_+	NA	NA|234aa|up_2|NZ_CP022380.1_1451603_1452305_+	NA	NA|985aa|up_1|NZ_CP022380.1_1452593_1455548_+	COG3587, COG3587, Restriction endonuclease [Defense mechanisms]	NA|196aa|up_0|NZ_CP022380.1_1455734_1456322_+	NA	NA|643aa|down_0|NZ_CP022380.1_1456481_1458410_+	COG2189, COG2189, Adenine specific DNA methylase Mod [DNA replication, recombination, and repair]	NA|327aa|down_1|NZ_CP022380.1_1458478_1459459_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|232aa|down_2|NZ_CP022380.1_1459900_1460596_-	cd06578, HemD, Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme	NA|378aa|down_3|NZ_CP022380.1_1460724_1461858_-	COG0801, FolK, 7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase [Coenzyme metabolism]	NA|227aa|down_4|NZ_CP022380.1_1462038_1462719_-	COG2885, OmpA, Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane]	NA|158aa|down_5|NZ_CP022380.1_1462748_1463222_-	NA	NA|495aa|down_6|NZ_CP022380.1_1463411_1464896_+	cd06437, CESA_CaSu_A2, Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex	NA|229aa|down_7|NZ_CP022380.1_1465001_1465688_-	pfam13557, Phenol_MetA_deg, Putative MetA-pathway of phenol degradation	NA|124aa|down_8|NZ_CP022380.1_1465844_1466216_-	NA	NA|191aa|down_9|NZ_CP022380.1_1466596_1467169_+	pfam13715, CarbopepD_reg_2, CarboxypepD_reg-like domain
GCF_002302515.1_ASM230251v1	NZ_CP022380	Capnocytophaga sp. H4358 chromosome, complete genome	6	2038423-2038567	3	CRT	no		cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	Orphan	TACAAAAGCATAAAATCGG	19	0	0	NA	NA	NA	3	3	Orphan	cas3,DEDDh,PD-DExK,cas2,cas1,cas9,WYL	NA|537aa|up_1|NZ_CP022380.1_2034967_2036578_+,NA|194aa|down_2|NZ_CP022380.1_2060655_2061237_+,NA|253aa|down_5|NZ_CP022380.1_2061866_2062625_+,NA|96aa|down_6|NZ_CP022380.1_2062658_2062946_-,NA|207aa|down_9|NZ_CP022380.1_2068145_2068766_+	NA|310aa|up_9|NZ_CP022380.1_2024693_2025623_-	PRK00856, pyrB, aspartate carbamoyltransferase catalytic subunit	NA|188aa|up_8|NZ_CP022380.1_2025631_2026195_-	PRK05205, PRK05205, bifunctional pyr operon transcriptional regulator/uracil phosphoribosyltransferase PyrR	NA|844aa|up_7|NZ_CP022380.1_2026198_2028730_-	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|311aa|up_6|NZ_CP022380.1_2028746_2029679_-	COG4874, COG4874, Uncharacterized protein conserved in bacteria containing a pentein-type domain [Function unknown]	NA|305aa|up_5|NZ_CP022380.1_2030069_2030984_-	COG1834, COG1834, N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism]	NA|317aa|up_4|NZ_CP022380.1_2031249_2032200_+	COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and metabolism]	NA|471aa|up_3|NZ_CP022380.1_2032386_2033799_+	COG1785, PhoA, Alkaline phosphatase [Inorganic ion transport and metabolism]	NA|327aa|up_2|NZ_CP022380.1_2033826_2034807_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|537aa|up_1|NZ_CP022380.1_2034967_2036578_+	NA	NA|477aa|up_0|NZ_CP022380.1_2036714_2038145_+	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|3414aa|down_0|NZ_CP022380.1_2039266_2049508_+	pfam13585, CHU_C, C-terminal domain of CHU protein family	NA|3494aa|down_1|NZ_CP022380.1_2049942_2060424_+	pfam13585, CHU_C, C-terminal domain of CHU protein family	NA|194aa|down_2|NZ_CP022380.1_2060655_2061237_+	NA	NA|94aa|down_3|NZ_CP022380.1_2061229_2061511_+	pfam13601, HTH_34, Winged helix DNA-binding domain	NA|94aa|down_4|NZ_CP022380.1_2061529_2061811_+	pfam13239, 2TM, 2TM domain	NA|253aa|down_5|NZ_CP022380.1_2061866_2062625_+	NA	NA|96aa|down_6|NZ_CP022380.1_2062658_2062946_-	NA	NA|640aa|down_7|NZ_CP022380.1_2063034_2064954_+	cd01347, ligand_gated_channel, TonB dependent/Ligand-Gated channels are created by a monomeric 22 strand (22,24) anti-parallel beta-barrel	NA|1039aa|down_8|NZ_CP022380.1_2065036_2068153_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|207aa|down_9|NZ_CP022380.1_2068145_2068766_+	NA
