assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	1	112396-112809	1	CRT	no		RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Orphan	TTCTACCGGNGGCTCTAC	18	6	10	112414-112467|112414-112467|112486-112539|112558-112587|112558-112587|112606-112635|112606-112635|112654-112671|112654-112671|112774-112791	NZ_AP014815.1_112366-112419|NZ_AP014815.1_112390-112443|NZ_AP014815.1_112378-112431|NZ_AP014815.1_112366-112395|NZ_AP014815.1_112390-112419|NZ_AP014815.1_112366-112395|NZ_AP014815.1_112390-112419|NZ_AP014815.1_112366-112383|NZ_AP014815.1_112390-112407|NZ_AP014815.1_112378-112395	NA	8	8	Orphan	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|381aa|up_1|NZ_AP014815.1_107565_108708_-,NA|84aa|down_5|NZ_AP014815.1_126741_126993_-	NA|901aa|up_9|NZ_AP014815.1_96328_99031_-	COG0643, CheA, Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms]	NA|854aa|up_8|NZ_AP014815.1_99186_101748_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|142aa|up_7|NZ_AP014815.1_101866_102292_-	pfam01584, CheW, CheW-like domain	NA|121aa|up_6|NZ_AP014815.1_102538_102901_-	cd19937, REC_OmpR_BsPhoP-like, phosphoacceptor receiver (REC) domain of BsPhoP-like OmpR family response regulators	NA|379aa|up_5|NZ_AP014815.1_102916_104053_-	cd17602, REC_PatA-like, phosphoacceptor receiver (REC) domain of PatA and similar domains	NA|96aa|up_4|NZ_AP014815.1_104711_104999_-	cd12399, RRM_HP0827_like, RNA recognition motif in Helicobacter pylori HP0827 protein and similar proteins	NA|591aa|up_3|NZ_AP014815.1_105164_106937_-	PRK05945, sdhA, succinate dehydrogenase/fumarate reductase flavoprotein subunit	NA|165aa|up_2|NZ_AP014815.1_107067_107562_+	smart00260, CheW, Two component signalling adaptor domain	NA|381aa|up_1|NZ_AP014815.1_107565_108708_-	NA	NA|658aa|up_0|NZ_AP014815.1_108812_110786_-	pfam03160, Calx-beta, Calx-beta domain	NA|178aa|down_0|NZ_AP014815.1_120340_120874_+	PRK00099, rplJ, 50S ribosomal protein L10; Reviewed	NA|127aa|down_1|NZ_AP014815.1_120931_121312_+	CHL00083, rpl12, ribosomal protein L12	NA|1032aa|down_2|NZ_AP014815.1_121399_124495_-	COG4889, COG4889, Predicted helicase [General function prediction only]	NA|142aa|down_3|NZ_AP014815.1_124610_125036_-	COG3011, COG3011, Predicted thiol-disulfide oxidoreductase [General function    prediction only]	NA|483aa|down_4|NZ_AP014815.1_125218_126667_-	PRK00012, gatA, Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase subunit GatA	NA|84aa|down_5|NZ_AP014815.1_126741_126993_-	NA	NA|379aa|down_6|NZ_AP014815.1_127223_128360_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|271aa|down_7|NZ_AP014815.1_128399_129212_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|280aa|down_8|NZ_AP014815.1_129344_130184_-	pfam00685, Sulfotransfer_1, Sulfotransferase domain	NA|245aa|down_9|NZ_AP014815.1_130211_130946_-	TIGR04155, hypothetical_protein, PEP-CTERM protein sorting domain, cyanobacterial subclass
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	2	156621-156727	1	CRISPRCasFinder	no	RT	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Unclear	TTTCTCCTAATTTTAACAAAATTTCTCGACTG	32	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|63aa|up_8|NZ_AP014815.1_148949_149138_+,NA|82aa|up_7|NZ_AP014815.1_149137_149383_+,NA|77aa|up_4|NZ_AP014815.1_151730_151961_-,NA|72aa|down_0|NZ_AP014815.1_158860_159076_-	NA|114aa|up_9|NZ_AP014815.1_148406_148748_+	pfam08869, XisI, XisI protein	NA|63aa|up_8|NZ_AP014815.1_148949_149138_+	NA	NA|82aa|up_7|NZ_AP014815.1_149137_149383_+	NA	RT|531aa|up_6|NZ_AP014815.1_149414_151007_-	cd01646, RT_Bac_retron_I, RT_Bac_retron_I: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|167aa|up_5|NZ_AP014815.1_151210_151711_+	PRK01617, PRK01617, hypothetical protein; Provisional	NA|77aa|up_4|NZ_AP014815.1_151730_151961_-	NA	NA|210aa|up_3|NZ_AP014815.1_152880_153510_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|473aa|up_2|NZ_AP014815.1_153837_155256_+	COG3264, COG3264, Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane]	NA|105aa|up_1|NZ_AP014815.1_155575_155890_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|114aa|up_0|NZ_AP014815.1_155886_156228_+	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|72aa|down_0|NZ_AP014815.1_158860_159076_-	NA	NA|1136aa|down_1|NZ_AP014815.1_159191_162599_-	pfam13809, Tubulin_2, Tubulin like	NA|126aa|down_2|NZ_AP014815.1_162735_163113_-	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|82aa|down_3|NZ_AP014815.1_163099_163345_-	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|374aa|down_4|NZ_AP014815.1_163506_164628_-	smart00327, VWA, von Willebrand factor (vWF) type A domain	NA|181aa|down_5|NZ_AP014815.1_164773_165316_-	pfam11322, DUF3124, Protein of unknown function (DUF3124)	NA|440aa|down_6|NZ_AP014815.1_165493_166813_+	cd06346, PBP1_ABC_ligand_binding-like, type 1 periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems predicted to be involved in uptake of amino acids, peptides, or inorganic ions	NA|198aa|down_7|NZ_AP014815.1_166888_167482_+	cd02042, ParAB_family, partition proteins ParAB family	NA|456aa|down_8|NZ_AP014815.1_167566_168934_+	smart00857, Resolvase, Resolvase, N terminal domain	NA|291aa|down_9|NZ_AP014815.1_168950_169823_-	PRK05481, PRK05481, lipoyl synthase; Provisional
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	3	1387979-1390024	1,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Unclear	GTGATCAACGCCTAATGGCGATCGAAGGTTAAACAG,GTGATCAACGCCTAATGGCGATCGAAGGTTAAACAG,GTGATCAACGCCTAATGGCGATCGAAGGTTAAACAG	36,36,36	0	0	NA	NA	NA:NA:NA	28,28,28	28	Unclear	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|192aa|up_9|NZ_AP014815.1_1370645_1371221_+,NA	NA|192aa|up_9|NZ_AP014815.1_1370645_1371221_+	NA	NA|461aa|up_8|NZ_AP014815.1_1371485_1372868_-	COG4250, COG4250, Predicted sensor protein/domain [Signal transduction mechanisms]	NA|265aa|up_7|NZ_AP014815.1_1373022_1373817_-	pfam13911, AhpC-TSA_2, AhpC/TSA antioxidant enzyme	NA|1264aa|up_6|NZ_AP014815.1_1374335_1378127_-	cd17267, RMtype1_S_EcoAO83I-TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to S	NA|638aa|up_5|NZ_AP014815.1_1378365_1380279_-	pfam15978, TnsD, Tn7-like transposition protein D	NA|656aa|up_4|NZ_AP014815.1_1380299_1382267_-	pfam15978, TnsD, Tn7-like transposition protein D	NA|517aa|up_3|NZ_AP014815.1_1382263_1383814_-	pfam13401, AAA_22, AAA domain	NA|729aa|up_2|NZ_AP014815.1_1383806_1385993_-	pfam00665, rve, Integrase core domain	NA|285aa|up_1|NZ_AP014815.1_1385989_1386844_-	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|327aa|up_0|NZ_AP014815.1_1386972_1387953_+	pfam14072, DndB, DNA-sulfur modification-associated	cas5|241aa|down_0|NZ_AP014815.1_1390286_1391009_-	TIGR02586, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5/DevS, subtype MYXAN	cas7|305aa|down_1|NZ_AP014815.1_1391079_1391994_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b3|560aa|down_2|NZ_AP014815.1_1392059_1393739_-	TIGR04413, hypothetical_protein_LEP1GSC082_4029, CRISPR type MYXAN-associated protein Cmx8	cas3|777aa|down_3|NZ_AP014815.1_1393738_1396069_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas6|223aa|down_4|NZ_AP014815.1_1396046_1396715_-	pfam09559, Cas6, Cas6 Crispr	WYL|324aa|down_5|NZ_AP014815.1_1397061_1398033_-	pfam13280, WYL, WYL domain	cas1|556aa|down_6|NZ_AP014815.1_1398389_1400057_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|down_7|NZ_AP014815.1_1400061_1400355_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|626aa|down_8|NZ_AP014815.1_1402405_1404283_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|187aa|down_9|NZ_AP014815.1_1404401_1404962_-	pfam11371, DUF3172, Protein of unknown function (DUF3172)
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	4	1400605-1402284	2,3,3	PILER-CR,CRISPRCasFinder,CRT	no	cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Unclear	GTGATCAACGCCTAATGGCGATCGAAGGTTAAACAG,CTGTTTAACCTTCGATCGCCATTAGGCGTTGATCAC,CTGTTTAACCTTCGATCGCCATTAGGCGTTGATCAC	36,36,36	0	0	NA	NA	NA:NA:NA	22,23,23	23	Unclear	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA,NA|257aa|down_5|NZ_AP014815.1_1409673_1410444_-,NA|172aa|down_9|NZ_AP014815.1_1415043_1415559_-	NA|285aa|up_9|NZ_AP014815.1_1385989_1386844_-	pfam08722, Tn7_Tnp_TnsA_N, TnsA endonuclease N terminal	NA|327aa|up_8|NZ_AP014815.1_1386972_1387953_+	pfam14072, DndB, DNA-sulfur modification-associated	cas5|241aa|up_7|NZ_AP014815.1_1390286_1391009_-	TIGR02586, CRISPR-associated_protein_Cas5, CRISPR-associated protein Cas5/DevS, subtype MYXAN	cas7|305aa|up_6|NZ_AP014815.1_1391079_1391994_-	cd09687, Cas7_I-C, CRISPR/Cas system-associated RAMP superfamily protein Cas7	cas8b3|560aa|up_5|NZ_AP014815.1_1392059_1393739_-	TIGR04413, hypothetical_protein_LEP1GSC082_4029, CRISPR type MYXAN-associated protein Cmx8	cas3|777aa|up_4|NZ_AP014815.1_1393738_1396069_-	TIGR01587, CRISPR-associated_endonuclease/helicase_Cas3, CRISPR-associated helicase Cas3	cas6|223aa|up_3|NZ_AP014815.1_1396046_1396715_-	pfam09559, Cas6, Cas6 Crispr	WYL|324aa|up_2|NZ_AP014815.1_1397061_1398033_-	pfam13280, WYL, WYL domain	cas1|556aa|up_1|NZ_AP014815.1_1398389_1400057_+	TIGR03983, hypothetical_protein_LA3181, CRISPR-associated endonuclease Cas1, subtype MYXAN	cas2|98aa|up_0|NZ_AP014815.1_1400061_1400355_+	pfam09827, CRISPR_Cas2, CRISPR associated protein Cas2	NA|626aa|down_0|NZ_AP014815.1_1402405_1404283_-	PRK00331, PRK00331, isomerizing glutamine--fructose-6-phosphate transaminase	NA|187aa|down_1|NZ_AP014815.1_1404401_1404962_-	pfam11371, DUF3172, Protein of unknown function (DUF3172)	NA|604aa|down_2|NZ_AP014815.1_1405172_1406984_+	cd01948, EAL, EAL domain	NA|289aa|down_3|NZ_AP014815.1_1407169_1408036_-	pfam03881, Fructosamin_kin, Fructosamine kinase	NA|511aa|down_4|NZ_AP014815.1_1408046_1409579_-	COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism]	NA|257aa|down_5|NZ_AP014815.1_1409673_1410444_-	NA	NA|456aa|down_6|NZ_AP014815.1_1410529_1411897_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|354aa|down_7|NZ_AP014815.1_1411982_1413044_+	COG0836, {ManC}, Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane]	NA|604aa|down_8|NZ_AP014815.1_1413181_1414993_+	cd01115, SLC13_permease, Permease SLC13 (solute carrier 13)	NA|172aa|down_9|NZ_AP014815.1_1415043_1415559_-	NA
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	5	2101328-2101419	4	CRISPRCasFinder	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Type I-D	AGTTTCAATCCCTCATAGGTATT	23	0	0	NA	NA	NA	1	1	TypeI-D	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|64aa|up_8|NZ_AP014815.1_2093808_2094000_-,NA|106aa|down_0|NZ_AP014815.1_2101593_2101911_+,NA|85aa|down_1|NZ_AP014815.1_2102129_2102384_+,NA|93aa|down_3|NZ_AP014815.1_2105980_2106259_+,NA|69aa|down_4|NZ_AP014815.1_2106261_2106468_+	NA|368aa|up_9|NZ_AP014815.1_2092652_2093756_-	PRK00389, gcvT, glycine cleavage system aminomethyltransferase GcvT	NA|64aa|up_8|NZ_AP014815.1_2093808_2094000_-	NA	NA|97aa|up_7|NZ_AP014815.1_2094074_2094365_-	pfam03929, PepSY_TM, PepSY-associated TM region	NA|450aa|up_6|NZ_AP014815.1_2094517_2095867_-	COG0642, BaeS, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|226aa|up_5|NZ_AP014815.1_2095832_2096510_-	COG0745, OmpR, Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [Signal transduction mechanisms / Transcription]	WYL|285aa|up_4|NZ_AP014815.1_2096637_2097492_-	pfam13280, WYL, WYL domain	NA|88aa|up_3|NZ_AP014815.1_2097619_2097883_+	COG1669, COG1669, Predicted nucleotidyltransferases [General function prediction only]	NA|115aa|up_2|NZ_AP014815.1_2097921_2098266_+	COG2361, COG2361, Uncharacterized conserved protein [Function unknown]	NA|151aa|up_1|NZ_AP014815.1_2098258_2098711_+	pfam18765, Polbeta, Polymerase beta, Nucleotidyltransferase	cas3|727aa|up_0|NZ_AP014815.1_2099146_2101327_+	cd09710, Cas3_I-D, CRISPR/Cas system-associated protein Cas3; Distinct diverged subfamily of Cas3 helicase domain	NA|106aa|down_0|NZ_AP014815.1_2101593_2101911_+	NA	NA|85aa|down_1|NZ_AP014815.1_2102129_2102384_+	NA	cas10d|1169aa|down_2|NZ_AP014815.1_2102453_2105960_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|93aa|down_3|NZ_AP014815.1_2105980_2106259_+	NA	NA|69aa|down_4|NZ_AP014815.1_2106261_2106468_+	NA	csc2gr7|340aa|down_5|NZ_AP014815.1_2106505_2107525_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|250aa|down_6|NZ_AP014815.1_2107585_2108335_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	cas6|273aa|down_7|NZ_AP014815.1_2108466_2109285_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|197aa|down_8|NZ_AP014815.1_2109346_2109937_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	NA|123aa|down_9|NZ_AP014815.1_2110022_2110391_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	6	2111973-2118063	3,5,4,4,5	PILER-CR,CRISPRCasFinder,CRT,PILER-CR,PILER-CR	no	WYL,cas3,cas10d,csc2gr7,csc1gr5,cas6,cas4,cas1,cas2	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Type I-D	GTTTAAATTATTATTAATACCTATCAGGGATTGAAAC,GTTTAAATTATTATTAATACCTATCAGGGATTGAAAC,GTTTAAATTATTATTAATACCTATCAGGGATTGAAAC,GTTTAAATTATTATTAATACCTATCAGGGATTGAAAC,GTTTAAATTATTATTAATACCTATCAGGGATTGAAAC	37,37,37,37,37	0	0	NA	NA	I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B:I-D,II-B	79,83,83,79,79	83	TypeI-D	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|93aa|up_8|NZ_AP014815.1_2105980_2106259_+,NA|69aa|up_7|NZ_AP014815.1_2106261_2106468_+,NA|101aa|down_4|NZ_AP014815.1_2122634_2122937_-,NA|82aa|down_6|NZ_AP014815.1_2124020_2124266_+	cas10d|1169aa|up_9|NZ_AP014815.1_2102453_2105960_+	TIGR03174, cas_Csc3, CRISPR type I-D/CYANO-associated protein Csc3/Cas10d	NA|93aa|up_8|NZ_AP014815.1_2105980_2106259_+	NA	NA|69aa|up_7|NZ_AP014815.1_2106261_2106468_+	NA	csc2gr7|340aa|up_6|NZ_AP014815.1_2106505_2107525_+	pfam18320, Csc2, Csc2 Crispr	csc1gr5|250aa|up_5|NZ_AP014815.1_2107585_2108335_+	cd09711, Csc1_I-D, CRISPR/Cas system-associated protein Csc1	cas6|273aa|up_4|NZ_AP014815.1_2108466_2109285_+	COG5551, COG5551, CRISPR system related protein, RAMP superfamily [Defense    mechanisms]	cas4|197aa|up_3|NZ_AP014815.1_2109346_2109937_+	TIGR00372, conserved_hypothetical_protein, CRISPR-associated protein Cas4	NA|123aa|up_2|NZ_AP014815.1_2110022_2110391_+	TIGR02436, S23_ribosomal_protein, four helix bundle protein	cas1|336aa|up_1|NZ_AP014815.1_2110451_2111459_+	TIGR04093, hypothetical_protein_L8106_25395, CRISPR-associated endonuclease Cas1, subtype CYANO	cas2|92aa|up_0|NZ_AP014815.1_2111470_2111746_+	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	NA|197aa|down_0|NZ_AP014815.1_2118304_2118895_-	cd16383, GUN4, porphyrin-binding protein domain GUN4	NA|467aa|down_1|NZ_AP014815.1_2118994_2120395_-	PRK05478, PRK05478, 3-isopropylmalate dehydratase large subunit	NA|123aa|down_2|NZ_AP014815.1_2120601_2120970_-	PRK11770, PRK11770, YccF domain-containing protein	NA|133aa|down_3|NZ_AP014815.1_2121274_2121673_-	pfam01641, SelR, SelR domain	NA|101aa|down_4|NZ_AP014815.1_2122634_2122937_-	NA	NA|258aa|down_5|NZ_AP014815.1_2123092_2123866_-	cd01000, PBP2_Cys_DEBP_like, Substrate-binding domain of cysteine- and aspartate/glutamate-binding proteins; the type 2 periplasmic-binding protein fold	NA|82aa|down_6|NZ_AP014815.1_2124020_2124266_+	NA	NA|578aa|down_7|NZ_AP014815.1_2124356_2126090_+	COG0497, RecN, ATPase involved in DNA repair [DNA replication, recombination, and repair]	NA|802aa|down_8|NZ_AP014815.1_2126093_2128499_-	COG4354, COG4354, Predicted bile acid beta-glucosidase [Carbohydrate transport and metabolism]	NA|193aa|down_9|NZ_AP014815.1_2128770_2129349_-	PRK02726, PRK02726, molybdenum cofactor guanylyltransferase
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014815	Geminocystis sp. NIES-3708	7	2553978-2554600	6,6,5	PILER-CR,CRISPRCasFinder,CRT	no		RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	Orphan	GTTGAAATAAGAAAATACCTTCTATAGGGATTGAAAG,GTTGAAATAAGAAAATACCTTCTATAGGGATTGAAAG,GTTGAAATAAGAAAATACCTTCTATAGGGATTGAAAG	37,37,37	0	0	NA	NA	NA:NA:NA	7,8,8	8	Orphan	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|50aa|up_5|NZ_AP014815.1_2544528_2544678_-,NA|148aa|up_4|NZ_AP014815.1_2544676_2545120_+,NA|85aa|up_2|NZ_AP014815.1_2546344_2546599_-,NA	NA|232aa|up_9|NZ_AP014815.1_2535888_2536584_-	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|365aa|up_8|NZ_AP014815.1_2536646_2537741_-	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|276aa|up_7|NZ_AP014815.1_2541356_2542184_+	COG2842, COG2842, Uncharacterized ATPase, putative transposase [General function prediction only]	NA|163aa|up_6|NZ_AP014815.1_2542183_2542672_+	pfam06527, TniQ, TniQ	NA|50aa|up_5|NZ_AP014815.1_2544528_2544678_-	NA	NA|148aa|up_4|NZ_AP014815.1_2544676_2545120_+	NA	NA|383aa|up_3|NZ_AP014815.1_2545154_2546303_-	cd17282, RMtype1_S_Eco16444ORF1681_TRD1-CR1_like, Type I restriction-modification system specificity (S) subunit Target Recognition Domain-ConseRved domain (TRD-CR), similar to Escherichia coli G4/9 S subunit (S	NA|85aa|up_2|NZ_AP014815.1_2546344_2546599_-	NA	NA|88aa|up_1|NZ_AP014815.1_2546604_2546868_-	pfam04365, BrnT_toxin, Ribonuclease toxin, BrnT, of type II toxin-antitoxin system	NA|140aa|up_0|NZ_AP014815.1_2551195_2551615_-	cd01105, HTH_GlnR-like, Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators	NA|186aa|down_0|NZ_AP014815.1_2555083_2555641_+	PRK09364, moaC, cyclic pyranopterin monophosphate synthase MoaC	NA|101aa|down_1|NZ_AP014815.1_2555695_2555998_-	CHL00074, rps14, ribosomal protein S14	NA|309aa|down_2|NZ_AP014815.1_2556202_2557129_-	pfam13354, Beta-lactamase2, Beta-lactamase enzyme family	NA|231aa|down_3|NZ_AP014815.1_2557231_2557924_+	pfam00877, NLPC_P60, NlpC/P60 family	NA|302aa|down_4|NZ_AP014815.1_2557972_2558878_-	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|341aa|down_5|NZ_AP014815.1_2559507_2560530_+	PRK05385, PRK05385, phosphoribosylaminoimidazole synthetase; Provisional	NA|273aa|down_6|NZ_AP014815.1_2560873_2561692_+	PRK00513, minC, septum formation inhibitor; Reviewed	NA|271aa|down_7|NZ_AP014815.1_2561845_2562658_+	COG2894, MinD, Septum formation inhibitor-activating ATPase [Cell division and chromosome partitioning]	NA|94aa|down_8|NZ_AP014815.1_2562705_2562987_+	PRK13988, PRK13988, cell division topological specificity factor MinE; Provisional	NA|461aa|down_9|NZ_AP014815.1_2563416_2564799_-	PRK13352, PRK13352, phosphomethylpyrimidine synthase ThiC
GCF_001548095.1_Gm3709_assembly_1.0	NZ_AP014818	Geminocystis sp. NIES-3708 plasmid pGM03, complete sequence	1	13208-13299	1	CRISPRCasFinder	no			Orphan	CCTCTACCTTTCCTCTACCTTTACT	25	0	0	NA	NA	NA	1	1	Orphan	RT,csa3,c2c5_V-U5,cas5,cas7,cas8b3,cas3,cas6,WYL,cas1,cas2,DinG,cas10d,csc2gr7,csc1gr5,cas4,cas14k,DEDDh	NA|85aa|up_9|NZ_AP014818.1_1717_1972_+,NA|270aa|up_6|NZ_AP014818.1_7009_7819_-,NA|225aa|up_1|NZ_AP014818.1_11143_11818_-,NA|235aa|down_1|NZ_AP014818.1_16309_17014_-,NA|62aa|down_3|NZ_AP014818.1_17435_17621_-	NA|85aa|up_9|NZ_AP014818.1_1717_1972_+	NA	NA|83aa|up_8|NZ_AP014818.1_1968_2217_+	pfam10049, DUF2283, Protein of unknown function (DUF2283)	NA|1245aa|up_7|NZ_AP014818.1_2942_6677_+	smart00382, AAA, ATPases associated with a variety of cellular activities	NA|270aa|up_6|NZ_AP014818.1_7009_7819_-	NA	NA|282aa|up_5|NZ_AP014818.1_8014_8860_+	COG4974, XerD, Site-specific recombinase XerD [DNA replication, recombination, and repair]	NA|274aa|up_4|NZ_AP014818.1_8981_9803_+	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|137aa|up_3|NZ_AP014818.1_9827_10238_-	pfam18478, PIN_10, PIN like domain	NA|239aa|up_2|NZ_AP014818.1_10339_11056_-	COG2442, COG2442, Uncharacterized conserved protein [Function unknown]	NA|225aa|up_1|NZ_AP014818.1_11143_11818_-	NA	NA|330aa|up_0|NZ_AP014818.1_11821_12811_-	COG2856, COG2856, Predicted Zn peptidase [Amino acid transport and metabolism]	NA|803aa|down_0|NZ_AP014818.1_13681_16090_-	pfam13148, DUF3987, Protein of unknown function (DUF3987)	NA|235aa|down_1|NZ_AP014818.1_16309_17014_-	NA	NA|110aa|down_2|NZ_AP014818.1_17109_17439_-	pfam13518, HTH_28, Helix-turn-helix domain	NA|62aa|down_3|NZ_AP014818.1_17435_17621_-	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA	NA|NA	NA
