Overview
Our system allow user to upload files containing processed nucleotide sequences and identify what species each file contains with the below nucleobases.
Supported nucleobases
The system supports a combination of nucleobase symbols used in DNA sequencing. Each letter stands for a specific nucleobase or a group of nucleobases:
- Name
A
- Type
- Adenine
- Description
A purine nucleobase that pairs with Thymine in DNA through two hydrogen bonds.
- Name
G
- Type
- Guanine
- Description
Another purine nucleobase, which pairs with Cytosine in DNA via three hydrogen bonds.
- Name
C
- Type
- Cytosine
- Description
A pyrimidine nucleobase that pairs with Guanine.
- Name
T
- Type
- Thymine
- Description
Also a pyrimidine nucleobase, pairing with Adenine.
- Name
N
- Type
- Any (∀)
- Description
Any Nucleotide (A, G, C, or T).
- Name
M
- Type
- Amino group
- Description
Amino group, representing either A or C.
- Name
R
- Type
- Purine
- Description
Purine, representing either A or G.
- Name
W
- Type
- Weak (A or T)
- Description
Weak, representing either A or T.
- Name
S
- Type
- Strong (G or C)
- Description
Strong, representing either G or C.
- Name
Y
- Type
- Pyrimidine
- Description
Pyrimidine, representing either C or T.
- Name
K
- Type
- Keto
- Description
Keto, representing either G or T.
- Name
V
- Type
- Not T
- Description
Not T (A, C, or G).
- Name
H
- Type
- Not G
- Description
Not G (A, C, or T).
- Name
D
- Type
- Not C
- Description
Not C (A, G, or T).
- Name
B
- Type
- Not A
- Description
Not A (C, G, or T).
Preparing your data
Data can be prepared using various formats, such as FASTA, CSV, TXT, remember to remove raw reads.
Using FASTA file
The FASTA format is a text-based format for representing nucleotide sequences (like DNA or RNA) or peptide sequences (amino acids), commonly used in bioinformatics. It is a simple and widely adopted format for input into many sequence analysis tools and databases, there are two main parts:
- Header Line: Begins with ">" and includes the sequence name or identifier.
- Sequence Lines: Contain the genetic sequence in letters (like A, G, C, T for DNA, or amino acid codes for proteins).
Example.fasta
>sp|Q6GZX4|001R_FRG3G Putative transcription factor 001R OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-001R PE=4 SV=1
VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCDBSCGABBVTRWMMMVTKBGM
HYSHYGSGMKNGDYCVDMGKKCAYBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDMD
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD
>sp|Q6GZX3|002L_FRG3G Uncharacterized protein 002L OS=Frog virus 3 (isolate Goorha) OX=654924 GN=FV3-002L PE=4 SV=1
VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCDBSCGABBVTRWMMMVTKBGM
HYSHYGSGMKNGDYCVDMGKKCAYBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDMD
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD
... ...
Using CSV file
The CSV (Comma-Separated Values) format is a simple text format used to store tabular data (like a spreadsheet or a database) in plain text. Each data separated by comma will be a single genetic reads.
- No Header: Do not include any header, just include reads separted by comma.
Example.csv
VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCDBSCGABBVTRWMMMVTKBGM
HYSHYGSGMKNGDYCVDMGKKCA,YBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDM
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD,VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCD
HYSHYGSGMKNGDYCVDMGK,KCAYBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDM
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD
... ...
Using TXT file
The TXT format is a plain text format, you can separate your data with either comma or new line.
- Use comma: Do not include any header, follow CSV style
- Use newline: Make sure each reads are separated with a new line.
Example.txt
VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCDBSCGABBVTRWMMMVTKBGM
HYSHYGSGMKNGDYCVDMGKKCAYBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDMD
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD
VRVTVVADBKMWTDKRDSCCBBVKHRDDWNTGKGMNADCDBSCGABBVTRWMMMVTKBGM
HYSHYGSGMKNGDYCVDMGKKCAYBTARBRHBDWVAWVNHNWMWNMTGWGRCCYCRYDMD
STKRGSVVTVTDVRCTSYMHRKTVBDCTRNDCVNSSBMGDTSWNTSKYRABGWVWSYABY
WHGCHCTRHWKWSVVCCAD
... ...
Contact Us
For general inquiries, please send an email to
Admin: nucbadmin@nucbarcoder.com
Privacy Policy
Last revised on 29/Nov/2023
The Gist
Nucbarcoder ltd. may collect certain non-personally identify information about you as you use our sites. We may use this data to better understand our users. We can also publish this data, but the data will be about a large group of users, not individuals.
We will also ask you to provide personal information, but you'll always be able to opt out. If you give us personal information, we won't do anything evil with it.
We don't use any cookies
That's the basic idea, but you must read through the entire Privacy Policy below and agree with all the details before you use any of our sites.
Questions
If you have question about this Privacy Policy, please contact us at nucbadmin@nucbarcoder.com
Visitors
Like most website operators, Nucbarcoder ltd. collects non-personally-identifying information of the sort that web browsers and servers typically make available, such as the browser type, language preference, referring site, and the date and time of each visitor request. Our purpose in collecting non-personally identifying information is to better understand how our visitors use its website. From time to time, we may release non-personally-identifying information in the aggregate, e.g., by publishing a report on trends in the usage of its website.
Nucbarcoder ltd. also collects potentially personally-identifying information like Internet Protocol (IP) addresses. Nucbarcoder ltd. does not use such information to identify its visitors, however, and does not disclose such information, other than under the same circumstances that it uses and discloses personally-identifying information, as described below. We may also collect and use IP addresses to block users who violated our Terms of Service.
Data Storage
Nucbarcoder ltd. uses third party vendors and hosting partners to provide the necessary hardware, software, networking, storage, and related technology required to run the Service. You understand that although you retain full rights to your data, it may be stored on third party storage and transmitted through third party networks.
Privacy Policy Changes
Although most changes are likely to be minor, Nucbarcoder ltd. may change its Privacy Policy from time to time, and in it's sole discretion. We encourage visitors to frequently check this page for any changes to its Privacy Policy. Your continued use of this site after any change in this Privacy Policy will constitute your acceptance of such change.