Assembler is a type of software or tool used in the process of genome or transcriptome assembly. Genome or transcriptome assembly involves reconstructing longer, more complete sequences from short DNA or RNA sequencing reads. Assemblers take these short reads as input and use various algorithms to piece them together into longer sequences, ultimately producing a more comprehensive picture of the genome or transcriptome being studied
DNBSEQ™ technology is BGI's proprietary sequencing technology that reduces the error rate that occurs during amplification to EXTREMELY LOW levels thanks to the Rolling Circle Amplification (RCA) technique. DNBSEQ™ creates DNA NanoBalls (DNBs) combined with the use of Flow Cell chips with extremely sophisticated design, high density to help the activation sites are distributed evenly, avoiding interference with fluorescent signals
Down-sampling is the process of reducing the size or quantity of a dataset, often applied in the field of signal processing, statistics, machine learning, and many other fields. In the context of genomic or sequencing data, downsampling can be applied to reduce the size of the sequencing dataset by removing part of the reads or reducing the size of each read in the dataset. This process can be done to reduce the computational resources required for analysis or to strike a balance between accuracy and speed during data analysis.
Illumina's sequencing technology uses amplification and cluster amplification to create a large number of clusters of DNA fragments in a single reaction. The fragments are then sequenced simultaneously using single nucleotides used to identify each location on the chain. Illumina's sequencing technology provides high performance, good accuracy and affordable cost compared to previous sequencing methods
Kraken 2 is a taxonomic classification system using exact k-mer matches to achieve high accuracy and fast classification speeds. This classifier matches each k-mer within a query sequence to the lowest common ancestor (LCA) of all genomes containing the given k-mer
Nanopore sequencing is a DNA and RNA sequencing method, based on the use of nanotek techniques to determine the sequence of nucleic molecules. This technology is part of third-generation sequencing (NGS), with the main advantage being the ability to directly sequence nucleic molecules without the need to copy into multiple copies like traditional sequencing methods
Next-generation sequencing (NGS) is a new technology used for DNA and RNA sequencing and variant/mutation detection. NGS can sequence hundreds and thousands of genes or whole genome in a short period of time
The OLC approach is based on graph overlaps. Fundamentally, this approach operates in three stages. First, overlaps (O) are found between all the reads. The OLC method then creates the layout (L) of all reads and overlaps the details in the graph
SMRT (Single Molecule, Real-Time) sequencing, is a type of long-read sequencing. In SMRT sequencing, DNA sequences are read longer than in short-read sequencing. Specifically, readable sequence fragments can reach tens or even hundreds of thousands of bp each, helping to identify complex gene structures and detect major gene variations
Taxon is a group of organisms that are classified together based on shared characteristics. Taxa are used in taxonomy, which is the science of classifying and naming organisms. Taxa can range from large groups, such as kingdoms and phyla, to smaller groups, such as genera and species
Unicycler: is a bioinformatics tool used for genome assembly, particularly for bacterial and viral genomes. It's designed to automatically generate complete genomes from short-read sequencing data, including both chromosomal and plasmid-encoded genes.