The GenoToul bioinformatics platform provides access to high-performance computing resources with softwares already installed to ease its usage. An exhaustive list is provided hereunder. Software are updated only upon user request. If you need any other software or if you need an update, fill the installation software form.
Application | Description | Availability/Use |
---|---|---|
3D-DNA | 3D de novo assembly (3D-DNA) pipeline. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
3rdChimeraMiner | Exploration of whole genome amplification generated chimeric sequences in long-read sequencing data. | Genobioinfo Cluster: How to use |
AAF | This is a package for constructing phylogeny without doing alignment or assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ABCtoolbox | BCtoolbox is a general-purpose program to perform Approximate Bayesian Computation. ABCtoolbox can be used for ABC inference on almost any type of model, including models arising in physics, biology or engineering. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ABySS | ABySS (Assembly By Short Sequences) is a de novo, parallel, paired-end sequence assembler that is designed for short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AC-DIAMOND | AC-DIAMOND attempts to speed up DIAMOND via better SIMD parallelization and compressed indexing. Experimental results show that AC-DIAMOND was about 6~7 times faster than DIAMOND on aligning DNA reads or contigs while retaining the essentially the similar sensitivity. AC-DIAMOND was developped based on DIAMOND v0.7.9. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ACACIA | Allele CAlling proCedure for Illumina Amplicon sequencing data: This workflow aims at extracting allele information out of paired-end Illumina FASTQC files. |
Genobioinfo Cluster: How to use |
Accel-align | Accel-align is a fast alignment tool implemented in C++ programming language. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ACFS | Accurate CircRNA Finder Suite. Discovering circRNAs from RNA-Seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AdapterRemoval | This program was developed to remove residual adapter sequences from next generation sequencing reads. The program handles both single end and paired end data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
adegenet | R package dedicated to the exploratory analysis of genetic data. It implements a set of tools ranging from multivariate methods to spatial genetics and genome-wise SNP data analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ADMIXTOOLS | ADMIXTOOLS (Patterson et al. 2012) is a software package that supports formal tests of whether admixture occurred, and makes it possible to infer admixture proportions and dates. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Admixture | ADMIXTURE is a software tool for maximum likelihood estimation of individual ancestries from multilocus SNP genotype datasets. It uses the same statistical model as STRUCTURE but calculates estimates much more rapidly using a fast numerical optimization algorithm. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AGAT | Another Gff Analysis Toolkit: suite of tools to handle gene annotations in any GTF/GFF format. Some examples what AGAT can do: standardise any GTF/GFF file into a comprehensive GFF3 format (script with agat_sp prefix): add missing parent features (e.g. gene and mRNA if only CDS/exon exist). add missing features (e.g. exon and UTR). add missing mandatory attributes (i.e. ID, Parent). fix identifier to be uniq. fix feature location. remove duplicated features. group related features (if spread in different places in the file). sort features. merge overlapping loci into one single locus (only if option activated). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ALDER | The ALDER software computes the weighted linkage disequilibrium (LD) statistic for making inference about population admixture |
Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
ALFATClust | ALignment-Free Adaptive Threshold Clustering:Biological sequence clustering tool with dynamic threshold for individual clusters. Suitable for clustering multiple groups of homologous sequences. | Genobioinfo Cluster: How to use |
Alfred | BAM Statistics, Feature Counting and Annotation | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AlleleSeq | pipeline which constructs a diploid personal genome from genomic sequence variants of a family trio, including SNPs, indels and structural variants and maps functional genomic data onto this personal genome. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
ALLHiC | Phasing and scaffolding polyploid genomes based on Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AllPaths-LG | ALLPATHS-LG is a whole genome shotgun assembler that can generate high quality genome assemblies using short reads (~100bp) such as those produced by the new generation of sequencers. The significant difference between ALLPATHS and traditional assemblers such as Arachne is that ALLPATHS assemblies are not necessarily linear, but instead are presented in the form of a graph. This graph representation retains ambiguities, such as those arising from polymorphism, uncorrected read errors, and unresolved repeats, thereby providing information that has been absent from previous genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Alphafold2-Pytorch | An unofficial working Pytorch implementation of Alphafold2, a 3D protein predictor. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AlphaImpute | AlphaImpute is a software package for imputing and phasing genotype data in diploid populations with pedigree information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AMAS | Calculate summary statistics and manipulate multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AMOS | A Modular, Open-Source whole genome assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ampliconnoise | AmpliconNoise is a collection of programs for the removal of noise from 454 sequenced PCR amplicons. It involves two steps the removal of noise from the sequencing itself and the removal of PCR point errors. This project also includes the Perseus algorithm for chimera removal. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
AmpliSAT | AmpliSAT (Amplicon Sequencing Analysis Tools) are a set of online tools that make easy the analysis of Amplicon Sequencing experiments. |
Genobioinfo Cluster: How to use |
ancIBD | Identify IBD segments between pairs of individuals in ancient human DNA data. The software package Software documentation here. |
Genobioinfo Cluster: How to use |
ANGEL | Robust Open Reading Frame prediction (ANGLE re-implementation) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ANGSD | ANGSD is a software for analyzing next generation sequencing data. The software can handle a number of different input types from mapped reads to imputed genotype probabilities. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AnnotSV | An integrated tool for Structural Variations annotation and ranking. |
Genobioinfo Cluster: How to use |
ANNOVAR | ANNOVAR is an efficient software tool to utilize update-to-date information to functionally annotate genetic variants detected from diverse genomes (including human genome hg18, hg19, hg38, as well as mouse, worm, fly, yeast and many others) | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
Anvio | Anvi’o is an analysis and visualization platform for ‘omics data. It brings together many aspects of today’s cutting-edge genomic, metagenomic, and metatranscriptomic analysis practices to address a wide array of needs. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ApoplastP | ApoplastP is a machine learning method for predicting localization of proteins to the plant apoplast. ApoplastP can distinguish non-apoplastic proteins from apoplastic proteins for both plant proteins and pathogen proteins. In particular, ApoplastP can predict if an effector localizes to the plant apoplast. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Apptainer | Apptainer is an open source container platform designed to be simple, fast, and secure. | Genobioinfo Cluster: How to use |
Apscale | Advanced Pipeline for Simple yet Comprehensive AnaLysEs of DNA metabarcoding data |
Genobioinfo Cluster: How to use |
Aquila | Diploid personal genome assembly and comprehensive variant detection based on linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARAGORN | ARAGORN is a program to detect tRNA genes and tmRNA genes in nucleotide sequence | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARBitR | ARBitR is an overlap aware genome assembly scaffolder for linked sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARC | ARC is a pipeline which facilitates iterative, reference guided de novo assemblies with the intent of: - Reducing time in analysis and increasing accuracy of results by only considering those reads which should assemble together. - Reducing/removing reference bias as compared to mapping based approaches. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARCS | Scaffolding genome sequence assemblies using 10X Genomics GemCode/Chromium data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARKS | Scaffolding genome sequence assemblies using 10X Genomics GemCode/Chromium data. This project is a new kmer-based (alignment free) implementation of ARCS. It provides improved runtime performance over the original ARCS implementation by removing the requirement to perform alignments with bwa mem. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Armatus | Multiresolution domain calling software for chromosome conformation capture interaction matrices. Armatus is a Topologically Associated Domain caller. Follow the Web page to know more about Armatus. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Arriba | Arriba is a command-line tool for the detection of gene fusions from RNA-Seq data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ArrowGrid | The distribution is a parallel wrapper around the Arrow consensus framework within the SMRT Analysis Software |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ART | ART is a set of simulation tools to generate synthetic next-generation sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASGART | ASGART (A Segmental duplications Gathering and Refinement Tool) is a multiplatform (GNU/Linux, macOS, Windows) tool designed to search for large duplications amongst one or two DNA strands. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASHURE | Python-based pipeline for analyzing Nanopore sequencing metabarcoding data. ASHURE can take a reference database in order to improve accuracy. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASMC | Ascertained Sequentially Markovian Coalescent (contains ASMC and an extension, FastSMC, together with python bindings for both) |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
assemblathon2 | This repo contains a motley assortment of unpublished scripts and commands used by Ian Korf, Keith Bradnam, and Joe Fass in the analysis of Assemblathon 2 competition entries (assemblies). |
Genobioinfo Cluster: How to use |
assembly-stats | Get assembly statistics from FASTA and FASTQ files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Assemblytics | Assemblytics is a bioinformatics tool to detect and analyze structural variants from a genome assembly by comparing it to a reference genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Assexon | Assembling Exon Using Gene Capture Data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASTER | A family of ASTRAL-like algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ASTRAL | ASTRAL is a tool for estimating an unrooted species tree given a set of unrooted gene trees. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ASTRAL-Pro | ASTRAL-Pro stands for ASTRAL for PaRalogs and Orthologs. ASTRAL is a tool for estimating an unrooted species tree given a set of unrooted gene trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
atac dnase pipelines | ATAC-seq and DNase-seq processing pipeline. This pipeline is designed for automated end-to-end quality control and processing of ATAC-seq or DNase-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Atropos | Atropos is tool for specific, sensitive, and speedy trimming of NGS reads. It is a fork of Cutadapt read trimmer. | Genologin Cluster: In Python-3.6.3 Genobioinfo Cluster: Ask for Install |
Augustus | Augustus is a program that predicts genes in eukaryotic genomic sequences |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AutoHiC | AutoHiC is a deep learning tool that uses Hi-C data to support genome assembly. It can automatically correct errors during genome assembly and generate genomes at the chromosome level. |
Genobioinfo Cluster: How to use |
AvP | AvP performs automatic detection of HGT candidates within a phylogenetic framework. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
awscli | The AWS Command Line Interface (AWS CLI) is an open source tool that enables you to interact with AWS services using commands in your command-line shell. | Genobioinfo Cluster: How to use |
Back_to_sequences | Given a set of kmers (fasta / fastq <.gz> format) and a set of sequences (fasta / fastq <.gz> format), this tool will extract the sequences containing some of those kmers. |
Genobioinfo Cluster: How to use |
BAli-Phy | BAli-Phy is software by Ben Redelings that estimates multiple sequence alignments and evolutionary trees from DNA, amino acid, or codon sequences. It uses likelihood-based evolutionary models of substitutions and insertions and deletions to place gaps. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bam2plot | Make coverage plots from bam files. |
Genobioinfo Cluster: How to use |
BamBam | several simple-to-use tools to facilitate NGS analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BAMM | A program for multimodel inference on speciation and trait evolution. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BAMscorer | BAMscorer can be used to conduct genomic assignment tests from BAM files. Assignments can be done on genomic regions, inversions, and whole-genome datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bamstats (notsame as BAMstats) | Bamstats is a command line tool written in Go for computing mapping statistics from a BAM file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bamtofastq | Tool for converting 10x BAMs produced by Cell Ranger, Space Ranger, Cell Ranger ATAC, Cell Ranger DNA, and Long Ranger back to FASTQ files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bamtools | BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bamUtil | bamUtil is a repository that contains several programs that perform operations on SAM/BAM files. All of these programs are built into a single executable, bam. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bandage-NG | Bandage-NG is a GUI program that allows users to interact with the assembly graphs made by de novo assemblers such as SPAdes, MEGAHIT and others. |
Genobioinfo Cluster: How to use |
Barrnap | Barrnap predicts the location of ribosomal RNA genes in genomes. It supports bacteria (5S,23S,16S), archaea (5S,5.8S,23S,16S), mitochondria (12S,16S) and eukaryotes (5S,5.8S,28S,18S). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayesAss3-SNPs | Modification of BayesAss 3.0.4 to allow handling of large SNP datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BayeScan | Detecting natural selection from population-bases genetic data using differences in alleles frequencies between populations. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BayeScEnv | BayeScEnv is a Fst-based, genome-scan method that uses environmental variables to detect local adaptation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayesTraits | BayesTraits is a computer package for performing analyses of trait evolution among groups of species for which a phylogeny or sample of phylogenies is available. This new package incoporates our earlier and separate programes Multistate, Discrete and Continuous. BayesTraits can be applied to the analysis of traits that adopt a finite number of discrete states, or to the analysis of continuously varying traits. Hypotheses can be tested about models of evolution, about ancestral states and about correlations among pairs of traits. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayPass | The package BayPass is a population genomics software which is primarily aimed at identifying genetic markers subjected to selection and/or associated to population-specific covariates (e.g., environmental variables, quantitative or categorical phenotypic characteristics). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BBMap | a short read aligner, as well as various other bioinformatic tools. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BCALM2 | A bioinformatics tool for constructing the compacted de Bruijn graph from sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BCFtools | utilities for variant calling and manipulating VCFs and BCFs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bcl2fastq | The Bcl2FastQ conversion software is a new tool to handle bcl conversion and demultiplexing of both unzipped and zipped bcl files, which have reduced footprint and were introduced as an optional output of the HCS Software version 2.0 |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BCOOL | BCOOL is a read corrector for NGS sequencing data that align reads on a de Bruijn graph. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Beagle | BEAGLE is a state of the art software package for analysis of large-scale genetic data sets with hundreds of thousands of markers genotyped on thousands of samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Beagle_Utilities | Simple utility programs for manipulating text files, especially VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Beagle-lib | BEAGLE-lib is a high-performance library that can perform the core calculations at the heart of most Bayesian and Maximum Likelihood phylogenetics packages |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BEAST | BEAST is a software package for phylogenetic analysis with an emphasis on time-scaled trees. BEAST is a cross-platform program for Bayesian analysis of molecular sequences using MCMC. It is entirely orientated towards rooted, time-measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to average over tree space, so that each tree is weighted proportional to its posterior probability. We include a simple to use user-interface program for setting up standard analyses and a suit of programs for analysing the results. |
Genobioinfo Cluster: How to use |
BEAST2 | BEAST 2 is a cross-platform program for Bayesian phylogenetic analysis of molecular sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BEDOPS | BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bedtools | The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BELLA | A computationally-efficient and highly-accurate long-read to long-read aligner and overlapper. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BeXY | BeXY is a tool to jointly infer sex karyotypes and sex-linked scaffolds from read count data. It can also be used to genetically sex single individuals. BeXY is a command-line tool, and we provide an easy-to-use R package to visualize and parse the results. |
Genobioinfo Cluster: How to use |
bgc | bgc implements Bayesian estimation of genomic clines to quantify introgression at many loci. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bifrost | Highly parallel construction and indexing of colored and compacted de Bruijn graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BIG-SCAPE | Biosynthetic Genes Similarity Clustering and Prospecting Engine. Defines a distance metric between Gene Clusters using a combination of three indices (Jaccard Index of domain types, Domain Sequence Similarity the Adjacency Index) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BigDataScript | BigDataScript is intended as a scripting language for big data pipeline | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bioawk | Bioawk is an extension to Brian Kernighan's awk, adding the support of several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q and TAB-delimited formats with column names. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
biohazard-tools | This is a collection of command line utilities that do useful stuff involving BAM files for Next Generation Sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Biopieces | The Biopieces are a collection of bioinformatics tools that can be pieced together in a very easy and flexible manner to perform both simple and complex tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BIOPYTHON | Biopython is a set of freely available tools for biological computation written in Python by an international team of developers. | Genologin Cluster: In Python-3.6.3 Genobioinfo Cluster: Ask for Install |
BiSCoT | BiSCoT is a tool that aims to improve the contiguity of scaffolds and contigs generated after a Bionano scaffolding. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BISCUIT | BISulfite-seq CUI Toolkit (BISCUIT) is a utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data. |
Genobioinfo Cluster: How to use |
Bismark | A tool to map bisulfite converted sequence reads and determine cytosine methylation states |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BisSNP | Accurate combined SNP/Methylation calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Blasr | Reference-based alignment | Genologin Cluster: How to use ( See SMRTLink) Genobioinfo Cluster: Ask for Install |
blat | The BLAST-Like Alignment Tool: similarity search in databanks. BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more. BLAT on proteins finds sequences of 80% and greater similarity of length 20 amino acids or more. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BlobTools | A modular command-line solution for visualisation, quality control and taxonomic partitioning of genome datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BlockClust | BlockClust is an efficient approach to detect transcripts with similar processing patterns. We propose a novel way to encode expression profiles in compact discrete structures, which can then be processed using fast graph-kernel techniques. BlockClust allows both clustering and classification of small non-coding RNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BOLDigger | Python program to query .fasta files against the different databases of www.boldsystems.org |
SLURM Cluster: How to use Genobioinfo Cluster: How to use |
BOLDigger2 | An even better Python program to query .fasta files against the COI database of www.boldsystems.org |
Genobioinfo Cluster: How to use |
Bowtie | Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at a rate of over 25 million 35-bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bowtie2 | Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BPP | Bayesian analysis of genomic sequence data under the multispecies coalescent model. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bracken | Bracken (Bayesian Reestimation of Abundance with KrakEN) is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Braker | BRAKER(1,2,3) is a tool for fully automated genome annotation with GeneMark-ET and AUGUSTUS |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BreakDancer | SV detection from paired end reads mapping. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
breseq | breseq is a computational pipeline for finding mutations relative to a reference sequence in short-read DNA re-sequencing data for haploid microbial-sized genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bridger | Bridger is an efficient de novo trascriptome assembler for RNA-Seq data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BS-Seeker2- | BS-Seeker2 is a seamless and versatile pipeline for accurately and fast mapping the bisulfite-treated reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BS-SNPer | BS-SNPer is an ultrafast and memory-efficient package, a program for BS-Seq variation detection from alignments in standard BAM/SAM format using approximate Bayesian modeling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BSMAP | BSMAP is a short reads mapping software for bisulfite sequencing reads. Bisulfite treatment converts unmethylated Cytosines into Uracils (sequenced as Thymine) and leave methylated Cytosines unchanged, hence provides a way to study DNA cytosine methylation at single nucleotide resolution. BSMAP aligns the Ts in the reads to both Cs and Ts in the reference | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Btrim | A fast and accurate adapter, barcodes, and low-quality region trimming and binning program written in C for next-generating sequencing reads. The search algorithm is based on Eugene Myers' fast bit-vector algorithm. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BUCKy | BUCKy is a free program to combine molecular data from multiple loci. BUCKy estimates the dominant history of sampled individuals, and how much of the genome supports each relationship, using Bayesian concordance analysis. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BUSCO | BUSCO v2 provides quantitative measures for the assessment of genome assembly, gene set, and transcriptome completeness, based on evolutionarily-informed expectations of gene content from near-universal single-copy orthologs selected from OrthoDB v9. BUSCO assessments are implemented in open-source software, with a large selection of lineage-specific sets of Benchmarking Universal Single-Copy Orthologs. These conserved orthologs are ideal candidates for large-scale phylogenomics studies, and the annotated BUSCO gene models built during genome assessments provide a comprehensive gene predictor training set for use as part of genome annotation pipelines. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BUSCO_phylogenomics | This is a Python pipeline to construct species phylogenies using BUSCO proteins. It works directly from BUSCO output and can generate concatenated supermatrix alignments and also gene trees of BUSCO families. The pipeline identifies BUSCO proteins that are complete and single-copy in all input samples. Alternatively, you can account for missing data and choose to include BUSCO proteins that are complete and single-copy in a certain percentage of input samples. Each BUSCO family is individually aligned, trimmed, and then concatenated together to generate a supermatrix alignment. The pipeline also identifies BUSCO proteins that are complete and single-copy in at least 4 input samples, and generates gene trees for each of these families. |
Genobioinfo Cluster: How to use |
busco2fasta | A script to turn a set of BUSCO results into a directory of multisequence FASTA files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bustools | bustools is a program for manipulating BUS files for single cell RNA-Seq datasets. It can be used to error correct barcodes, collapse UMIs, produce gene count or transcript compatibility count matrices, and is useful for many other tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bwa | Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome. It implements two algorithms, bwa-short and BWA-SW. The former works for query sequences shorter than 200bp and the latter for longer sequences up to around 100kbp. Both algorithms do gapped alignment. They are usually more accurate and faster on queries with low error rates. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bwa-mem2 | Bwa-mem2 is the next version of the bwa-mem algorithm in bwa. It produces alignment identical to bwa and is ~80% faster. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bwa-meth | Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bwtools | bwtool is a command-line utility for bigWig files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cabal | Cabal is the standard package system for Haskell software. It helps people to configure, build and install Haskell software and to distribute it easily to other users and developers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cactus | Cactus is a reference-free whole-genome alignment program, as well as a pagenome graph construction toolkit. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAFE | Software for Computational Analysis of gene Family Evolution. The purpose of CAFE is to analyze changes in gene family size in a way that accounts for phylogenetic history and provides a statistical foundation for evolutionary inferences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAID | The CAID software produces all outputs necessary for Critical Assessment of Intrinsic Disorder (CAID) edition, including baselines, references, metrics and plots, starting from predictions and a reference (see Data Availability section to know how to obtain this data). |
Genobioinfo Cluster: How to use |
CAMI-AMBER | AMBER is an evaluation package for the comparative assessment of genome reconstructions and taxonomic assignments from metagenome benchmark datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAMISIM | CAMISIM is a software to model abundance distributions of microbial communities and to simulate corresponding shotgun metagenome datasets. |
Genobioinfo Cluster: How to use |
canu | A single molecule sequence assembler for genomes large and small. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAP3 | A DNA Sequence Assembly Program | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CARNAC-LR | Clustering coefficient-based Acquisition of RNA Communities in Long Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Carthagene | CarthaGene is a genetic/radiated hybrid mapping software. CarthaGene looks for multiple populations maximum likelihood consensus maps using a fast EM algorithm for maximum likelihood estimation and powerful ordering algorithms. CarthaGene can handle data made up of several distinct populations which t may each be either F2 backcross, recombinant inbred lines, F2 t intercross, phase known outbreds and/or radiated hybrids (haploid t and diploid data). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAT | This project aims to provide a straightforward end-to-end pipeline that takes as input a HAL-format multiple whole genome alignment as well as a GFF3 file representing annotations on one high quality assembly in the HAL alignment, and produces a output GFF3 annotation on all target genomes chosen. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CCMetagen | CCMetagen processes sequence alignments produced with KMA, which implements the ConClave sorting scheme to achieve highly accurate read mappings. CCMetagen processes sequence alignments produced with KMA, which implements the ConClave sorting scheme to achieve highly accurate read mappings. CCMetagen produces ranked taxonomic results in user-friendly formats that are ready for publication or downstream statistical analyses. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cctools | The Cooperative Computing Tools (cctools) enable large scale distributed computations to harness hundreds to thousands of machines from clusters, clouds, and grids. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cd-hit | CD-HIT stands for Cluster Database at High Identity with Tolerance. The program (cd-hit) takes a fasta format sequence database as input and produces a set of 'non-redundant' (nr) representative sequences as output. In addition cd-hit outputs a cluster file, documenting the sequence 'groupies' for each nr sequence representative. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cdbfasta | This is a brief introduction to a couple of platform independent file-based hashing tools (cdbfasta and cdbyank) that can be used for creating indices for quick retrieval of any particular sequences from large multi-FASTA files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cDNA_Cupcake | cDNA_Cupcake is a miscellaneous collection of Python and R scripts used for analyzing sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CEGMA | CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger | Cell Ranger is a set of analysis pipelines that processes Chromium single cell 3’ RNA-seq output to align reads, generate gene-cell matrices and perform clustering and gene expression analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger ARC | Cell Ranger ARC's pipelines analyze sequencing data produced from Chromium Single Cell Multiome ATAC + Gene Expression. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger ATAC | Cell Ranger ATAC is a set of analysis pipelines that process Chromium Single Cell ATAC data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CellRanger DNA | Cell Ranger DNA is a set of analysis pipelines that process Chromium single cell DNA sequencing output to align reads, identify copy number variation (CNV), and compare heterogeneity among cells. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CENSOR | CENSOR compares and masks protein or nucleotide sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Centrifuge | Classifier for metagenomic sequences. Centrifuge is a novel microbial classification engine that enables rapid, accurate and sensitive labeling of reads and quantification of species on desktop computers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cgMLSTFinder | Core genome Multi-Locus Sequence Typing cgMLSTFinder runs KMA <1> against a chosen core genome MLST (cgMLST) database and outputs the detected alleles in a matrix file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CheckM | Assess the quality of microbial genomes recovered from isolates, single cells, and metagenomes. |
Genobioinfo Cluster: How to use |
CheckM2 | Assessing the quality of metagenome-derived genome bins using machine learning. |
Genobioinfo Cluster: How to use |
CHEUI | CHEUI (Methylation (CH3) Estimation Using Ionic current) is an RNA modification detection software for Oxford Nanopore direct RNA sequencing data. CHEUI can be used to detect m6A and m5C in individual reads at single-nucleotide resolution from any sample (e.g. single condition), or detect differential m6A or m5C between any two conditions. CHEUI uses a two-stage deep learning method to detect m6A and m5C transcriptome-wide at single-read and single-site resolution in any sequence context (i.e. without any sequence constrains). |
Genobioinfo Cluster: How to use |
chimerascan | chimerascan is a software package that detects gene fusions in paired-end RNA sequencing (RNA-Seq) datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ChimPipe | ChimPipe is a computational method for the detection of novel transcription-induced chimeric transcripts and fusion genes from Illumina Paired-End RNA-seq data. It combines junction spanning and paired-end read information to accurately detect chimeric splice junctions at base-pair resolution. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
chopper | This tool, intended for long read sequencing such as PacBio or ONT, filters and trims a fastq file. chopper is a tool that reunites the now outdated softwares NanoFilt and NanoLyse. It permits to filter QC files and has a faster execution time than NanoFilt and NanoLyse.
|
Genobioinfo Cluster: How to use |
ChopStitch | Exon annotation and splice graph construction using transcriptome assembly and whole genome sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
chromeister | A dotplot generator for large chromosomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Chromonomer | Chromonomer is a program designed to integrate a genome assembly with a genetic map. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Chromosight | Python package to detect chromatin loops (and other patterns) in Hi-C contact maps. |
Genobioinfo Cluster: How to use |
CIRCexplorer2 | CIRCexplorer2 is a comprehensive and integrative circular RNA analysis toolset |
Genobioinfo Cluster: How to use |
circfull | A tool to detect and quantify full-length circRNA isoforms from circFL-seq. |
Genobioinfo Cluster: How to use |
Circlator | A tool to circularize genome assemblies | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
circos | Circos is a software package for visualizing data and information. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
circtools | A modular, python-based framework for circRNA-related tools that unifies several functionalities in a single, command line driven software. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Circuitscape | Circuitscape borrows algorithms from electronic circuit theory to predict patterns of movement, gene flow, and genetic differentiation among plant and animal populations in heterogeneous landscapes. |
Genobioinfo Cluster: How to use |
CIRI | CIRI (circRNA identifier) is a novel chiastic clipping signal based algorithm, which can unbiasedly and accurately detect circRNAs from transcriptome data by employing multiple filtration strategies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CIRI-long | Circular RNA Identification for Long-Reads Nanopore Sequencing Data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CITE-seq-Count | A tool that allows to get UMI counts from a single cell protein assay. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Clair3 | Clair3 is a germline small variant caller for long-reads. Clair3 makes the best of two major method categories: pileup calling handles most variant candidates with speed, and full-alignment tackles complicated candidates to maximize precision and recall. Clair3 runs fast and has superior performance, especially at lower coverage. Clair3 is simple and modular for easy deployment and integration. |
Genobioinfo Cluster: How to use |
CleaveLand4 | Analysis of degradome data to find sliced miRNA and siRNA targets | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ClinSV | Genobioinfo Cluster: How to use | |
ClipAndMerge | Clip&Merge is a tool to clip off adapters from sequencing reads and merge overlapping paired end reads together. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ClonalFrameML | A software package that performs efficient inference of recombination in bacterial genomes. |
Genobioinfo Cluster: How to use |
CLUMPAK | Clustering Markov Packager Across K - was developed in order to aid users analyse the results of STRUCTURE-like programs. The software offers a few alternative modes of action, please go to the Help section for detailed about these modes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Clustal Omega | Clustal Omega is the latest addition to the Clustal family. It offers a significant increase in scalability over previous versions, allowing hundreds of thousands of sequences to be aligned in only a few hours. It will also make use of multiple processors, where present. In addition, the quality of alignments is superior to previous versions, as measured by a range of popular benchmarks | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ClustalW | Multiple sequence alignment program for DNA or proteins. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CMfinder | CMfinder is a RNA motif prediction tool. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cnD | cnD is a program to detect copy number variants from short-read sequence data. |
Genobioinfo Cluster: How to use |
CNVkit | A command-line toolkit and Python library for detecting copy number variants and alterations genome-wide from high-throughput sequencing. CNVkit is a Python library and command-line software toolkit to infer and visualize copy number from high-throughput DNA sequencing data. It is designed for use with hybrid capture, including both whole-exome and custom target panels, and short-read sequencing platforms such as Illumina and Ion Torrent. Read the full documentation at: http://cnvkit.readthedocs.io |
Genobioinfo Cluster: How to use |
CNVnator | A tool for CNV discovery and genotyping from depth of read mapping. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cnvpipelines | A pipeline to detect copy number variations (CNV) on several samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cogent | Cogent is a tool for reconstructing the coding genome using high-quality full-length transcriptome sequences. It is designed to be used on Iso-Seq data and in cases where there is no reference genome or the ref genome is highly incomplete. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ColabFold | ColabFold is an easy-to-use Notebook based environment for fast and convenient protein structure predictions. Its structure prediction is powered by AlphaFold2 and RoseTTAFold combined with a fast multiple sequence alignment generation stage using MMseqs2. |
Genobioinfo Cluster: How to use |
COMEBin | COMEBin allows effective binning of metagenomic contigs using COntrastive Multi-viEw representation learning. |
Genobioinfo Cluster: How to use |
Comp-D | A program for comprehensive computation of D-statistics and population summaries (serial version). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CONCOCT | A program for unsupervised binning of metagenomic contigs by using nucleotide composition, coverage data in multiple samples and linkage data from paired end reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Concrete Autoencoders | The concrete autoencoder is an end-to-end differentiable method for global feature selection, which efficiently identifies a subset of the most informative features and simultaneously learns a neural network to reconstruct the input data from the selected features. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Consensify | Consensify is a method for generating a consensus pseudohaploid genome sequence with greatly reduced error rates compared to standard pseudohaploidisation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CONSENT | CONSENT (sCalable self-cOrrectioN of long reads with multiple SEquence alignmeNT) is a self-correction method for long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Conterminator | Conterminator is an efficient method for detecting incorrectly labeled sequences across kingdoms by an exhaustive all-against-all sequence comparison. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ContextMap2 | Fast and accurate context-based RNA-seq mapping. ContextMap determines the most likely origin of a read by evaluating the context of the read in the form of alignments of other reads to the same genomic region. In the original implementation, the focus was on improving initial mappings provided by other mapping tools. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CONTRAST | CONTRAST predicts protein-coding genes from a multiple genomic alignment using a combination of discriminative machine learning techniques. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ContScout | ContScout is a pipeline developed for the identification and removal of contaminating sequences in draft genomes. |
Genobioinfo Cluster: How to use |
Cooler | Cooler is a support library for a sparse, compressed, binary persistent storage format, also called cooler, used to store genomic interaction data, such as Hi-C contact matrices. |
Genobioinfo Cluster: How to use |
coolpuppy | A versatile tool to perform pile-up analysis on Hi-C data in .cool format. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CRABS | CRABS (Creating Reference databases for Amplicon-Based Sequencing) is a versatile software program that generates curated reference databases for metagenomic analysis. |
Genobioinfo Cluster: How to use |
CRISP | CRISP is a software program to detect SNPs and short indels from pooled sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CRISPOR | CRISPOR predicts off-targets in the genome, ranks guides, highlights problematic guides, designs primers and helps with cloning. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CroCo | A program to detect potential cross contaminations in HTS assembled transcriptomes using expression level quantification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
csem | CSEM is a ChIP-Seq multi-read allocator. CSEM stands for ChIP-Seq multi-read allocation using Expectation-Maximization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
csvtk | A cross-platform, efficient and practical CSV/TSV toolkit in Golang. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cufflinks | Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. It accepts aligned RNA-Seq reads and assembles the alignments into a parsimonious set of transcripts. Cufflinks then estimates the relative abundances of these transcripts based on how many reads support each one, taking into account biases in library preparation protocols. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
currentNE | Estimation of current effective population using artificial neural networks. |
Genobioinfo Cluster: How to use |
cutadapt | Cutadapt removes adapter sequences from DNA high-throughput sequencing data. This is usually necessary when the read length of the machine is longer than the molecule that is sequenced, such as in microRNA data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cuteSV | Long read based human genomic structural variation detection with cuteSV. |
Genobioinfo Cluster: How to use |
d2SBin | Improving the binning of metagenomic contigs on d2S oligonucleotide frequency dissimilarity | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
dadi | dadi implements a method for demographic inference from genetic data, based on a diffusion approximation to the allele frequency spectrum. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DALIGNER | The commands below permit one to find all significant local alignments between reads encoded in Dazzler database. The assumption is that the reads are from a PACBIO RS IIlong read sequencer. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DamageProfiler | A Java based tool to determine damage patterns on ancient DNA as a replacement for mapDamage. DamageProfiler calculates damage profiles of mapped reads and provides a graphical as well as text based representation. It creates
|
Genobioinfo Cluster: How to use |
DAmar | Long read QC, assembly and scaffolding pipeline for PacBio or Oxford Nanopore long-read sequencing data. T he pipeline produces a number of QC metrics at various stages as well as incorporating further technologies including Bionano, 10x and HiC data to scaffold the created contigs. DAmar, is a hybrid of the earlier Marvel, Dazzler, and Daccord systems of the Eugene Myers lab. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DANPOS2 | A toolkit for Dynamic Analysis of Nucleosome and Protein Occupancy by Sequencing, version 2 | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DARIC | A complete framework for identifying quantitatively differential compartments from Hi-C and Micro-C data.
|
Genobioinfo Cluster: How to use |
DAS_Tool | An automated method that integrates the results of a flexible number of binning algorithms to calculate an optimized, non-redundant set of bins from a single assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Dasel | Dasel (short for data-selector) allows you to query and modify data structures using selector strings. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
datamash | GNU datamash is a command-line program which performs basic numeric, textual and statistical operations on input textual data files. |
Genologin Cluster: How to use Genobioinfo Cluster: default system |
DATES | DATES (Distribution of Ancestry Tracts of Evolutionary Signals) is a method to estimate the time of admixture in ancient DNA samples described in Narasimhan, Patterson et al. 2018 | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DAZZ_DB | To facilitate the multiple phases of the dazzler assembler, we organize all the read data into what is effectively a "database" of the reads and their meta-information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
dbcAmplicons | Analysis of Double Barcoded Illumina Amplicon Data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
dbCAN3 | run_dbcan (dbCAN3) is the standalone version of the dbCAN3 annotation tool for automated CAZyme annotation. |
Genobioinfo Cluster: How to use |
DBG2OLC | The genome assembler that reduces the computational time of human genome assembly from 400,000 CPU hours to 2,000 CPU hours, utilizing long erroneous 3GS sequencing reads and short accurate NGS sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DBSCAN-SWA | An integrated tool for rapid prophage detection and annotation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeChat | Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat. |
Genobioinfo Cluster: How to use |
decOM | decOM is a high-accuracy microbial source tracking method that is suitable for contamination quantification in paleogenomics, namely the analysis of collections of possibly contaminated ancient oral metagenomic data sets. |
Genobioinfo Cluster: How to use |
DeconSeq | Detect and remove contaminations from your sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DECX | This is the DECX (DEC eXtended) model for historical biogeographic inference | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeDup | A merged read deduplication tool capable to perform merged read deduplication on single end data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Deepbinner | Deepbinner is a tool for demultiplexing barcoded Oxford Nanopore sequencing reads. It does this with a deep convolutional neural network classifier, using many of the architectural advances that have proven successful in image classification. Unlike other demultiplexers (e.g. Albacore and Porechop), Deepbinner identifies barcodes from the raw signal (a.k.a. squiggle) which gives it greater sensitivity and fewer unclassified reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeepSignal | Detecting methylation using signal-level features from Nanopore sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
deepTools | Tools to process and analyze deep sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DeepVariant | DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. |
Genobioinfo Cluster: How to use |
Delly | DELLY is an integrated structural variant prediction method that can detect deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read massively parallel sequencing data. It uses paired-ends and split-reads to sensitively and accurately delineate genomic rearrangements throughout the genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
demuxlet | Genetic multiplexing of barcoded single cell RNA-seq |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DENTIST | DENTIST is a sensitive, highly-accurate and automated pipeline method to close gaps in (short read) assemblies with long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DESMAN | De novo Extraction of Strains from MetAgeNomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
detettore | A program to detect transposable element polymorphisms | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
devCellPy | devCellPy is a Python package designed for hierarchical multilayered classification of cells based on single-cell RNA-sequencing (scRNA-seq). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DFE-alpha | DFE-alpha was initially written to estimate the distribution of fitness effects (DFE) of new deleterious mutations using within-species nucleotide polymorphism data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DIAMOND | Accelerated BLAST compatible local sequence aligner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DIAMOND2GO | Diamond2GO is a set of tools that can rapidly assign gene ontology and perform enrichment for functional genomics. | Genobioinfo Cluster: How to use |
diffTF | Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressor. |
Genobioinfo Cluster: How to use |
DinuQ | The DinuQ (Dinucleotide Quantification) Python3 package provides a range of metrics for quantifying nucleotide, dinucleotide and synonymous codon representation in genetic sequences. | Genologin Cluster: Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Discovar | Assemble genomes and find variants with DISCOVAR & DISCOVAR de novo | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DIYABC | A user-friendly approach to Approximate Bayesian Computation for inference on population history using molecular markers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DLCpar | DLCpar is a reconciliation method for inferring gene duplications, losses, and coalescence (accounting for incomplete lineage sorting). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
drap | De novo RNA-seq Assembly Pipeline |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
dRep | dRep is a python program for rapidly comparing large numbers of genomes. dRep can also "de-replicate" a genome set by identifying groups of highly similar genomes and choosing the best representative genome for each genome set. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DSK | DSK is a k-mer counting software, similar to Jellyfish. DSK supports large values of k, and runs with (almost-)arbitrarily low memory usage and reasonably low temporary disk usage. DSK can count k-mers of large Illumina datasets on laptops and desktop computers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Dsuite | Fast calculation of Paterson's D (ABBA-BABA) and the f4-ratio statistics across many populations/species |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Dysgu | dysgu-SV is a collection of tools for calling structural variants using short or long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
E2P2 | Ensemble Enzyme Prediction Pipeline. |
Genobioinfo Cluster: How to use |
Eagle | The Eagle software estimates haplotype phase either within a genotyped cohort or using a phased reference panel. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EarlGrey | A fully automated TE curation and annotation pipeline. |
Genobioinfo Cluster: How to use |
ecoPCR | ecoPCR is an electronic PCR software developed by LECAand Helix-Project . It helps you to estimate Barcode primers quality. In conjunction with OBItools, you can postprocess ecoPCR output to compute barcode coverage and barcode speci?city. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ecoPrimers | ecoPrimer is a barcoding software which is written in C language. It finds universal primers from a set of input DNA sequences by finding conserved regions without "a priori" on candidate sequences. It also evaluates the quality of the primers and barcode regions by measuring the "barcode specificity" and "barcode coverage" indices | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EDirect | Entrez Direct (EDirect) provides access to the NCBI's suite of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EDTA | This package is developed for automated whole-genome de-novo TE annotation and benchmarking the annotation performance of TE libraries. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EEMS | EEMS method for analyzing and visualizing spatial population structure from geo-referenced genetic samples. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EffectorP | EffectorP is a machine learning method for fungal effector prediction in secretomes and has been trained to distinguish secreted proteins from secreted effectors in plant-pathogenic fungi. |
Genobioinfo Cluster: How to use |
EGA_download_client | The EgaDemoClient is a JAVA based data streamer that enables EGA account holders to securely download files and datasets, either through an interactive shell (IS) or using direct command line mode (DCLM). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EggLib | EggLib is a C++/Python library and program package for evolutionary genetics and genomics. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
eggNog-mapper | eggnog-mapper is a tool for fast functional annotation of novel sequences. It uses precomputed orthologous groups and phylogenies from the eggNOG database to transfer functional information from fine-grained orthologs only. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Eigen | Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Eigensoft | The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification correction method (Price et al. 2006). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ELAI | The software performs local ancestry inference for admixed individuals. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ELECTOR | EvaLuation of Error Correction Tools for lOng Reads. |
|
elPrep | elPrep is a high-performance tool for analyzing .sam/.bam files (up to and including variant calling) in sequencing pipelines. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
eLSA | Extended Local Similarity Analysis -- Finding Time-Dependent Associations in Time Series Datasets | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMA | EMA uses a latent variable model to align barcoded short-reads (such as those produced by 10x Genomics' sequencing platform). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMBOSS | EMBOSS is "The European Molecular Biology Open Software Suite". EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EMMAX | EMMAX is a statistical test for large scale human or model organism association mapping accounting for the sample structure. In addition to the computational efficiency obtained by EMMA algorithm, EMMAX takes advantage of the fact that each loci explains only a small fraction of complex traits, which allows us to avoid repetitive variance component estimation procedure, resulting in a significant amount of increase in computational time of association mapping using mixed model. |
Genobioinfo Cluster: How to use |
Emu | Emu is a relative abundance estimator for 16S genomic sequences. The method is optimized for error-prone full-length reads, but can also be utilized for short-read data. |
Genobioinfo Cluster: How to use |
Ensembl-API | Ensembl uses MySQL relational databases to store its information. A comprehensive set of Application Programme Interfaces (APIs) serve as a middle-layer between underlying database schemes and more specific application programmes. The APIs aim to encapsulate the database layout by providing efficient high-level access to data tables and isolate applications from data layout changes. Ensembl's API is written in Perl | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EPIK | EPIK is a program for rapid alignment-free phylogenetic placement. |
Genobioinfo Cluster: How to use |
ErmineJ | ErmineJ performs analyses of gene sets in high-throughput genomics data such as gene expression profiling studies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ERPIN | ERPIN (Easy RNA Profile IdentificatioN) is an RNA motif search program developped by Daniel Gautheret and André Lambert. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ERVmap | ERVmap is one part curated database of human proviral ERV loci and one part a stringent algorithm to determine which ERVs are transcribed in their RNA seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
est-sfs | est-sfs implements a maximum likelihood method to infer the unfolded site frequency spectrum (the uSFS) and ancestral state probabilities for DNA sequence data. |
Genologin Cluster: How to use |
ETE | A Python framework for the analysis and visualization of trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EuGeneEP | EuGene is an open integrative gene finder for eukaryotic and prokaryotic genomes. EuGene-EP (Eukaryote Pipeline) facilitates the application of EuGene on eukaryote genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EukCC | EukCC is a completeness and contamination estimator for metagenomic assembled microbial eukaryotic genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EukRep | Classification of Eukaryotic and Prokaryotic sequences from metagenomic datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EUPAN | Toolkit that integrates various software in order to build eukaryotic pangenomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
evalAdmix | evalAdmix allows to evaluate the results of an admixture analysis (i.e. the result of applying ADMIXTURE, STRUCTURE, NGSadmix and similar). |
Genobioinfo Cluster: How to use |
EVE | EVE is a set of protein-specific models providing for any single amino acid mutation of interest a score reflecting the propensity of the resulting protein to be pathogenic. |
Genobioinfo Cluster: How to use |
EVidenceModeler (EVM) | The EVidenceModeler (aka EVM) software combines ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. EVM provides a flexible and intuitive framework for combining diverse evidence types into a single automated gene structure annotation system. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ExaML | Exascale Maximum Likelihood (ExaML) code for phylogenetic inference using MPI. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Exonerate | A generic tool for sequence alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
eXpress | eXpress is a streaming tool for quantifying the abundances of a set of target sequences from sampled subsequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FABuLOUS | A gap-closing software tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. Initially called TGS-GapCloser. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON | Falcon: a set of tools for fast aligning long reads for consensus and assembly |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON_unzip | Making diploid assembly becomes common practice for genomic study | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON-Phase | FALCON-Phase integrates PacBio long-read assemblies with Phase Genomics Hi-C data to create phased, diploid, chromosome-scale scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FaMoz | FaMoz, a software written in the C language and in TclTk, uses likelihood calculation and simulation to perform parentage studies with codominant, dominant, cytoplasmic markers or combinations of the different types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FAMSA | Algorithm for large-scale multiple sequence alignments (400k proteins in 2 hours and 8BG of RAM) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FaST-LMM | FaST-LMM (Factored Spectrally Transformed Linear Mixed Models) is a program for performing genome-wide association studies (GWAS) on large data sets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Fast-Plast | Fast-Plast is a pipeline that leverages existing and novel programs to quickly assemble, orient, and verify whole chloroplast genome sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FASTA | FASTA is a sequence similarity search tool which uses heuristics for fast local alignment searching. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FASTA Composition | finds the overall composition of sequences in a FASTA file |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FASTA_Length | FASTA Length finds the lengths of sequences in a FASTA file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fasta_validator | C code to validate a fasta file. |
Genobioinfo Cluster: How to use |
FastaGrep | FastaGrep is a tool for searching oligonucleotide binding sites from FastA genomic sequences. It can do both match/mismatch based and thermodynamic binding energy searches. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastix | A simple command line tool to add prefixes to FASTA headers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastk-medians | A set of utilities to calculate the median number of times the k-mers in a sequence of interest occur across the whole set. |
Genobioinfo Cluster: How to use |
FastME | FastME provides distance algorithms to infer phylogenies. FastME is based on balanced minimum evolution, which is the very principle of NJ. FastME improves over NJ by performing topological moves using fast, sophisticated algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastNGSadmix | Program for infering admixture proportions and doing PCA with a single NGS sample. Inferences based on reference panel. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastp | A tool designed to provide fast all-in-one preprocessing for FastQ files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastplong | Ultra-fast preprocessing and quality control for long-read sequencing data. |
Genobioinfo Cluster: How to use |
fastprofkernel | fastprofkernel is a Debian package that uses an accelerated version of the original profile kernel <1> to automatically train SVM based classification models. It can assign user-defined classes to so far uncharacterized proteins. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastQ Screen | FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastq_illumina_filter | This program can filter FASTQ files produced by CASAVA 1.8, and keep/discard reads based on this filter flag. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastQC | A Quality Control application for FastQ files. FastQC is an application which takes a FastQ file and runs a series of tests on it to generate a comprehensive QC report. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastQValidator | The fastQValidator validates the format of fastq files | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FastSimBac | FastSimBac is a simulator of the coalescent process with bacterial recombination that simulates genealogies spatially across chromosomes as a Markov process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastsimcoal2 | Fast sequential Markov coalescent simulation of genomic data under complex evolutionary models | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastStructure | fastStructure is an algorithm for inferring population structure from large SNP genotype data. It is based on a variational Bayesian framework for posterior inference and is written in Python2.x. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastTree | FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FASTX-Toolkit | The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FBAT | FBAT is an acronym for Family-Based Association Tests in genetic analyses. Family-based association designs, as opposed to case-control study designs, are particularly attractive, since they test for linkage as well as association, avoid spurious associations caused by admixture of populations, and are convenient for investigators interested in refining linkage findings in family samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FCS-GX | FCS-GX detects contamination from foreign organisms in genome sequences. This tool is one module within the NCBI Foreign Contamination Screening (FCS) program suite. |
Genobioinfo Cluster: How to use |
FEELnc | FlExible Extraction of LncRNA. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fgbio | A set of tools to analyze genomic data with a focus on Next Generation Sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FigTree | FigTree is designed as a graphical viewer of phylogenetic trees and as a program for producing publication-ready figures. |
Genobioinfo Cluster: How to use |
Filtlong | Filtlong is a tool for filtering long reads by quality. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FindingOverCovRegions | FindOverCovRegions.py search for genomic regions with abnormal read coverage (e.g. depth). To do so, this program requieres begraph-like file (e.g. bedtools genomecov per-base reports) where for each position of the genome, the coverage depth is reported (even 0 values). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fineRADstructure | A complete, easy to use, and fast population inference package for RAD-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fineSTRUCTURE | fineSTRUCTURE is a fast and powerful algorithm for identifying population structure using dense sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FLAIR | FLAIR (Full-Length Alternative Isoform analysis of RNA) for the correction, isoform definition, and alternative splicing analysis of noisy reads. FLAIR has primarily been used for nanopore cDNA, native RNA, and PacBio sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
flare | The flare program uses a set of reference haplotypes to infer the ancestry of each allele in a set of admixed study samples. The flare program is fast, accurate, and memory-efficient. | Genobioinfo Cluster: How to use |
FLAS | FLAS is software that makes self-correction for PacBio long reads with fast speed and high throughput. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FLASH | FLASH, Fast Length Adjustment of SHort reads, is a very accurate fast tool to merge paired-end reads from fragments that are shorter than twice the length of reads. The extended length of reads has a significant positive impact on improvement of genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Flexbar | Flexbar preprocesses high-throughput sequencing data efficiently. It demultiplexes barcoded runs and removes adapter sequences. Moreover, trimming and filtering features are provided. Flexbar increases read mapping rates and improves genome and transcriptome assemblies. It supports next-generation sequencing data in fasta/q and csfasta/q format from Illumina, Roche 454, and the SOLiD platform. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Flye | Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FMLRC | FMLRC, or FM-index Long Read Corrector, is a tool for performing hybrid correction of long read sequencing using the BWT and FM-index of short-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FMLRC2 | FMLRC2 performs error correction/polishing of long erroneous sequences with accurate short reads. As such, it can be used as both an error-correction tool <1> for raw long reads (ex. Oxford Nanopore) and a polishing tool <2> for de novo assemblies. |
Genobioinfo Cluster: How to use |
fpa | Filter Pairwise Alignment | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fqtools | fqtools is a software suite for fast processing of FASTQ files; Various file manipulations are supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FragGeneScan | FragGeneScan is an application for finding (fragmented) genes in short reads. It can also be applied to predict prokaryotic genes in incomplete assemblies or complete genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FragGeneScanRs | FragGeneScanRs is an application for finding (fragmented) genes in short reads. It can also be applied to predict prokaryotic genes in incomplete assemblies or complete genomes. It is a re-implementation of FragGeneScan in Rust. |
Genobioinfo Cluster: How to use |
fragmatic | Simple program for in silico restriction digest of genomic sequences, to simulate RAD-family NGS library prep methods. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FrameBot | RDP FrameBot is a tool for correcting frameshift errors caused by insertions and deletions in DNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FrameDP | Sensitive peptide detection on noisy matured sequences. Available with command line interface on the cluster. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FreeBayes | FreeBayes is a Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Funannotate | Funannotate is a genome prediction, annotation, and comparison software package. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
G-PhoCS | G-PhoCS is a software package for inferring ancestral population sizes, population divergence times, and migration rates from individual genome sequences. | Genobioinfo Cluster: How to use |
g3d | Genomics 3D visualizer tool sets.
|
Genobioinfo Cluster: How to use |
GAAS | Genome Assembly Annotation Service: Suite of tools related to Genome Assembly Annotation Service tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Galaxy | Galaxy is a scientific workflow, data integration, and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming experience. Although it was initially developed for genomics research, it is largely domain agnostic and is now used as a general bioinformatics workflow management system | Genologin Cluster: soon SGE Cluster: Link to Galaxy instance Genobioinfo Cluster: Ask for Install |
GALBA | GALBA is a pipeline for fully automated prediction of protein coding gene structures with AUGUSTUS in novel eukaryotic genomes for the scenario where high quality proteins from a closely related species are available. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gamma-SMC | This is an alternative and an upgrade of the widely used PSMC method, which infers population size trajectories from VCF files. |
Genobioinfo Cluster: How to use |
GapCloser | The GapCloser is designed to close the gaps emerging during the scaffolding process by SOAPdenovo, using the abundant pair relationships of short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gappa | A toolkit for analyzing and visualizing phylogenetic (placement) data. | Genobioinfo Cluster: How to use |
gargammel | gargammel is an ancient DNA simulator | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GARLI | GARLI, Genetic Algorithm for Rapid Likelihood Inference is a program for inferring phylogenetic trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GATK | The GATK is a structured software library that makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gblocks | Gblocks is a computer program written in ANSI C language that eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. These positions may not be homologous or may have been saturated by multiple substitutions and it is convenient to eliminate them prior to phylogenetic analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
gCluster | The gCluster algorithm is a general clustering method that predicts clusters of any biological word or combination of them, relying only on the DNA sequence and the statistical significance. When using CG as word, gCluster works similarly to CpGcluster, our method to predict CpG islands. More broadly, gCluster has much in common with wordCluster but uses an improved distance model. |
Genobioinfo Cluster: How to use |
GCTA | GCTA (Genome-wide Complex Trait Analysis) was originally designed to estimate the proportion of phenotypic variance explained by genome- or chromosome-wide SNPs for complex traits (the GREML method), and has subsequently extended for many other analyses to better understand the genetic architecture of complex traits. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GDAL | a translator library for raster and vector geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation. | Genologin Cluster: /tools/libraries/gdal Genobioinfo Cluster: Ask for Install |
GEM | GEM is a scientific software for studying protein-DNA interaction at high resolution using ChIP-seq/ChIP-exo data. It can also be applied to CLIP-seq and Branch-seq data. |
Genobioinfo Cluster: How to use |
GEM-library | A set of very optimized tools for indexing/querying huge genomes/files. |
Genobioinfo Cluster: How to use |
GEM-Tools | GEM-Tools is a C API and a Python module to support and simplify usage of the GEM Mapper. |
Genobioinfo Cluster: How to use |
gemBS | gemBS is a high performance bioinformatic pipeline designed for highthroughput analysis of DNA methylation data from whole genome bisulfites sequencing data (WGBS). It combines GEM3, a high performance read aligner and bs_call, a high performance variant and methyation caller, into a streamlined and efficient pipeline for bisulfite sequence analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GEMMA | GEMMA is the software implementing the Genome-wide Efficient Mixed Model Association algorithm for a standard linear mixed model and some of its close relatives for genome-wide association studies (GWAS). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeMoMa | Gene Model Mapper (GeMoMa) is a homology-based gene prediction program. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
geneid | geneid is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMark-ES | Unsupervised training is an important feature of the GeneMark-ES algorithm that identifies protein coding genes in eukaryotic genomes. This is the only eukaryotic gene finder that can perform gene prediction without curated training sets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMark-ET | a semi-supervised version of GeneMark-ES, called GeneMark-ET that uses RNA-Seq reads to improve training. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneSeqer | Sensitive spliced-alignment of cDNAs or proteins. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Genewise | Wise2 is a package focused on comparisons of biopolymers, commonly DNA sequence and protein sequence. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
genomegaMap | Within-species genome-wide dN/dS estimation from very many genomes. | Genobioinfo Cluster: How to use |
GenomeScope | Fast genome analysis from unassembled short reads |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeScope2.0 | Reference-free profiling of polyploid genomes | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeSTRiP | Genome STRiP (Genome STRucture In Populations) is a suite of tools for discovering and genotyping structural variations using sequencing data. The methods are designed to detect shared variation using data from multiple individuals. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GenomeThreader | GenomeThreader is a software tool to compute gene structure predictions. The gene structure predictions are calculated using a similarity-based approach where additional cDNA/EST and/or protein sequences are used to predict gene structures via spliced alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeTools | Collection of bioinformatics tools (in the realm of genome informatics) combined into a single binary named "gt". |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomOrder | GenomOrder is a Nextflow pipeline reordering and renaming scaffolds from up to 5 assemblies using a reference. It is also able to produce D-Genies back-up files allowing rapid visual comparison of chromosomes of the assemblies versus the reference. These files can be uploaded and visualized with the online tool D-Genies : http://dgenies.toulouse.inra.fr/ The assembly mapping versus the reference is performed with minimap2. These assemblies can be scaffolded or not. If they are not, an option enables to scaffold them according to the reference. The pipeline produces D-Genies back-up file for a user defined list of reference chromosomes. The chromosome file contains one reference chromosome name per line. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gerbil | A basic task in bioinformatics is the counting of k-mers in genome strings. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GEVA | Genealogical Estimation of Variant Age. We have developed a method for estimating the age of genetic variants; that is, the time of origin of an allele through mutation at a single locus. Our approach, which we refer to as the Genealogical Estimation of Variant Age (GEVA), is similar to existing methods that involve coalescent modeling to infer the time to the most recent common ancestor (TMRCA) between individual genomes <13, 23, 24>. However, these methods typically operate on a discretized timescale <13>, utilize only a fraction of the information available in larger sample data <25>, or employ approximations to overcome computational complexity <14, 15, 26>. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GFAffix | Genobioinfo Cluster: How to use | |
gfatools | gfatools is a set of tools for manipulating sequence graphs in the GFA or the rGFA format. It has implemented parsing, subgraph and conversion to FASTA/BED. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GfaViz | Graphical interactive tool for the visualization of sequence graphs in GFA format. |
Genobioinfo Cluster: How to use |
gff3sort | A Perl Script to sort gff3 files and produce suitable results for tabix tools | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gff3toembl | Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gffcompare | gffcompare can be used to compare, merge, annotate and estimate accuracy of one or more GFF files (the “query” files), when compared with a reference annotation (also provided as GFF). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gffread | GFF/GTF utility providing format conversions, region filtering, FASTA sequence extraction and more. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ggCaller | A de Bruijn graph-based gene-caller and pangenome analysis tool. | Genobioinfo Cluster: How to use |
gget | gget enables efficient querying of genomic reference databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gh-cli | gh is GitHub on the command line. It brings pull requests, issues, and other GitHub concepts to the terminal next to where you are already working with git and your code. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GHC | GHC is a state-of-the-art, open source, compiler and interactive environment for the functional language Haskell | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gIMble | A genome-wide IM blockwise likelihood estimation toolkit | Genologin Cluster: Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GlimmerHMM | GlimmerHMM is a new gene finder based on a Generalized Hidden Markov Model (GHMM). Although the gene finder conforms to the overall mathematical framework of a GHMM, additionally it incorporates splice site models adapted from the GeneSplicer program and a decision tree adapted from GlimmerM. It also utilizes Interpolated Markov Models for the coding and noncoding models . Currently, GlimmerHMM's GHMM structure includes introns of each phase, intergenic regions, and four types of exons (initial, internal, final, and single). |
Genobioinfo Cluster: How to use |
GLIMPSE | GLIMPSE is a phasing and imputation method for large-scale low-coverage sequencing studies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GMAP-GSNAP | GMAP: A Genomic Mapping and Alignment Program for mRNA and EST SequencesGSNAP: Genomic Short-read Nucleotide Alignment Program |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gmove | Gmove is a genome annotation tool. This combiner takes as input mapping of RNA-seq or protein or ab initio data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Goalign | Goalign is a set of command line tools to manipulate multiple alignments. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
goatools | Python library to handle Gene Ontology (GO) terms. |
Genobioinfo Cluster: How to use |
GONE | This program calculates and uses linkage disequilibrium at genomic marker loci to infer the effective population size trajectories over a period of about 100-200 hundred generations back in time. |
Genobioinfo Cluster: How to use |
Gotree | Gotree is a set of command line tools and an API to manipulate phylogenetic trees. |
Genobioinfo Cluster: How to use |
GraffiTE | GraffiTE is a pipeline that finds polymorphic transposable elements in genome assemblies and/or long reads, and genotypes the discovered polymorphisms in read sets using genome-graphs. |
Genobioinfo Cluster: How to use |
GraphAligner | Seed-and-extend program for aligning long error-prone reads to genome graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraPhlAn | GraPhlAn is a software tool for producing high-quality circular representations of taxonomic and phylogenetic trees. It focuses on concise, integrative, informative, and publication-ready representations of phylogenetically- and taxonomically-driven investigation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GraphMap | A highly sensitive and accurate mapper for long, error-prone reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
graphtyper | graphtyper is a graph-based variant caller capable of genotyping population-scale short read data sets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphUnzip | Unzip assembly graphs with Hi-C data and/or long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GRIDSS | GRIDSS is a module software suite containing tools useful for the detection of genomic rearrangements. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Grinder | Grinder is a versatile open-source bioinformatic tool to create simulated omic shotgun and amplicon sequence libraries for all main sequencing platforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GROMACS | GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. It is primarily designed for biochemical molecules like proteins, lipids and nucleic acids that have a lot of complicated bonded interactions, but since GROMACS is extremely fast at calculating the nonbonded interactions (that usually dominate simulations) many groups are also using it for research on non-biological systems, e.g. polymers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GSAlign | An ultra-fast sequence alignment algorithm for intra-species genome comparison. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GTFtools | GTFtools provides a set of functions to analyze various modes of gene models. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gtools | GTOOL is a program for transforming sets of genotype data for use with the programs SNPTEST and IMPUTE. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gubbins | Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences. Gubbins (Genealogies Unbiased By recomBinations In Nucleotide Sequences) is an algorithm that iteratively identifies loci containing elevated densities of base substitutions while concurrently constructing a phylogeny based on the putative point mutations outside of these regions. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hal2vg | Convert HAL to vg-compatible sequence graph. |
Genobioinfo Cluster: How to use |
HANNO | Efficient High-throughput ANNOtation of protein coding genes in eukaryote genomes. |
Genobioinfo Cluster: How to use |
hap-ibd | The hap-ibd program detects identity-by-descent (IBD) segments and homozygosity-by-descent (HBD) segments in phased genotype data. |
Genobioinfo Cluster: How to use |
hap.py | Genobioinfo Cluster: How to use | |
Hap10 | The goal is to reconstruct accurate and long haplotypes polyploid genome using linked reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HapCUT2 | Software tools for haplotype assembly from sequence data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hapflk | hapflk is a software implementing the hapFLK <1> and FLK <2> tests for the detection of selection signatures based on multiple population genotyping data. | Genologin Cluster: In Python-2.7.15 (module load system/Python-2.7.15) Genobioinfo Cluster: How to use |
HAPHPIPE | NGS viral assembly and population genetics. |
Genobioinfo Cluster: How to use |
Haplogrep | Haplogrep is a command-line tool for mtDNA haplogroup classification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HaploHiC | Comprehensive haplotype division of Hi-C PE-reads based on local contacts ratio. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HaploMerger2 | <HaploMerger2 (HM2) is an important upgrade over HaploMerger. HM2 is an easy-to-use automated pipeline for improving genome assembly in the post-assembly stage. It consists of a set of executables as well as wrappers for several third-part software. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Haplostrips | Haplostrips produce plots that depict variants in a genomic window among different samples. Visualize similarities between haplotypes with respect to a reference haplotype through haplotype clustering and sorting, useful for revealing hidden population structure. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Hapo-G | Hapo-G is a tool that aims to improve the quality of genome assemblies by polishing the consensus with accurate reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HarvestTools | HarvestTools is a utility for creating and interfacing with Gingr files, which are efficient archives that the Harvest Suite uses to store reference-compressed multi-alignments, phylogenetic trees, filtered variants and annotations. |
Genobioinfo Cluster: How to use |
HASLR | HASLR is a tool for rapid genome assembly of long sequencing reads. HASLR is a hybrid tool which means it requires long reads generated by Third Generation Sequencing technologies (such as PacBio or Oxford Nanopore) together with Next Generation Sequencing reads (such as Illumina) from the same sample. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Hclust2 | Hclust2 is a handy tool for plotting heat-maps with several useful options to produce high quality figures that can be used in publication. |
Genobioinfo Cluster: How to use |
hcluster_sg | A hierarchical clustering software for sparse graphs | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HDFView | HDFView is a visual tool for browsing and editing HDF4 and HDF5 files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HECIL | Hybrid Error Correction of Long Reads using Iterative Learning | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HELEN | HELEN (Homopolymer Encoded Long-read Error-corrector for Nanopore) uses a Recurrent-Neural-Network (RNN) based Multi-Task Learning (MTL) model that can predict a base and a run-length for each genomic position using the weights generated by MarginPolish. This installation includes MarginPolish. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
hexamer | Find likely coding segments in DNA using composition-normalised hexamer tables. |
Genobioinfo Cluster: How to use |
HG-CoLoR | HG-CoLoR (Hybrid method based on a variable-order de bruijn Graph for the error Correction of Long Reads) is a hybrid method for the error correction of long reads that both aligns the short reads to the long reads, and uses a variable-order de Bruijn graph, in a seed-and-extend approach. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HGT-ID | An efficient and sensitive program for detecting viral insertion sequences from known viral reference genome in the genome of human cancers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HGTector | Genome-wide prediction of horizontal gene transfer based on distribution of sequence homology patterns. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HiCExplorer | HiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Hickit | Hickit is a set of tools initially developed to process diploid single-cell Hi-C data. It extracts contact pairs from read alignment, identifies phases of contacts overlapping with SNPs of known phases, imputes missing phases, infers the 3D structure of a single cell and visualizes the structure. |
Genobioinfo Cluster: How to use |
HiCLift | A fast and efficient tool for converting chromatin interaction data between genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
hifiasm | Hifiasm is a fast haplotype-resolved de novo assembler for PacBio Hifi reads. Unlike most existing assemblers, hifiasm starts from uncollapsed genome. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hifiasm-meta | De novo metagenome assembler, based on hifiasm, a haplotype-resolved de novo assembler for PacBio Hifi reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HISAT2 | HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes (as well as to a single reference genome). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HMMER | HMMER is a package used for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Homer | HOMER (Hypergeometric Optimization of Motif EnRichment) is a suite of tools for Motif Discovery and next-gen sequencing analysis. It is a collection of command line programs for unix-style operating systems written in Perl and C++. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HTSeq | HTSeq is a Python package that provides infrastructure to process data from high-throughput sequencing assays. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HyPhy | HyPhy is an open-source software package for the analysis of genetic sequences using techniques in phylogenetics, molecular evolution, and machine learning. It features a complete graphical user interface (GUI) and a rich scripting language for limitless customization of analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HyPO | HyPo (a Hybrid Polisher): Super Fast & Accurate Polisher for Long Read Genome Assemblies | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
i-ADHoRe | i-ADHoRe is a highly sensitive software tool to detect degenerated homology relations within and between different genomes. |
Genologin Cluster: How to use <Genobioinfo Cluster: How to use |
IBA | Python script to assemble AHE (Anchored Hybrid Enrichment) data loci by loci. To summarize, raw sequencing reads for each species were filtered for quality using Trim Galore! v. 0.4.0 (Krueger, 2015), and assembled using the iterative baited assembly (IBA) Python script which employs USEARCH v. 7.0 (Edgar, 2010) and Bridger v. 2014-12-01 (Chang et al., 2015) to assemble loci for each taxon. MAFFT v. 7.245 (Katoh and Standley, 2013) was used to align assembled sequences, and the probe and flanking regions were separated with the Python script Extract_probe_region.py (Breinholt et al., 2018). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ICEscreen | ICEscreen is a bioinformatic pipeline for the detection and annotation of ICEs (Integrative and Conjugative Elements) and IMEs (Integrative and Mobilizable Elements) in Bacillota genomes. |
Genobioinfo Cluster: How to use |
IDR | The IDR (Irreproducible Discovery Rate) framework is a unified approach to measure the reproducibility of findings identified from replicate experiments and provide highly stable thresholds based on reproducibility. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IGV | The Integrative Genomics Viewer (IGV) is a high-performance visualization tool for interactive exploration of large, integrated genomic datasets. It supports a wide variety of data types, including array-based and next-generation sequence data, and genomic annotations. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Infernal | Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a special case of profile stochastic context-free grammars called covariance models (CMs). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Infomap | Multi-level network clustering based on the Map Equation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Inspector | A tool for evaluate long-read de novo assembly results. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
InterProScan | InterProScan is a tool that combines different protein signature recognition methods into one resource. No less than 14 pattern/profiles databanks can be interrogated.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IPA | Improved Phased Assembler (IPA) is the official PacBio software for HiFi genome assembly. IPA was designed to utilize the accuracy of PacBio HiFi reads to produce high-quality phased genome assemblies. IPA is an end-to-end solution, starting with input reads and resulting in a polished assembly. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IPK | IPK is a tool for computing phylo-k-mers for a fixed phylogeny. |
Genobioinfo Cluster: How to use |
ipyrad | An interactive toolkit for assembly and analysis of restriction-site associated genomic data sets (e.g., RAD, ddRAD, GBS) for population genetic and phylogenetic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IQ-TREE | Efficient phylogenomic software by maximum likelihood |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iREAD | iREAD (intron REtention Analysis and Detector)is a tool to detect intron retention(IR) events from RNA-seq datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IRFinder | Detecting intron retention from RNA-Seq experiments |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Iris | A module which corrects the sequences of structural variant calls (currently only insertions). |
Genobioinfo Cluster: How to use |
IRMA | IRMA was designed for the robust assembly, variant calling, and phasing of highly variable RNA viruses. Currently IRMA is deployed with modules for influenza and ebolavirus. IRMA is free to use and parallelizes computations for both cluster computing and single computer multi-core setups. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iSMC | This software extend the sequentially Markovian coalescent model to jointly infer the spatial variation in recombination rate (rho) from a single pair of unphased genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IsoLasso | IsoLasso is an algorithm to assemble transcripts and estimate their expression levels from RNA-Seq reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IsoSeq3 | Scalable De Novo Isoform Discovery from Single-Molecule PacBio Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ITSx | Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for use in environmental sequencing | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IVA | Iterative Virus Assembler is a de novo assembler designed to assemble virus genomes that have no repeat sequences, using Illumina read pairs sequenced from mixed populations at extremely high and variable depth | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
iVar | Var is a computational package that contains functions broadly useful for viral amplicon-based sequencing. Additional tools for metagenomic sequencing are actively being incorporated into iVar. While each of these functions can be accomplished using existing tools, iVar contains an intersection of functionality from multiple tools that are required to call iSNVs and consensus sequences from viral sequencing data across multiple replicates. We implemented the following functions in iVar: (1) trimming of primers and low-quality bases, (2) consensus calling, (3) variant calling - both iSNVs and insertions/deletions, and (4) identifying mismatches to primer sequences and excluding the corresponding reads from alignment files. |
Genobioinfo Cluster: How to use |
Jabba | A hybrid error correction tool for sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
JAGS | JAGS is Just Another Gibbs Sampler. It is a program for analysis of Bayesian hierarchical models using Markov Chain Monte Carlo (MCMC) simulation not wholly unlike BUGS. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Jasmine | JASMINE: Jointly Accurate Sv Merging with Intersample Network Edges. This tool is used to merge structural variants (SVs) across samples. Each sample has a number of SV calls, consisting of position information (chromosome, start, end, length), type and strand information, and a number of other values. Jasmine represents the set of all SVs across samples as a network, and uses a modified minimum spanning forest algorithm to determine the best way of merging the variants such that each merged variants represents a set of analogous variants occurring in different samples. Manual : Jasmine User Manual · mkirsche/Jasmine Wiki · GitHub Jasmine also includes a module for automating the creation of IGV screenshots of variants of interest. |
Genobioinfo Cluster: How to use |
JASPER | JASPER (Jellyfish based Assembly Sequence Polisher for Error Reduction) is an efficient polishing tool for draft genomes. It uses accurate reads (PacBio HiFi or Illumina) to evaluate consensus quality and correct consensus errors in genome assemblies. JASPER is substantially faster than polishing methods based on sequence alignment, and more accurate than currently available k-mer based methods. The efficiency and scalability of JASPER allows one to use it to create personalized reference genomes for specific populations very efficiently, even for large sequenced populations, by polishing the reference genome, such as GRCh38 or chm13v2.0 for human, with Illumina reads sequenced from many individuals from the population. Please see this manuscript for more details: Guo A, Salzberg SL, Zimin AV. JASPER: A fast genome polishing tool that improves accuracy of genome assemblies. PLoS Comput Biol. 2023 Mar 31;19(3):e1011032. doi: 10.1371/journal.pcbi.1011032. PMID: 37000853; PMCID: PMC10096238. |
Genobioinfo Cluster: How to use |
JCVI | Collection of Python libraries to parse bioinformatics files, or perform computation related to assembly, annotation, and comparative genomics. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Jellyfish | JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
jModeltest | jModelTest is a tool to carry out statistical selection of best-fit models of nucleotide substitution. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Juicebox | Software for visualizing data from Hi-C and other proximity mapping experiments | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
juicebox_scripts | A collection of scripts for working with Hi-C data, Juicebox, and other genomic file formats | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Juicer | A One-Click System for Analyzing Loop-Resolution Hi-C Experiments |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Julia | Julia is a high-level, high-performance dynamic programming language for technical computing, with syntax that is familiar to users of other technical computing environments. It provides a sophisticated compiler, distributed parallel execution, numerical accuracy, and an extensive mathematical function library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
JustOrthologs | A Fast, Accurate, and User-Friendly Ortholog-Finding Algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Jvarkit | Java utilities for Bioinformatics (only requested tools are compiling) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KAD | KAD is designed for evaluating the accuracy of nucleotide base quality of genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Kaiju | Fast taxonomic classification of metagenomic sequencing reads using a protein reference database |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kallisto | kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KAT | KAT (The K-mer Analysis Toolkit) is a suite of tools that generate, analyse and compare k-mer spectra produced from sequence files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KCOSS | A fast and space-saving multi-threaded k-mer frequency statistics algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kentUtils | UCSC command line bioinformatic utilities |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
klocate | Standalone tool based on the bwa index to locate a set of kmers along a reference genome. klocate searches each kmer (full and perfect match) in the index and outputs all positions the kmer maps to (output to sdtout in bed format). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KMA | KMA is a mapping method designed to map raw reads directly against redundant databases, in an ultra-fast manner using seed and extend. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmap | Standalone tool based on the bwa index to locate a set of kmers along a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KMC | Genologin Cluster: How to use Genobioinfo Cluster: How to use | |
kmdiif | kmdiff provides differential k-mers analysis between two populations (control and case). Each population is represented by a set of short-read sequencing. Outputs are differentially represented k-mers between controls and cases. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmer-counter | A fast k-mer counter written in Rust. |
Genobioinfo Cluster: How to use |
KmerGenie | KmerGenie estimates the best k-mer length for genome de novo assembly. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmersGWAS | A library for running k-mers based GWAS. |
Genobioinfo Cluster: How to use |
KofamScan | KofamScan is a gene function annotation tool based on KEGG Orthology and hidden Markov model |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
komplexity | A command-line tool built in Rust to quickly calculate and/or mask low-complexity sequences from a FAST file. This uses the number of unique k-mers over a sequence divided by the length to assess complexity. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Kraken | Kraken is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Kraken2 | Kraken2 is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KrakenTools | KrakenTools provides individual scripts to analyze Kraken/Kraken2/Bracken/KrakenUniq output files. |
Genobioinfo Cluster: How to use |
KrakenUniq | KrakenUniq (formerly KrakenHLL) is a novel metagenomics classifier that combines the fast k-mer-based classification of Kraken with an efficient algorithm for assessing the coverage of unique k-mers found in each species in a dataset. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Krona | Krona allows hierarchical data to be explored with zoomable pie charts. Krona charts can be created using an Excel template or KronaTools, which includes support for several bioinformatics tools and raw data formats. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kSNP4 | kSNP4 identifies the pan-genome SNPs in a set of genome sequences, and estimates phylogenetic trees based upon those SNPs. |
Genobioinfo Cluster: How to use |
LACHESIS | Software that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
lamassemble | Merge overlapping "long" DNA reads into a consensus sequence. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LAST | LAST finds similar regions between sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lastp_aai | A simple Python script for calculating pairwise amino acid identity (AAI) between protein files (extension .faa) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LASTZ | A tool for aligning two DNA sequences, and inferring appropriate scoring parameters automatically. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lcMLkin | lcMLkin is a C++ program that allows users to infer biological relatedness from low coverage 2nd generation sequencing data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LDhat | LDhat is a package written in the C and C++ languages for the analysis of recombination rates from population genetic data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LDhelmet | LDhelmet performs statistical inference for fine-scale variable recombination rate estimation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
leeHom | A program for the Bayesian reconstruction of ancient DNA. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LEfSe | LEfSe (Linear discriminant analysis effect size) is a tool developed by the Huttenhower group to find biomarkers between 2 or more groups using relative abundances. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Lep-MAP3 | Lep-MAP3 is a novel and free software for linkage mapping. It can construct linkage maps on very large number of markers and individuals on single or multiple families. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
libplinkio | This is a small C and Python library for reading Plink genotype files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
libstree | libstree is a generic suffix tree implementation, written in C. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Liftoff | Liftoff is a tool that accurately maps annotations in GFF or GTF between assemblies of the same, or closely-related species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lima | Demultiplex Barcoded PacBio Samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Linker | Linker is a suite of C++ tools useful for interpreting long and linked read sequencing of cancer genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LINKS | LINKS is a scalable genomics application for scaffolding or re-scaffolding genome assembly drafts with long reads, such as those produced by Oxford Nanopore Technologies Ltd and Pacific Biosciences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LIQA | Long-read Isoform Quantification and Analysis) is an Expectation-Maximization based statistical method to quantify isoform expression and detect differential alternative splicing (DAS) events using long-read RNA-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LJA | La Jolla Assembler (LJA) is a tool for genome assembly from HiFI reads based on de Bruijn graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
llvm | The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LOCALIZER | LOCALIZER is a machine learning method for predicting the subcellular localization of both plant proteins and pathogen effectors in the plant cell. |
Genobioinfo Cluster: How to use |
LocARNA | LocARNA is a tool for multiple alignment of RNA molecules. LocARNA requires only RNA sequences as input and will simultaneously fold and align the input sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
locator | A supervised machine learning method for predicting the geographic origin of a sample from genotype or sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Loctree3 | Protein Subcelullar Localization Sequenced-Based Predictor |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LoFreq | A sequence-quality aware, ultra-sensitive variant caller for NGS data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LongRanger | Long Ranger is a set of analysis pipelines that processes Chromium sequencing output to align reads and call and phase SNPs, indels, and structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
longshot | Longshot is a variant calling tool for diploid genomes using long error prone reads such as Pacific Biosciences (PacBio) SMRT and Oxford Nanopore Technologies (ONT). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LongStitch | A genome assembly correction and scaffolding pipeline using long reads. Basically runs Tigmint, ntLink, ARKS. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Look4TRs | A de-novo tool for detecting simple tandem repeats using self-supervised hidden Markov models. |
Genobioinfo Cluster: How to use |
LoRDEC | LoRDEC is a program to correct sequencing errors in long reads from 3rd generation sequencing with high error rate, and is especially intended for PacBio reads. It uses a hybrid strategy, meaning that it uses two sets of reads: the reference read set, whose error rate is assumed to be small, and the PacBio read set, which is then corrected using the reference set. Typically, the reference set contains Illumina reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LR_Gapcloser | LR_Gapcloser is a gap closing tool using uncorrected or corrected long reads generated from Pacbio platform or Nanopore platform. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRez | Standalone tool and library allowing to work with barcoded linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRScaf | TGS scaffolding . Improving draft genomes using long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRSIM | Simulator for Linked Reads: this package simulates whole genome sequencing using 10X Genomics Linked Read technology. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LSx | LSX is a script in R that runs the LS³ and LS⁴ algorithms of data subsampling for multigene phylogenetic inference. Both of these algorithms do a gene-by-gene inspection of the heterogeneity of evolutionary rates among user-defined lineages of interest (LOI). Then, using criteria that differ in both algorithms (see details here or in the papers), they try to find a subsample of sequences that evolve at a homogeneous rate across all LOIs. If this subset is found, an alignment of the gene is produced with only the sequences that evolve homogeneously. At the same time, a table is also produced showing which sequences were “flagged” (the sequences that were removed), and which sequences were kept. If a subset of sequences that evolve at a homogeneous rate is not found, the gene is flagged entirely. |
Genobioinfo Cluster: How to use |
LtrDetector | A tool-suite for detecting long terminal repeat retrotransposons de-novo on the genomic scale. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LUMPY | A general probabilistic framework for structural variant discovery | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MACH | MACH is a Markov Chain based haplotyper that can resolve long haplotypes or infer missing genotypes in samples of unrelated individuals. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MACS | We present Model-based Analysis of ChIP-Seq (MACS) on short reads sequencers such as Genome Analyzer (Illumina / Solexa). MACS empirically models the length of the sequenced ChIP fragments, which tends to be shorter than sonication or library construction size estimates, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome sequence, allowing for more sensitive and robust prediction. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, is publicly available open source, and can be used for ChIP-Seq with or without control samples. | Genologin Cluster: in Python-2.7.2 Genobioinfo Cluster: How to use |
MACS2 | Model-based Analysis of ChIP-Seq |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MAESTRO | A Multi AgEnt STability pRedictiOn tool for changes in unfolding free energy upon point mutation. MAESTRO is structure based and distinguishes from similar approaches in the following points: (i) MAESTRO implements a multi-agent machine learning system. (ii) It provides predicted ΔΔG values along with a corresponding prediction quality measure. (iii) MAESTRO is applicable to biological assemblies. (iv) It provides high throughput scanning for multi-point mutations where sites and types of mutation can be comprehensively controlled. (v) Finally, the software provides a specific mode for the prediction of stabilizing disulfide bonds. |
Genobioinfo Cluster: How to use |
MAFFT | MAFFT is a multiple sequence alignment program for unix-like operating systems. It offers a range of multiple alignment methods, L-INS-i (accurate; for alignment of <?200 sequences), FFT-NS-2 (fast; for alignment of <?10,000 sequences), etc. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MAGeCK | Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. Manual and video tutorials : liulab / mageck-vispr — Bitbucket |
Genobioinfo Cluster: How to use |
MAGIC | A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Magic-BLAST | Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MAKER | MAKER is a portable and easily configurable genome annotation pipeline. Its purpose is to allow smaller eukaryotic and prokaryotic genome projects to independently annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab-initio gene predictions and automatically synthesizes these data into gene annotations having evidence-based quality values. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MALDER | This is a version of ALDER (http://groups.csail.mit.edu/cb/alder/) that has been modified to allow multiple admixture events. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MALT | MALT (MEGAN alignment tool) is an extension of MEGAN (metagenome analyzer). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Manta | Manta calls structural variants (SVs) and indels from mapped paired-end sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mapDamage | tracking and quantifying damage patterns in ancient DNA sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Mapsembler2 | Mapsembler2 is a targeted assembly software. It takes as input any number of NGS raw read set(s) (fasta or fastq, gzipped or not) and a set of input sequences (starters). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MapSplice | Accurate mapping of RNA-seq reads for splice junction discovery. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MapThin | Reduce the number of SNPs in a gene marker dense map computed by PLINK. First, by eliminating linked SNPs. Then, by applying different criteria. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MARVEL | MARVEL consists of a set of tools that facilitate the overlapping, patching, correction and assembly of noisy (not so noisy ones as well) long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mash | Genobioinfo Cluster: How to use | |
Mash | Fast genome and metagenome distance estimation using MinHash. documentation : Publications — Mash 2.0 documentation |
Genobioinfo Cluster: How to use |
MashMap | MashMap implements a fast and approximate algorithm for computing local alignment boundaries between long DNA sequences. It can be useful for mapping genome assembly or long reads (PacBio/ONT) to reference genome(s). Given a minimum alignment length and an identity threshold for the desired local alignments, Mashmap computes alignment boundaries and identity estimates using k-mers. It does not compute the alignments explicitly, but rather estimates a k-mer based Jaccard similarity using a combination of Minimizers and MinHash. This is then converted to an estimate of sequence identity using the Mash distance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mashtree | Create a tree using Mash distances. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MaSuRCA | MaSuRCA is whole genome assembly software. It combines the efficiency of the de Bruijn graph and Overlap-Layout-Consensus (OLC) approaches. MaSuRCA can assemble data sets containing only short reads from Illumina sequencing or a mixture of short reads and long reads (Sanger, 454) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MAtCHap | An ultra fast algorithm for solving the single individual haplotype assembly problem. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mauve | Mauve is a system for efficiently constructing multiple genome alignments in the presence of large-scale evolutionary events such as rearrangement and inversion. Multiple genome alignment provides a basis for research into comparative genomics and the study of evolutionary dynamics. Aligning whole genomes is a fundamentally different problem than aligning short sequences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MaxBin2 | MaxBin is a software for binning assembled metagenomic sequences based on an Expectation-Maximization algorithm. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mCaller | This program is designed to call m6A from nanopore data using the differences between measured and expected currents. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MCL | The MCL algorithm is short for the Markov Cluster Algorithm, a fast and scalable unsupervised cluster algorithm for graphs (also known as networks) based on simulation of (stochastic) flow in graphs. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MCScanX | MCScan is an algorithm to scan multiple genomes or subgenomes to identify putative homologous chromosomal regions, then align these regions using genes as anchors. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MECAT | MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Medaka | Medaka demonstrates a framework for error correcting sequencing data, particularly aimed at nanopore sequencing. Tools are provided for both training and inference. The code exploits the keras deep learning library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MEGA-CC | Software suite for analyzing DNA and protein sequence data from species and populations. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MEGAHIT | An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Megalodon | Megalodon provides "basecalling augmentation" for raw nanopore sequencing reads, including direct, reference-guided SNP and modified base calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MeGAMerge | A tool to merge assembled contigs, long reads from metagenomic sequencing runs |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MEGAN | MEtaGenome ANalyzer : Metagenomic data analysis : taxonomic and functionnal (SEED and KEGG classification) analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MEME | The MEME Suite allows you to: (1) discover motifs using MEME or GLAM2 on groups of related DNA or protein sequences, (2) search sequence databases using motifs, (3) compare a motif to all motifs in a database of motifs, and (3) associate motifs with Gene Ontology terms via their putative target genes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Merfin | Evaluate variant calls and its combination with k-mer multiplicity. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Merqury | Evaluate genome assemblies with k-mers and more |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Meryl | A genomic k-mer counter (and sequence utility) with nice features. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Met4j | Met4J is an open-source Java library dedicated to the structural analysis of metabolic networks. It also came with a toolbox gathering CLI for several analyses relevant to metabolism-related research. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaBat | An adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
metaBIT | An integrative and automated metagenomic pipeline for analysing microbial profiles from high-throughput sequencing shotgun data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
metaDMG | A fast and accurate ancient DNA damage toolkit for metagenomic data. |
Genobioinfo Cluster: How to use |
MetaEuk | MetaEuk is a modular toolkit designed for large-scale gene discovery and annotation in eukaryotic metagenomic contigs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
METAL | The METAL software is designed to facilitate meta-analysis of large datasets (such as several whole genome scans) in a convenient, rapid and memory efficient manner. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaMaps | MetaMaps is tool specifically developed for the analysis of long-read (PacBio/ONT) metagenomic datasets. It simultaenously carries out read assignment and sample composition estimation. It is faster than classical exact alignment-based approaches, and its output is more information-rich than that of kmer-spectra-based methods. For example, each MetaMaps alignment comes with an approximate alignment location, an estimated alignment identity and a mapping quality. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaMDBG | A lightweight assembler for long and accurate metagenomics reads. |
Genobioinfo Cluster: How to use |
MetaPhlAn | MetaPhlAn is a computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaPhlAn2 | MetaPhlAn2 is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea, Eukaryotes and Viruses) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaPhlAn3 | MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea and Eukaryotes) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaPhlAn4 | MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea and Eukaryotes) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With StrainPhlAn, it is possible to perform accurate strain-level microbial profiling. |
Genobioinfo Cluster: How to use |
MetaWRAP | A flexible pipeline for genome-resolved metagenomic data analysis. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Metaxa2 | Improved Identification and Taxonomic Classification of Small and Large Subunit rRNA in Metagenomic Data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MethyLasso | A segmentation approach to analyze DNA methylation patterns and identify differentially methylation regions from whole-genome datasets. |
Genobioinfo Cluster: How to use |
MethylDackel | MethylDackel (formerly named PileOMeth, which was a temporary name derived due to it using a PILEup to extract METHylation metrics) will process a coordinate-sorted and indexed BAM or CRAM file containing some form of BS-seq alignments and extract per-base methylation metrics from them. MethylDackel requires an indexed fasta file containing the reference genome as well. |
Genobioinfo Cluster: How to use |
methylKit | DNA methylation analysis from high-throughput bisulfite sequencing results | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MethylScore | Identification of differentially methylated regions between multiple epigenomes from BS-treated read mappings via methylated region calling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mgatk | A mitochondrial genome analysis toolkit. |
Genobioinfo Cluster: How to use |
MGSE | MGSE can harness the power of files generated in genome sequencing projects to predict the genome size. Required are the FASTA file containing a high continuity assembly and a BAM file with all available reads mapped to this assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Migrate | Migrate estimates effective population sizes,past migration rates between n population assuming a migration matrix model with asymmetric migration rates and different subpopulation sizes, and population divergences or admixture. | Genobioinfo Cluster: How to use |
MinCED | MinCED is a program to find Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) in full genomes or environmental datasets such as metagenomes, in which sequence size can be anywhere from 100 to 800 bp. MinCED runs from the command-line and was derived from CRT |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Miniasm | Ultrafast de novo assembly for long noisy reads (though having no consensus step). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Minigraph | Minigraph is a sequence-to-graph mapper and graph constructor. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Minimac4 | Minimac4 is a latest version in the series of genotype imputation software - preceded by Minimac3 (2015), Minimac2 (2014), minimac (2012) and MaCH (2010). Minimac4 is a lower memory and more computationally efficient implementation of the original algorithms with comparable imputation quality. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Minimap | Experimental tool to find approximate mapping positions between long sequences |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Minipolish | A tool for Racon polishing of miniasm assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Miniprot | Aligning proteins to genomes with splicing and frameshift. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiniScrub | MiniScrub is a de novo long sequencing read preprocessing method that improves read quality by predicting and removing ("scrubbing") read segments that have a high concentration of errors. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MIPgen | Use MIPgen to design custom mip panels for target enrichment of moderate to high complexity DNA targets ranging from 120 to 250bp in size. |
Genobioinfo Cluster: How to use |
MIRA | Whole genome shotgun and EST sequence assembler for Sanger, 454, and Solexa / Illumina. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
miRDeep2 | miRDeep2 is a software package for identification of novel and known miRNAs in deep sequencing data. Furthermore, it can be used for miRNA expression profiling across samples. Last, a new module for preprocessing of raw Illumina sequencing data produces files for downstream analysis with the miRDeep2 or quantifier module. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiRfold | MiRfold searches for a good miRNA-like folding in the sequence surrounding a putative miRNA. It was optimized on plant miRNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MITGARD | MITGARD (Mitochondrial Genome Assembly from RNA-seq Data) is a computational tool designed to recover the mitochondrial genome from RNA-seq data of any Eukaryote species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MITObim | The MITObim procedure (mitochondrial baiting and iterative mapping) represents a highly efficient approach to assembling novel mitochondrial genomes of non-model organisms directly from total genomic DNA derived NGS reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MitoHPC | MitoHPC : Mitochondrial High Performance Caller. For Calling Mitochondrial Homoplasmies and Heteroplasmies. |
Genobioinfo Cluster: How to use |
MitoSeek | MitoSeek is an open-source software tool to reliably and easily extract mitochondrial genome information from exome sequencing data. MitoSeek evaluates mitochondrial genome alignment quality, estimates relative mitochondrial copy numbers, and detects heteroplasmy, somatic mutation, and structural variance of the mitochondrial genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MitoZ | MitoZ is a Python3-based toolkit which aims to automatically filter pair-end raw data (fastq files), assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome (genbank file as result), and mitogenome visualization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MLST | Multi-Locus sequence Typing. The method enables investigators to determine the ST based on WGS data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mmannot | mmannot annotates reads, or quantifies the features. For instance, suppose that you have sequenced your organism of interest with sRNA-Seq (RNA-Seq works too), and you want to know how many times you have sequenced miRNAs, rRNAs, tRNAs, etc. This is what mmannot does. A huge proportion of the reads may actually map at several locations. These multi-mapping reads are usually handled poorly by similar quantification tools. In our methods, when a read maps at several locations, all these locations are inspected: If all these locations belong to the same feature (e.g. miRNAs, in case of a duplicated gene family), the read is still annotated as a miRNA. If the location belong to different features (e.g. 3'UTR and miRNA), the read is ambiguous, and is flagged as 3'UTR--miRNA. In case 1, we say when have rescued a read. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mmquant | This tool counts the number of reads (produced by RNA-Seq) per gene, much like HTSeq-count and featureCounts. The main difference with other tools is that multi-mapping reads are counted differently: if a read is mapped to gene A, gene B, and gene C, the tool will create a new feature, "geneA--geneB--geneC", that will be counted once. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MMseqs2 | MMseqs2 (Many-against-Many sequence searching) is a software suite to search and cluster huge proteins/nucleotide sequence sets. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MobileElementFinder | MobileElementFinder is a tool for identifying Mobile Genetic Elements (MGEs) in Whole Genome Shotgun sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Mobster | Mobster is used to detect novel (non-reference) Mobile Element Insertion (MEI) events in BAM files and uses both a discordant read pair method and a split-read method. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
modbam2bed | A program to aggregate modified base counts stored in a modified-base BAM file to a bedMethyl file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ModelTest-NG | ModelTest-NG is a tool for selecting the best-fit model of evolution for DNA and protein alignments. ModelTest-NG supersedes jModelTest and ProtTest in one single tool, with graphical and command console interfaces. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Modkit | A bioinformatics tool for working with modified bases from Oxford Nanopore. Specifically for converting modBAM to bedMethyl files using best practices, but also manipulating modBAM files and generating summary statistics. |
Genobioinfo Cluster: How to use |
MOSAIK | MOSAIK is a reference-guided assembler comprising of two main modular programs |
Genologin Cluster How to use Genobioinfo Cluster: Ask for Install |
mosdepth | Fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing. mosdepth can output: per-base depth about 2x as fast samtools depth--about 25 minutes of CPU time for a 30X genome. mean per-window depth given a window size--as would be used for CNV calling. the mean per-region given a BED file of regions. a distribution of proportion of bases covered at or above a given threshhold for each chromosome and genome-wide. quantized output that merges adjacent bases as long as they fall in the same coverage bins e.g. (10-20) threshold output to indicate how many bases in each region are covered at the given thresholds. when appropriate, the output files are bgzipped and indexed for ease of use. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mothur | The one-stop source for your computational microbial ecology needs. mothur offers the ability to go from raw sequences to the generation of visualization tools to describe alpha and beta diversity. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mPTP | A tool for single-locus species delimitation. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MrBayes | MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mreps | Software for tandem repeat identification in DNA. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ms | A program for generating samples under neutral models. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msBayes | msBayes allows complex and flexible comparative phylogeographic inference. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MSI | MSI was designed for sequencing reads with higher error rates (e.g., as the ones produced by Nanopore's sequencers) but also works with reads with lower error rates (e.g., Illumina). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MSIsensor-pro | MSIsensor-pro is an updated version of msisensor. MSIsensor-pro evaluates Microsatellite Instability (MSI) for cancer patients with next generation sequencing data. |
Genobioinfo Cluster: How to use |
msmc | This software implements MSMC, a method to infer population size and gene flow from multiple genome sequences | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msmc2 | This program implements MSMC2, a method to infer population size history and population separation history from whole genome sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msprime |
Please see the documentation for more details |
Genobioinfo Cluster: How to use |
msums | A program for the efficient computation of a number of population genetics summary statistics. msums can read ms-format data on (nearly) arbitrary numbers of populations. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MTG-Link | MTG-Link is a novel gap-filling tool for draft genome assemblies, dedicated to linked read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mugsy | Mugsy is a multiple whole genome aligner. Mugsy uses Nucmer for pairwise alignment, a custom graph based segmentation procedure for identifying collinear regions, and the segment-based progressive multiple alignment strategy from Seqan::TCoffee. Mugsy accepts draft genomes in the form of multi-FASTA files and does not require a reference genome. Angiuoli SV and Salzberg SL. Mugsy: Fast multiple alignment of closely related whole genomes. Bioinformatics 2011 27(3):334-4 | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MultAlin | Multiple sequence alignment with hierarchical clustering. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MultiQC | Aggregate results from bioinformatics analyses across many samples into a single report. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MUMmer | MUMmer is a package for rapidly aligning entire genomes, whether in complete or draft form. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MuMRescueLite | MuMRescueLite is the software that enable to use the tag sequencies of mapped to multiple loci to the genome, for the expression analysis. At the mapping of short sequence tags of CAGE or ChIP-Seq to the genome, sequence tags that map to multiple genomic loci (multi-mapping tags or MuMs), are routinely omitted from further analysis, leading to experimental bias and reduced coverage. MuMRescueLite probabilistically reincorporates multi-mapping tags into mapped short read data with acceptable computational requirements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MUSCLE | Multiple sequence alignment (nucleic or proteic). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Musket | Musket is a well-established leading next-generation sequencing read error correction algorithm targetting Illumina sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MVP | MVP stands for Multi-choice Viromics Pipeline. It is a simplified pipeline that utilizes a suite of state-of-art tools to easily get from a set of contigs to a vOTU heatmap (and more). |
Genobioinfo Cluster: How to use |
MyCC | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MZmine | MZmine is an open-source software for mass-spectrometry data processing. |
Genobioinfo Cluster: How to use |
NAMD | NAMD, recipient of a 2002 Gordon Bell Award and a 2012 Sidney Fernbach Award, is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Nano-Q | Python script for conservatively cleaning ONT reads from bam files and estimate variant frequencies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoCaller | NanoCaller is a computational method that integrates long reads in deep convolutional neural network for the detection of SNPs/indels from long-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoComp | Compare multiple runs of long read sequencing data and alignments. |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: in Python-3.11.1 |
NanoCount | NanoCount estimates transcripts abundance from Oxford Nanopore direct-RNA sequencing datasets, using an expectation-maximization approach like RSEM, Kallisto, salmon, etc to handle the uncertainty of multi-mapping reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoFilt | Filtering and trimming of Oxford Nanopore sequencing data |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: Ask for Install |
NanoLyse | Remove reads mapping to the lambda phage genome from a fastq file | Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: How to use |
NanoPlot | Plotting tool for Oxford Nanopore sequencing data and alignments. |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: How to use |
Nanopolish | A nanopore consensus algorithm using a signal-level hidden Markov model. Signal-level algorithms for MinION data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoSimm | NanoSim is a fast and scalable read simulator that captures the technology-specific features of ONT data, and allows for adjustments upon improvement of nanopore sequencing technology. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoSPC | NanoSPC is a scalable, portable and cloud compatible pipeline for analyzing Nanopore sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoStat | Create statistic summary of an Oxford Nanopore read dataset | Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: Ask for Install |
NaS | NaS is a hybrid approach developed to take advantage of data generated using MinION device. It combines Illumina and Oxford Nanopore technologies to produce NaS (Nanopore Synthetic-long) reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
natsort | Simple yet flexible natural sorting in Python | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NCBI_Blast | Similarity search against databanks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_Blast+ | Similarity search against databanks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_datasets | NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. You can use it to find and download sequence, annotation, and metadata for genes and genomes using our command-line interface (CLI) tools or NCBI Datasets web interface. |
Genobioinfo Cluster: How to use |
NCBI_tools | NCBI portable software toolkit |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_tools++ | NCBI C++ Toolkit provides free, portable, public domain libraries. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NECAT | NECAT is an error correction and de-novo assembly tool for Nanopore long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NetLogo | NetLogo is a multi-agent programmable modeling environment. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Neural-ADMIXTURE | Neural ADMIXTURE is an unsupervised global ancestry inference technique based on ADMIXTURE. By using neural networks, Neural ADMIXTURE offers high quality ancestry assignments with a running time which is much faster than ADMIXTURE's. |
Genobioinfo Cluster: How to use |
Newbler | Newbler is a software package for de novo DNA sequence assembly. It is designed specifically for assembling sequence data generated by the 454 GS-series of pyrosequencing platforms sold by 454 Life Science, a Roche diagnostic. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Newick_Utilities | The Newick Utilities are a suite of Unix shell tools for processing phylogenetic trees. We distribute the package under the BSD License. Functions include re-rooting, extracting subtrees, trimming, pruning, condensing, drawing (ASCII graphics or SVG). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NextCloudcmd | A command line client that can be used to synchronize Nextcloud files to client machines. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NextDenovo | NextDenovo is a string graph-based de novo assembler for TGS long reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Nextflow | Nextflow enables scalable and reproducible scientific workflows using software containers. It allows the adaptation of pipelines written in the most common scripting languages. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NextPolish | NextPolish is used to fix base errors (SNV/Indel) in the genome generated by noisy long reads, it can be used with short read data only or long read data only or a combination of both. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nf-core workflows | This module provide access to workflows nf-core, there are automatically downloaded into your home. More info at nf-core/config page.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NGMLR | NGMLR is a long-read mapper designed to align PacBio or Oxford Nanopore (standard and ultra-long) to a reference genome with a focus on reads that span structural variations SV detection from paired end reads mapping |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ngsLD | ngsLD is a program to estimate pairwise linkage disequilibrium (LD) taking the uncertainty of genotype's assignation into account. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ngsRelate | NgsRelate can be used to infer relatedness coefficients for pairs of individuals from low coverage Next Generation Sequencing (NGS) data by using genotype likelihoods instead of called genotypes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ngsTools | ngsTools is a collection of programs for population genetics analyses from NGS data, taking into account its statistical uncertainty. The methods implemented in these programs do not rely on SNP or genotype calling, and are particularly suitable for low sequencing depth data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ngsutils | Tools for next-generation sequencing analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NINJA | Nearly Infinite Neighbor Joining Application | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NLR-Annotator | Disease resistance genes encoding nucleotide-binding and leucine-rich repeat (NLR) intracellular immune receptor proteins detect pathogens by the presence of pathogen effectors. Although developed for wheat, we demonstrate the universal applicability of NLR-Annotator across diverse plant taxa. NLR-Annotator is a tool to annotate loci associated with NLRs in large sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NLRtracker | NLRtracker extracts and functionally annotates NLRs from protein or transcript files based on the core features found in the RefPlantNLR dataset. |
Genobioinfo Cluster: How to use |
non-B_gfa | gfa programs for Non-B site at NCI/FNLCR. gfa is a Suite of programs developed at NCI-Frederick/Frederick National Lab to find sequences associated with non-B DNA forming motifs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NOVOPlasty | NOVOPlasty is a de novo assembler and variance caller for short circular genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nQuire | A statistical framework for ploidy estimation using NGS short-read data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NSEG | NSEG is used to mask nucleic acid sequences, needed by RepeatScout. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ntEdit | ntEdit is a fast and scalable genomics application for polishing genome assembly drafts. It simplifies polishing and "haploidization" of gene and genome sequences with its re-usable Bloom filter design. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ntHits | ntHits is a method for identifying reapeats in high-throughput DNA sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ntJoin | Scaffolding draft assemblies using reference assemblies and minimizer graphs | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
numpy | NumPy is a package needed for scientific computing with Python. | Genologin Cluster: In Python (all versions) Genobioinfo Cluster: Ask for Install |
Oases | Oases is a de novo transcriptome assembler designed to produce transcripts from short read sequencing technologies, such as Illumina, SOLiD, or 454 in the absence of any genomic assembly. It was developed by Marcel Schulz (MPI for Molecular Genomics) and Daniel Zerbino (previously at the European Bioinformatics Institute (EMBL-EBI), now at UC Santa Cruz). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OBITools | OBITools is a set of python programs developed to simplify the manipulation of sequence files in our labs. They were mainly designed to help us for analyzing Next Generation Sequencer outputs (454 or Illumina) in the context of DNA Metabarcoding. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OMA | The OMA (Orthologous MAtrix) database is a well-established resource for identifying orthologs among publicly available complete genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OMArk | OMArk is a software for proteome (protein-coding gene repertoire) quality assessment. |
Genobioinfo Cluster: How to use |
OMBlast | An alignment tool for optical mapping data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
omegaplus | A scalable tool for rapid detection of selective sweeps in whole-genome datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Onecodetofindthemall | One code to find them all is a set of perl scripts to extract useful information from RepeatMasker about transposable elements, retrieve their sequences and get some quantitative information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ont_fast5_api | ont_fast5_api is a simple interface to HDF5 files of the Oxford Nanopore fast5 file format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ORA | Bio::ORA is a featherweight object for identifying mammalian olfactory receptor genes. The sequences should not be longer than 40kb. The returned array include location, sequence and statistic for the putative olfactory receptor gene. Fully functional with DNA and EST sequence, no intron supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ORFfinder | ORFfinder searches for open reading frames (ORFs) in the DNA sequence you enter. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ORG.asm | The ORGanelle ASseMbler aims to target the assembling of small sequences over-represented in a whole genome shotgun sequence dataset. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Organelle_PBA | OrganelleRef_PBA is a script to perform a de-novo PacBio assemblies of any organelle (chloroplast or mitochondrial genomes) using several programs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
orthAgogue | a tool for high speed estimation of homology relations within and between species in massive data sets. orthAgogue is easy to use and offers flexibility through a range of optional parameters. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OrthoFinder | OrthoFinder is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers rooted gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OrthoLoger | Standalone pipeline for delineation of orthologs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OrthoMCL | OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pacasus | Tool for detecting and cleaning PacBio / Nanopore long reads after whole genome amplification. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pairtools | pairtools is a simple and fast command-line framework to process sequencing data from a Hi-C experiment. |
Genobioinfo Cluster: How to use |
PALEOMIX | The PALEOMIX pipeline is a set of free and open-source pipelines and tools designed to enable the rapid processing of Next Generation Sequencing (NGS) data, starting from de-multiplexed reads from one or more samples, through sequence processing and alignment, and ending with genotyping, phylogenetic inference on the samples, as well as metagenomic analysis of the samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PAML | PAML is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
panacus | Genobioinfo Cluster: How to use | |
Pandoc | Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanGenie | A short-read genotyper for various types of genetic variants (such as SNPs, indels and structural variants) represented in a pangenome graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanTools | PanTools is a toolkit for comparative analysis of large number of genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Paragraph | Graph realignment tools for structural variants. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
parallel | GNU parallel is a shell tool for executing jobs in parallel using one or more computers. |
Genologin Cluster: How to use Genobioinfo Cluster: default system |
parallel-fastq-dump | NCBI fastq-dump can be very slow sometimes, even if you have the resources (network, IO, CPU) to go faster, even if you already downloaded the sra file (see the protip below). This tool speeds up the process by dividing the work into multiple threads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ParGenes | A massively parallel tool for model selection and tree inference on thousands of genes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
parseRM | Few scripts facilitating the extraction of info from Repeat Masker .out files | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PartitionFinder | PartitionFinder is free open source software to select best-fit partitioning schemes and models of molecular evolution for phylogenetic analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PASApipeline | PASA, acronym for Program to Assemble Spliced Alignments, is a eukaryotic genome annotation tool that exploits spliced alignments of expressed transcript sequences to automatically model gene structures, and to maintain gene structure annotation consistent with the most recently available experimental sequence data. PASA also identifies and classifies all splicing variations supported by the transcript alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PASTA | PASTA estimates alignments and ML trees from unaligned sequences using an iterative approach. In each iteration, it first estimates a multiple sequence alignment using the current tree as a guide and then estimates an ML tree on (a masked version of) the alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PathoFact | PathoFact is an easy-to-use modular pipeline for the metagenomic analyses of toxins, virulence factors and antimicrobial resistance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pathPhynder | A workflow for integrating ancient lineages into present-day phylogenies. |
Genobioinfo Cluster: How to use |
PAUP | Tools for inferring and interpreting phylogenetic trees |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pb-assembly | PacBio® tools : everything needed to run Falcon and Unzip | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pblat | Parallelized blat with multi-threads support. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pbtk | PacBio BAM toolkit |
Genobioinfo Cluster: How to use |
PCAdmix | PCAdmix is a method that estimates local ancestry via principal components analysis (PCA) using phased haplotypes. The method considers data chromosome by chromosome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PCAngsd | Framework for analyzing low depth next-generation sequencing (NGS) data in heterogeneous populations using principal component analysis (PCA). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PEAR | PEAR is an ultrafast, memory-efficient and highly accurate pair-end read merger. It is fully parallelized and can run with as low as just a few kilobytes of memory. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PECAT | PECAT is a phased error correction and assembly tool for long reads. It includes a haplotype-aware correction method and an efficient diploid assembly method. |
Genobioinfo Cluster: How to use |
Pelican | Pelican is a reimplementation of the model of Tamuri et al. (2009) to identify sites undergoing different kinds of directional selection in different parts of a phylogenetic tree. |
Genobioinfo Cluster: How to use |
Peregrine | Peregrine is a fast genome assembler for accurate long reads (length > 10kb, accuraccy > 99%). It can assemble a human genome from 30x reads within 20 cpu hours from reads to polished consensus. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PETfold | PETfold performs Probabilistic Evolutionary and Thermodynamic folding of a multiple alignment of RNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pftools3 | The pftools package contains all the software necessary to build protein and DNA generalized profiles and use them to scan and align sequences, and search databases | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PGAP | The NCBI Prokaryotic Genome Annotation Pipeline is designed to annotate bacterial and archaeal genomes (chromosomes and plasmids). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PGDSpider | PGDSpider is a powerful automated data conversion tool for population genetic and genomics programs. It facilitates the data exchange possibilities between programs for a vast range of data types (e.g. DNA, RNA, NGS, microsatellite, SNP, RFLP, AFLP, multi-allelic data, allele frequency or genetic distances) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pggb | Pangenome graph builder. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PHASE | PHASE is a package that performs molecular phylogenetic inference. The software seeks to accurately compare molecular sequences to determine the likely evolutionary relationships between a group of species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
phasebook | phasebook is a novel approach for reconstructing the haplotypes of diploid genomes from long reads de novo, that is without the need for a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PhaseTank | To systemically characterize phasiRNAs/tasiRNAs and their regulatory cascades 'miRNA/phasiRNA -> | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PHASTER_scripts | Small utility scripts to query the PHASTER API endpoint, to identify and annotate prophage sequences within bacterial genomes and plasmids. |
Genobioinfo Cluster: How to use |
PhiPack | The Phi Test is a simple, rapid, and statistically efficient test for recombination. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PhiSpy | PhiSpy identifies prophages in Bacterial (and probably Archaeal) genomes. Given an annotated genome it will use several approaches to identify the most likely prophage regions. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Phobius | A combined transmembrane topology and signal peptide predictor. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Phy-Mer | A novel alignment-free and reference-independent mitochondrial haplogroup classifier. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PhyKIT | PhyKIT is a UNIX shell toolkit for processing and analyzing phylogenomic data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PHYLIP | PHYLIP (PHYLogeny Inference Package), is a package composed by 34 programs dedicated to phylogeny inference. Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites and fragments, distance matrices, and discrete characters.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhyloBayes | PhyloBayes is a Bayesian Monte Carlo Markov Chain (MCMC) sampler for phylogenetic reconstruction and molecular dating using protein and nucleic acid alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Phylobayes_MPI | PhyloBayes (Lartillot et al, 2009) is a Bayesian Monte Carlo Markov Chain (MCMC) sampler for phylogenetic reconstruction. With MPI. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
phyloFlash | phyloFlash is a pipeline to rapidly reconstruct the SSU rRNAs and explore phylogenetic composition of an illumina (meta)genomic dataset. |
Genobioinfo Cluster: How to use |
PhyloPhlAn | PhyloPhlAn is a computational pipeline for reconstructing highly accurate and resolved phylogenetic trees based on whole-genome sequence information. The pipeline is scalable to thousands of genomes and uses the most conserved 400 proteins for extracting the phylogenetic signal. PhyloPhlAn also implements taxonomic curation, estimation, and insertion operations. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
phyluce | phyluce (phy-loo-chee) is a software package that was initially developed for analyzing data collected from ultraconserved elements in organismal genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhyML | PhyML is a phylogeny software based on the maximum-likelihood principle. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
phyx | phyx performs phylogenetics analyses on trees and sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
picard-tools | Picard comprises Java-based command-line utilities that manipulate SAM files, and a Java API (SAM-JDK) for creating new programs that read and write SAM files. Both SAM text format and SAM binary (BAM) format are supported. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PICRUSt | PICRUSt (pronounced モpie crustヤ) is a bioinformatics software package designed to predict metagenome functional content from marker gene (e.g., 16S rRNA) surveys and full genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PILER | Genomic repeat analysis software. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PILERCR | PILERCR is public domain software for finding CRISPR repeats. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pilon | Pilon is an automated genome assembly improvement and variant detection tool. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
piMASS | posterior inference via Model Averaging and Subset Selection: performs genome-wide joint analysis of all SNPs in association with a phenotype. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pindel | Pindel can detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PingPongPro | Find ping-pong signatures in piRNA-Seq data like a pro. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pixy | pixy is a command-line tool for painlessly and correctly estimating average nucleotide diversity within (π) and between (dxy) populations from a VCF. | Genobioinfo Cluster: How to use |
Pizzly | A program for detecting gene fusions from RNA-Seq data of cancer samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PlasFlow | Software for prediction of plasmid sequences in metagenomic assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Plasmer | An accurate and sensitive bacterial plasmid identification tool based on deep machine-learning of shared k-mers and genomic features. |
Genobioinfo Cluster: How to use |
PlasmidFinder | The service identifies plasmids in total or partial sequenced isolates of bacteria. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Plass | Plass (Protein-Level ASSembler) is a software to assemble short read sequencing data on a protein level. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
plassembler | plassembler is a program that is designed for automated & fast assembly of plasmids in bacterial genomes that have been hybrid sequenced with long read & paired-end short read sequencing. |
Genobioinfo Cluster: How to use |
PLAST | PLAST is a fast, accurate and NGS scalable bank-to-bank sequence similarity search tool providing significant accelerations of seeds-based heuristic comparison methods, such as the Blast suite of algorithms. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Platanus | Platanus is a novel de novo sequence assembler that can reconstruct genomic sequences of highly heterozygous diploids from massively parallel shotgun sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Platanus_trim | Platanus_trim is a tool for trimming adaptor sequences and low quality regions. In contrast, Platanus_internal_trim is a tool for trimming internal adaptor sequence, adaptor sequences, and low quality regions. Platanus_trim is designed for paired-end library and Platanus_internal_trim is for mate-pair library. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Platanus2 | Platanus-allee (Platanus2) is a de novo haplotype assembler (phasing tool), which assembles each haplotype sequence in a diploid genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PLINK | PLINK is a free, open-source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ploidyNGS | A model-free, open source tool to visualize and explore ploidy levels in a newly sequenced genome, exploiting short read data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
plotsr | Tool to plot synteny and structural rearrangements between genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PMERGE | PMERGE, is a software, which implements a new method that identifies candidate PSVs by building networks of loci that share high levels of nucleotide similarity. The PMERGE is embedded in the analysis pipeline of the widely used Stacks software, and it is straightforward to apply it as an additional filter in population-genomic studies using RAD-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Polypolish | Polypolish is a tool for polishing genome assemblies with short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pomoxis | Pomoxis comprises a set of basic bioinformatic tools tailored to nanopore sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pong | pong is a freely available software package, released by Behr et al. (2016, Bioinformatics), for post-processing output from clustering inference using population genetic data. |
Genologin Cluster: In Python-2.7.15 Genobioinfo Cluster: Ask for Install |
popins | Population-scale detection of novel-sequence insertions. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PopIns2 | Population-scale detection of non-reference sequence variants using colored de Bruijn Graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PoPoolation | PoPoolation is a pipeline for analysing pooled next generation sequencing data. Currently PoPoolation allows to calculate Tajima’s Pi, Watterson’s Theta and Tajima’s D with a sliding window approach for chromosomes or for set of genes. |
Genobioinfo Cluster: How to use |
PoPoolation2 | PoPoolation2 allows to compare allele frequencies for SNPs between two or more populations and to identify significant differences. PoPoolation2 requires next generation sequencing data of pooled genomic DNA (Pool-Seq). It may be used for measuring differentiation between populations, for genome wide association studies and for experimental evolution. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
popPhylABC | Scripts used for ABC analysis with homo- and heterogeneity in Migration rates or/and Effective population sizes | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
POPS | The POPS program performs inference of ancestry distribution models. |
Genobioinfo Cluster: How to use |
Porechop | Porechop is a tool for finding and removing adapters from Oxford Nanopore reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Porechop_ABI | Porechop_abi (ab initio) is an extension of Porechop that is able to infer the adapter sequence from the Oxford Nanopore reads. It discovers the adapter sequence from the reads using approximate k-mers and assembly, and add the sequence found to the adapter list (adapters.py file). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Portcullis | Splice junction analysis and filtering from BAM files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pp-popularity-contest | The pp-popularity-contest package sets up a cron job that periodically submits the developers anonymous statistics on the usage of Rost Lab prediction methods installed on this system. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PRANK | PRANK is a probabilistic multiple alignment program for DNA, codon and amino-acid sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PredGPI | Prediction of glycosylphosphatidylinositol-anchors in proteins based on HMMs and SVMs. |
Genobioinfo Cluster: How to use |
preseq | Software for predicting library complexity and genome coverage in high-throughput sequencing. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PretextMap | Paired REad TEXTure Mapper. Converts SAM formatted read pairs into genome contact maps. Full suite of Pretext tools. |
Genobioinfo Cluster: How to use |
PretextView | OpenGL Powered Pretext Contact Map Viewer. |
Genobioinfo Cluster: How to use |
Primer3 | Primer3 is a widely used program for designing PCR primers (PCR = "Polymerase Chain Reaction"). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PRINSEQ | PRINSEQ is a tool that generates summary statistics of sequence and quality data and that is used to filter, reformat and trim next-generation sequence data. The standalone version is primarily designed for data preprocessing and does not generate summary statistics in graphical form. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Prodigal | Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a microbial (bacterial and archaeal) gene finding program developed at Oak Ridge National Laboratory and the University of Tennessee. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ProgressiveCactus | Progressive Cactus is a whole-genome alignment package. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PROJ4 | Cartographic Projections Library | Genologin Cluster: /tools/libraries/PROJ/ Genobioinfo Cluster: Ask for Install |
PROKKA | Prokka is a software tool for the rapid annotation of prokaryotic genomes. A typical 4 Mbp genome can be fully annotated in less than 10 minutes on a quad-core computer, and scales well to 32 core SMP systems. It produces GFF3, GBK and SQN files that are ready for editing in Sequin and ultimately submitted to Genbank/DDJB/ENA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PromoTech | Promotech is a machine-learning-based classifier trained to generate a model that generalizes and detects promoters in a wide range of bacterial species. | Genobioinfo Cluster: How to use |
Prost | Prost! (PRocessing Of Short Transcripts) can analyze smallRNA sequencing data generated on any sequencing platform. Prost! does not rely on existing annotation to filter sequencing reads but instead starts by aligning all the reads on a user-provided genomic reference, allowing the study of miRNAs in any species. Additionally, any number of samples can be studied together in a single Prost! run, allowing an accurate analysis of an entire dataset. After grouping the processed reads by genomic location, Prost! then annotates them using a user-defined annotation database (public or personal annotation database). |
Genobioinfo Cluster: How to use |
Proteinortho | Proteinortho is a tool to detect orthologous genes within different species. |
Genobioinfo Cluster: How to use |
proteinortho_curves | Draw pan- and core-genome curves from proteinortho output |
Genobioinfo Cluster: How to use |
PROVEAN | PROVEAN (Protein Variation Effect Analyzer) is a software tool which predicts whether an amino acid substitution or indel has an impact on the biological function of a protein. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PSAURON | PSAURON is a machine learning model for rapid assessment of protein coding gene annotation. |
Genobioinfo Cluster: How to use |
Pseudofinder | Detection of pseudogene candidates in bacterial and archaeal genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PSGInfer | PSGInfer is a software package for the analysis of RNA-Seq data with probabilistic splice graph (PSG) models of gene alternative processing (splicing, transcription initiation, and polyadenylation) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PSI-Sigma | Percent Spliced-In (PSI) values are commonly used to report alternative pre-mRNA splicing (AS) changes. However, previous PSI-detection methods are limited to specific types of AS events. PSI-Sigma is using a new splicing index (PSIΣ) that is more flexible, can incoporate novel junctions, and can compute PSI values of individual exons in complex splicing events. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
psmc | This software package infers population size history from a diploid sequence |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Puffaligner | Puffaligner is a fast, sensitive and accurate aligner built on top of the Pufferfish index. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Purge_Dups | purge_dups is designed to remove haplotigs and contig overlaps in a de novo assembly based on read depth. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Purge_Haplotigs | Pipeline to help with curating heterozygous diploid genome assemblies (for instance when assembling using FALCON or FALCON-unzip). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pybam | Very simple, pure python, BAM file reader. If you do not need to use BAM indexes, pybam is probably the fastest and simplest BAM parser out there, particularly if run under PyPy. |
Genobioinfo Cluster: How to use |
PyCharm | PyCharm is a dedicated Python and Django IDE providing a wide range of essential tools for Python developers, tightly integrated together to create a convenient environment for productive Python development and Web development. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pycoQC | pycoQC computes metrics and generates Interactive QC plots from the sequencing summary report generated by Oxford Nanopore technologies basecaller (Albacore/Guppy) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pypolca | pypolca is a Standalone Python re-implementation of the POLCA polisher from the MaSuRCA genome assembly and analysis toolkit. |
Genobioinfo Cluster: How to use |
PyPy | A fast, compliant alternative implementation of Python. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pyrho | Fast demography-aware inference of fine-scale recombination rates based on fused-LASSO. |
Genobioinfo Cluster: How to use |
PyroCleaner | PyroCleaner is intended to clean reads coming from pyrosequencing in order to ease the assembly process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pysam | Pysam is a python module for reading and manipulating Samfiles. It's a lightweight wrapper of the samtools C-API. | Genologin Cluster: In Python module Genobioinfo Cluster: Ask for Install |
PySlurm | This module provides a low-level Python wrapper around the Slurm C-API using Cython. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
qgrs-cpp | C++ implementation of QGRS mapping algorithm (QGRS Mapper is a software program that generates information on composition and distribution of putative Quadruplex forming G-Rich Sequences (QGRS) in nucleotide sequences.) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
QIIME | QIIME (pronounced "chime") stands for Quantitative Insights Into ttMicrobial Ecology. QIIME is an open source software package for ttcomparison and analysis of microbial communities, primarily based on tthigh-throughput amplicon sequencing data (such as SSU rRNA) generated tton a variety of platforms, but also supporting analysis of other types ttof data (such as shotgun metagenomic data). QIIME takes users from tttheir raw sequencing output through initial analyses such as OTU ttpicking, taxonomic assignment, and construction of phylogenetic trees ttfrom representative sequences of OTUs, and through downstream ttstatistical analysis, visualization, and production of ttpublication-quality graphics. QIIME has been applied to single studies ttbased on billions of sequences from thousands of samples. ttttt |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
QmRLFS-finder | QmRLFS-finder, the first R-loop finding tool which uses (unsupervised) QmRLFS (Quantitative Models of RLFS) models to predict RLFSs. This command line tool generates locations and detailed information of RLFSs as well as standards-compliant output files for further analysis and visualization. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
qpWrapper | Tools allowing to launch qpAdmn analyzes (Admixtools) in series on a list of individuals. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Quake | t Quake is a package to correct substitution sequencing errors in experiments with deep coverage (e.g. >15X), specifically intended for Illumina sequencing reads. Quake adopts the k-mer error correction framework, first introduced by the EULER genome assembly package. Unlike EULER and similar progams, Quake utilizes a robust mixture model of erroneous and genuine k-mer distributions to determine where errors are located. Then Quake uses read quality values and learns the nucleotide to nucleotide error rates to determine what types of errors are most likely. This leads to more corrections and greater accuracy, especially with respect to avoiding mis-corrections, which create false sequence unsimilar to anything in the original genome sequence from which the read was taken.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Qualimap | Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
quarTeT | quarTeT is a collection of tools for T2T genome assembly and basic analysis in automatic workflow. |
Genobioinfo Cluster: How to use |
Quarto | Quarto is a software that compiles a markdown code to html, pdf, or many other formats. It is a successor of pandoc. |
Genobioinfo Cluster: How to use |
QUAST | QUAST evaluates genome assemblies by computing various metrics |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
quickLD | High-performance Computation of Linkage Disequilibrium on CPUs and GPUs. |
Genobioinfo Cluster: How to use |
quickmerge | A simple and fast metassembler and assembly gap filler designed for long molecule based assemblies | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Quorum | QuorUM (Quality Optimized Reads from the University of Maryland) is an error corrector for Illumina reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
R | R is "GNU S", a freely available language and environment for statistical computing and graphics which provides a wide variety of statistical and graphical techniques: linear and nonlinear modelling, statistical tests, time series analysis, classification, clustering, etc. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
R-scape | R-scape looks for evidence of a conserved RNA structure by measuring pairwise covariations observed in an input multiple sequence alignment. It analyzes all possible pairs, including those in your proposed structure (if you provide one). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
r8s | This package implements several methods to infer divergence times on a molecular phylogeny, using penalized likelihood, maximum likelihood and nonparametric rate smoothing methods. It also implements miscellaneous tree and character evolution models and tests. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ra | Ra is as a fast and easy to use assembler for raw reads generated by third generation sequencing. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Rabaler | Rebaler is a program for conducting reference-based assemblies using long reads. It relies mainly on minimap2 for alignment and Racon for making consensus sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RabbitUniq | Compute unique k-mer faster. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RabbitV | RabbitV is a highly optimized and practical toolkit for the detection of viruses and microorganisms in sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Racon | Consensus module for raw de novo DNA assembly of long uncorrected reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RADIS | Analysis of RAD-seq data for InterSpecific phylogeny |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RAFT | RAFT (Repeat Aware Fragmentation Tool) is an algorithm designed to improve assembly quality by rescuing contained reads. RAFT breaks long reads into smaller sub-reads by following an algorithm described in our preprint. The read fragmentation allows an OLC assembler to retain contained reads during string graph construction. When input reads have non-uniform lengths, retaining contained reads improves assembly contiguity and base-level accuracy. The inputs to RAFT include an error-corrected read file in FASTA/FASTQ format and an all-vs-all alignment file in PAF format. It performs read fragmentation and outputs the fragmented reads in FASTA format. We recommend users to use hifiasm for the initial steps (read error correction, all-vs-all overlap computation) and also for the final step (assembly of fragmented reads). The assembly output format of hifiasm is described here. The RAFT-hifiasm workflow is recommended for long accurate reads with non-uniform length distribution (e.g., ONT Duplex, or a mixture of ONT Duplex and HiFi reads). ONT UL reads can optionally be integrated during the final assembly step. |
Genobioinfo Cluster: How to use |
RaGOO | A tool to order and orient genome assembly contigs via Minimap2 alignments to a reference genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ragout | Ragout (Reference-Assisted Genome Ordering UTility) is a tool for chromosome assembly using multiple references. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RagTag | RagTag, the successor to RaGOO, is a command line tool for reference-guided genome assembly improvement. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAiSD | RAiSD (Raised Accuracy in Sweep Detection) is a stand-alone software implementation of the μ statistic for selective sweep detection. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAMPART | RAMPART is a configurable pipeline for de novo assembly of DNA sequence data. RAMPART is not a de novo assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ranbow | Ranbow is a haplotype assembler for polyploid genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
randfold | The software compute the probability that, for a given RNA sequence, the Minimum Free Energy (MFE) of the secondary structure is different from a distribution of MFE computed with random sequences.. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
rastair | Fast processing of TET-assisted pyridine borane sequencing (TAPS)-based sequencing data. |
Genobioinfo Cluster: How to use |
Ratatosk | Ratatosk is a phased error correction tool for erroneous long reads based on compacted and colored de Bruijn graphs built from accurate short reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RATT | RATT is software to transfer annotation from a reference (annotated) genome to an unannotated query genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RAxML | RAxML (Randomized Axelerated Maximum Likelihood) is a program for sequential and parallel Maximum Likelihood based inference of large phylogenetic trees. It can also be used for postanalyses of sets of phylogenetic trees, analyses of alignments and, evolutionary placement of short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAxML-NG | RAxML-NG is a phylogenetic tree inference tool which uses maximum-likelihood (ML) optimality criterion. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ray | Assemble genomes in parallel using the message-passing interface | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RDP Classifier | The RDP Classifier is a naive Bayesian classifier that can rapidly and accurately provides taxonomic assignments from domain to genus, with confidence estimates for each assignment.
|
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RDPTools | Collection of commonly used RDP Tools for easy building | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RDXplorer | The RDXplorer (Read Depth eXplorer) is a computational tool for copy number variants (CNV) detection in whole human genome sequence data using read depth (RD) coverage. |
Genobioinfo Cluster: How to use |
READ | READ is a method to infer the degree of relationship (up to second degree, i.e. nephew/niece-uncle/aunt, grandparent-grandchild or half-siblings) for a pair of low-coverage individuals. |
Genobioinfo Cluster: How to use |
READv2 | Relationship Estimation from Ancient DNA version 2. |
Genobioinfo Cluster: How to use |
REAPR | REAPR is a tool that evaluates the accuracy of a genome assembly using mapped paired end reads, without the use of a reference genome for comparison. It can be used in any stage of an assembly pipeline to automatically break incorrect scaffolds and flag other errors in an assembly for manual inspection. It reports mis-assemblies and other warnings, and produces a new broken assembly based on the error calls. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RECON | A package for automated de novo identification of repeat families from genomic sequence. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
REDItools | REDItools are python scripts developed with the aim to study RNA editing at genomic scale by next generation sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Redundans | Redundans is a pipeline that assists an assembly of heterozygous/polymorphic genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REFMAKER | REFMAKER is a command-line and user-friendly pipeline providing different tools to create nuclear references from genomic assemblies of shotgun libraries. |
Genobioinfo Cluster: How to use |
RegTools | RegTools is a set of tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ReLERNN | Recombination Landscape Estimation using Recurrent Neural Networks |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepAHR | RepAHR is used to identify repeats(repetitive sequences) in genome using Next-Generation Sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REPdenovo | REPdenovo is designed for constructing repeats directly from sequence (paired-end) reads. It based on the idea of frequent k-mer assembly. REPdenovo provides many functionalities, and can generate much longer repeats than existing tools. Internally, REPdenovo uses Jellyfish for k-mer counting, Velvet for assembly, and bwa to map reads on the Transposable Elements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RepeatMasker | RepeatMasker is a program that screens DNA sequences for interspersed repeats (thanks to RepBase repeats databanks specially formatted) and low complexity DNA sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepeatModeler | RepeatModeler is a de-novo repeat family identification and modeling package. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepeatScout | RepeatScout is a tool to discover repetitive substrings in DNA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepEnrich2 | RepEnrich2 is an updated method to estimate repetitive element enrichment using high-throughput sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REPET | The REPET package (t Flutre et al, 2011 ) integrates bioinformatics programs in order to tackle biological issues at the genomic scale. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RetroScan | RetroScan is an easy-to-use tool for retrocopy identification that integrates a series of bioinformatics tools (LAST, BEDtools, ClustalW2, KaKs_Calculator, HISAT2, StringTie, SAMtools and Shiny) and scripts. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RetroSeq | RetroSeq is a bioinformatics tool that searches for mobile element insertions from aligned reads in a BAM file and a library of reference transposable elements. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RevBayes | RevBayes provides an interactive environment for statistical computation in phylogenetics. It is primarily intended for modeling, simulation, and Bayesian inference in evolutionary biology, particularly phylogenetics. However, the environment is quite general and can be useful for many complex modeling tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RFMIX | A discriminative method for local ancestry inference |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Rfold | Rfold computes local base pairing probabilities for long DNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RFPlasmid | RFPlasmid predicts plasmid contigs from assemblies using single copy marker genes, plasmid genes, and kmers. |
Genobioinfo Cluster: How to use |
RGI | Software to predict resistomes from protein or nucleotide data, including metagenomics data, based on homology and SNP models. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RiboTaper | RiboTaper is a new analysis pipeline for Ribosome Profiling (Ribo-seq) experiments, which exploits the triplet periodicity of ribosomal footprints to call translated regions. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
rMATS | MATS is a computational tool to detect differential alternative splicing events from RNA-Seq data. From the RNA-Seq data, MATS can automatically detect and analyze alternative splicing events corresponding to all major types of alternative splicing patterns. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
rMATS turbo | rMATS turbo is the C/Cython version of rMATS (refer to http://rnaseq-mats.sourceforge.net) : Multivariate Analysis of Transcript Splicing (MATS). The major difference between rMATS turbo and rMATS is speed and space usage. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RMBlast | RMBlast is a RepeatMasker compatible version of the standard NCBI BLAST suite. The primary difference between this distribution and the NCBI distribution is the addition of a new program "rmblastn" for use with RepeatMasker and RepeatModeler. RMBlast supports RepeatMasker searches by adding a few necessary features to the stock NCBI blastn program. These include: |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RNAmmer | Rnammer predicts 5s/8s, 16s/18s, and 23s/28s ribosomal RNA in tttfull genome sequences. The program uses hidden Markov models trained on data from the 5S ribosomal RNA database and the European ribosomal RNA database project. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RNAscClust | RNAscClust is a pipeline to cluster a set of structured RNAs taking their respective structural conservation into account. The aim of RNAscClust is to aid the discovery of families and classes of ncRNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RNAz | RNAz detects stable and conserved RNA secondary structures in multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Roary | Roary is a high speed stand alone pan genome pipeline, which takes annotated assemblies in GFF3 format (produced by Prokka (Seemann, 2014)) and calculates the pan genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RogueNaRok | A versatile and scalable algorithm for rogue taxon identification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ROHan | ROHan is a Bayesian framework to estimate local rates of heterozygosity, infer runs of homozygosity (ROH) and compute global rates of heterozygosity. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RopeBWT2 | RopeBWT2 is an tool for constructing the FM-index for a collection of DNA sequences. |
Genobioinfo Cluster: How to use |
ROSE | To create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RSEG | The RSEG software package is aimed to analyze ChIP-Seq data, especially for identifying genomic regions and their boundaries marked by diffusive histone modification markers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RSEM | RSEM (RNA-Seq by Expectation-Maximization) is a software package for estimating gene and isoform expression levels from RNA-Seq data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RSeQC | RSeQC package provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ruby | A dynamic, open source programming language. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
rust-mdbg | rust-mdbg is an ultra-fast minimizer-space de Bruijn graph (mdBG) implementation, geared towards the assembly of long and accurate reads such as PacBio HiFi. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
S3V2_IDEAS_ESMP | A package for normalizing, denoising and integrating epigenomic datasets across different cell types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sabre | A barcode demultiplexing and trimming tool for FastQ files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Salmon | Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using lightweight alignments |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SALSA | A tool to scaffold long read assemblies with Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
sambamba | Sambamba is a high performance modern robust and fast tool (and library), written in the D programming language, for working with SAM and BAM files. Current functionality is an important subset of samtools functionality, including view, index, sort, markdup, and depth. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
samblaster | samblaster is a fast and flexible program for marking duplicates in read-id grouped1 paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
samclip | Filter SAM file for soft and hard clipped alignments | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
samtools | SAM (Sequence Alignment/Map). SAM Tools provide various utilities for manipulating alignments in the SAM format, including sorting, merging, indexing and generating alignments in a per-position format. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SARTools | SARTools is a R package dedicated to the differential analysis of RNA-seq data. It provides tools to generate descriptive and diagnostic graphs, to run the differential analysis with one of the well known DESeq2 or edgeR packages and to export the results into easily readable tab-delimited files. |
Genologin Cluster: In R-3.4.3 Genobioinfo Cluster: Ask for Install |
Satsuma | Highly sensitive whole-genome synteny alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Saturn | A tool for assessing the library saturation without any reference genome. . |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sbt | sbt is a build tool for Scala, Java, and more. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Scaff10X | Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Scan For Matches | scan_for_matches is a utility written in C for locating patterns in DNA or protein FASTA files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Scanpy | Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells. |
Genobioinfo Cluster: How to use |
SCCmecFinder | SCCmecFinder identifies SCCmec elements in sequenced S. aureus isolates. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
schmutzi | Bayesian maximum a posteriori contamination estimate for ancient samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
scipy | SciPy (pronounced "Sigh Pie") is open-source software for mathematics, science, and engineering. The SciPy library depends on Numpy, which provides convenient and fast N-dimensional array manipulation. | Genologin Cluster: In Python modules Genobioinfo Cluster: Ask for Install |
Scoary | Scoary is designed to take the gene_presence_absence.csv file from Roary as well as a traits file created by the user and calculate the assocations between all genes in the accessory genome and the traits. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
scrm | A coalescent simulator for genome-scale sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SDA | Segmental Duplication Assembler |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SEACells | Single-cEll Aggregation for High Resolution Cell States. SEACells algorithm for Inference of transcriptional and epigenomic cellular states from single-cell genomics data. |
Genobioinfo Cluster: How to use |
SEDEF | SEDEF is a quick tool to find all segmental duplications in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
segemehl | segemehl is a software to map short sequencer reads to reference genomes. segemehl implements a matching strategy based on enhanced suffix arrays (ESA). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SELFISH | SELFISH is a tool for finding differential chromatin interactions between two Hi-C contact maps. |
Genobioinfo Cluster: How to use |
selscan | A program to calculate EHH-based scans for positive selection in genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seq | Seq is a programming language for computational genomics and bioinformatics. With a Python-compatible syntax and a host of domain-specific features and optimizations, Seq makes writing high-performance genomics software as easy as writing Python code, and achieves performance comparable to (and in many cases better than) C/C++. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seq-Gen | Seq-Gen is a program that will simulate the evolution of nucleotide or amino acid sequences along a phylogeny, using common models of the substitution process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SeqAn | SeqAn is an open source C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
seqclean | SeqClean is a tool for validation and trimming of DNA sequences from a flat file database (FASTA format). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
seqfilter | Filter fasta/fastq(.gz) files by ID and/or sequence length |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SeqFu | A general-purpose program to manipulate and parse information from FASTA/FASTQ files, supporting gzipped input files. Includes functions to interleave and de-interleave FASTQ files, to rename sequences and to count and print statistics on sequence lengths. |
Genobioinfo Cluster: How to use |
SeqKit | A cross-platform and ultrafast toolkit for FASTA/Q file manipulation. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seqtk | Toolkit for processing sequences in FASTA/Q formats |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SequenceTools | Tools for population genetics on sequencing datas |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SGA | SGA is a de novo genome assembler based on the concept of string graphs. The major goal of SGA is to be very memory efficient, which is achieved by using a compressed representation of DNA sequence reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SGSGeneLoss | Gene presence/absence variation discovery. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SHAPEIT | SHAPEIT is a fast and accurate method for estimation of haplotypes (aka phasing) from genotype or sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Shasta | The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
shiver | shiver is a tool for mapping paired-end short reads to a custom reference sequence constructed using do novo assembled contigs, in order to minimise the biased loss of information that occurs from mapping to a reference that differs from the sample. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ShortStack | ShortStack is a tool developed to process and analyze smallRNA-seq data with respect to a reference genome, and output a comprehensive and informative annotation of all discovered small RNA genes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SHRiMP | SHRiMP is a software package for aligning genomic reads against a target genome. It was primarily developed with the multitudinous short reads of next generation sequencing machines in mind, as well as Applied Biosystem's colourspace genomic representation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SibeliaZ | SibeliaZ is a whole-genome alignment and locally-coliinear blocks construction pipeline. The blocks coordinates are output in GFF format and the alignment is in MAF. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SICER2 | Redesigned and improved version of the original ChIP-seq broad peak calling tool SICER. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sickle | Sickle is a tool that uses sliding windows along with quality and length thresholds to determine when quality is sufficiently low to trim the 3'-end of reads and also determines when the quality is sufficiently high enough to trim the 5'-end of reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G | Sorting Intolerant From Tolerant For Genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G_Annotator | Annotating VCF files using the SIFT4G databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SignalP | SignalP 4.0 server predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms: Gram-positive prokaryotes, Gram-negative prokaryotes, and eukaryotes.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Silix | The software package SiLiX implements an ultra-efficient algorithm for the clustering of homologous sequences, based on single transitive links (single linkage) with alignment coverage constraints. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Simka | Simka is a de novo comparative metagenomics tool. Simka represents each dataset as a k-mer spectrum and compute several classical ecological distances between them. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
simuPOP | simuPOP is a general-purpose individual-based forward-time population genetics simulation environment. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
singlem | SingleM is a tool to find the abundances of discrete operational taxonomic units (OTUs) directly from shotgun metagenome data, without heavy reliance on reference sequence databases. It is able to differentiate closely related species even if those species are from lineages new to science. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Singularity | Singularity enables users to have full control of their environment. Singularity containers can be used to package entire scientific workflows, software and libraries, and even data. |
Genologin Cluster: How to use Genobioinfo Cluster: module load containers/singularity/3.9.9 |
sleuth | sleuth is a program for analysis of RNA-Seq experiments for which transcript abundances have been quantified with kallisto. |
Genologin Cluster: in R-3.5.3 (with module load compiler/gcc-7.2.0) Genobioinfo Cluster: Ask for Install |
SLICEMBLER | SLICEMBLER is a meta-assembler designed for ultra-deep sequencing data/ | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLiM | SLiM is an evolutionary simulation framework that combines a powerful engine for population genetic simulations with the capability of modeling arbitrarily complex evolutionary scenarios. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SLR | SLR is a program to detect sites in coding DNA that are unusually conserved and/or unusually variable (that is, evolving under purify or positive selection) by analysing the pattern of changes for an alignment of sequences on an evolutionary tree. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLR-superscaffolder | This is a scaffold assembler designed for stLFR reads. It uses the link-reads information from stLFR reads to assemble contigs to scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SMALT | SMALT aligns DNA sequencing reads with a reference genome. Reads from a wide range of sequencing platforms can be processed, for example Illumina, Roche-454, Ion Torrent, PacBio or ABI-Sanger. Paired reads are supported. There is no support for SOLiD reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
smartdenovo | SMARTdenovo is a de novo assembler for PacBio and Oxford Nanopore (ONT) data. It produces an assembly from all-vs-all raw read alignments without an error correction stage. It also provides tools to generate accurate consensus sequences, though a platform dependent consensus polish tools (e.g. Quiver for PacBio or Nanopolish for ONT) are still required for higher accuracy. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SMC++ | SMC++ is a program for estimating the size history of populations from whole genome sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SMRTLink | SMRT Link is the web-based end-to-end workflow manager for the Sequel™ System. (installed in mode command line on our cluster) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Smudgeplots | Inference of ploidy and heterozygosity structure using whole genome sequencing data. This tool extracts heterozygous kmer pairs from kmer dump files (from jellyfish or KMC) and performs gymnastics with them. We are able to disentangle genome structure by comparing the sum of kmer pair coverages (CovA + CovB) to their relative coverage (CovA / (CovA + CovB)). Smudgeplots are computed from raw/trimmed reads and show the haplotype structure using heterozygous kmer pairs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Snakemake | Snakemake is a workflow management system that aims to reduce the complexity of creating workflows by providing a fast and comfortable execution environment, together with a clean and modern specification language in python style. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
snakePipes | Customizable workflows based on snakemake and python for the analysis of NGS data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SNAP | Gene prediction tool |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
snape-pooled | SNAPE-pooled computes the probability distribution for the frequency of the minor allele in a certain population, at a certain position in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Snippy | Rapid haploid variant calling and core genome alignment. Snippy finds SNPs between a haploid reference genome and your NGS sequence reads. It will find both substitutions (snps) and insertions/deletions (indels). It will use as many CPUs as you can give it on a single computer (tested to 64 cores). It is designed with speed in mind, and produces a consistent set of output files in a single folder. It can then take a set of Snippy results using the same reference and generate a core SNP alignment (and ultimately a phylogenomic tree). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
sNMF | A fast and efficient program for estimating individual admixture coefficients based on sparse non-negative matrix factorization and population genetics. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
snoGPS | Search for H/ACA snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
SnoReport | Computational identification of snoRNAs with unknown targets. Detecting novel or orphan snoRNAs in RNA sequence data using sequence and structure information only without relying on target information |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Snoscan | Search for C/D box methylation guide snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
snpArcher | snpArcher is a reproducible workflow optimized for nonmodel organisms and comparisons across datasets, built on the Snakemake workflow management system, for dataset acquisition, variant calling, quality control, and downstream analysis. |
Genobioinfo Cluster: How to use |
SnpEff | SnpEff is a variant annotation and effect prediction tool. ttttIt annotates and predicts the effects of variants on genes (such as amino acid changes) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SNPGenie | SNPGenie is a collection of Perl scripts for estimating πN/πS, dN/dS, and gene diversity from next-generation sequencing (NGS) single-nucleotide polymorphism (SNP) variant data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SNPhylo | a pipeline to generate a phylogenetic tree from huge SNP data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SNPsplit | SNPsplit is an allele-specific alignment sorter which is designed to read alignment files in SAM/BAM format and determine the allelic origin of reads that cover known SNP positions. |
Genobioinfo Cluster: How to use |
soap.coverage | Can calculate sequencing coverage or physical coverage as well as duplication rate and details of specific block for each segments and whole genome by using SOAP, Blat, Blast, BlastZ, mummer and MAQ aligement results with multi-thread. Gzip file supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SOAPdenovo | ttSOAPdenovo is a novel short-read assembly method that can build a de novo draft assembly for the human-sized genomes. The program is specially designed to assemble Illumina GA short reads. It creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost effective way.
|
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SOAPdenovo-Trans | SOAPdenovo-Trans is a de novo transcriptome assembler basing on the SOAPdenovo framework, adapt to alternative splicing and different expression level among transcripts. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SortaDate | Scripts that you can use at different stages to attempt to find more clock-like genes. Generally, you would use these for dating analyses with another package | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SortMeRNA | SortMeRNA is a software designed to rapidly filter ribosomal RNA fragments from metatransriptomic data produced by next-generation sequencers. It is capable of handling large RNA databases and sorting out all fragments matching to the database with high accuracy and specificity | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
souporcell | souporcell is a method for clustering mixed-genotype scRNAseq experiments by individual. |
Genobioinfo Cluster: How to use |
sourmash | sourmash is a command-line tool and Python library for computing hash sketches from DNA sequences, comparing them to each other, and plotting the results. | Genologin Cluster: In Python-3.7.4 Genobioinfo Cluster: How to use |
SpaceRanger | Space Ranger is a set of analysis pipelines that process Visium spatial RNA-seq output and brightfield and fluorescence microscope images in order to detect tissue, align reads, generate feature-spot matrices, perform clustering and gene expression analysis, and place spots in spatial context on the slide image. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SPAdes | SPAdes ヨ St. Petersburg genome assembler ヨ is intended for both standard isolates and single-cell MDA bacteria assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Spaln | Spaln (space-efficient spliced alignment) is a stand-alone program that maps and aligns a set of cDNA or protein sequences onto a whole genomic sequence in a single job. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
spaTyper | Computational method for finding spa types. Staphylococcus aureus is a major human pathogen causing skin and tissue infections, pneumonia, septicemia, and device-associated infections. The emergence of strains resistant to methicillin (MRSA) and other antibacterial agents has become a major concern, especially in the hospital environment, because of the high mortality of the infections caused by these strains. Single locus DNA-sequencing of the repeat region of the Staphylococcus protein A gene (spa) can be used for reliable, accurate and discriminatory typing of MRSA. Repeats are assigned a numerical code and the spa-type is deduced from the order of specific repeats. However, spa-typing was hampered in the past by the lack of a consensus on assignments of new spa-repeats and -types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SPECTRE | A collection of Phylogenetics tools for creating and manipulating networks and trees. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SpeedSeq | A flexible framework for rapid genome analysis and interpretation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SpliceGrapher | SpliceGrapher predicts alternative splicing patterns and produces splice graphs that capture in a single structure the ways a gene's exons may be assembled. It enhances gene models using evidence from next-generation sequencing and EST alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SpliceTools | A suite of downstream RNA splicing analysis tools to investigate mechanisms and impact of alternative splicing, Nucleic Acids Research, 2023. |
Genobioinfo Cluster: How to use |
Spoa | A multiple sequence alignment tool/library that implements the POA (partial order alignement) algorithm using SIMD. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sprai | Sprai (single-pass read accuracy improver) is a tool to correct sequencing errors in single-pass reads for de novo assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
squeakr | Squeakr is a k-mer-counting and multiset-representation system using the recently-introduced counting quotient filter (CQF) Pandey et al. (2017), a feature-rich approximate membership query (AMQ) data structure. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
squid | A C library that is bundled with much of the above software. C function library for sequence analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SquiggleKit | A toolkit for manipulating nanopore signal data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SQuIRE | SQuIRE reveals locus-specific regulation of interspersed repeat expression, Nucleic Acids Research | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SRAsembler | SRAssembler (Selective Recursive local Assembler) is a modular pipeline program that can assemble genomic DNA reads into contigs that are homologous to a query DNA or protein sequence. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SRAToolkit | Toolkit to query Short Reads Archive at NCBI |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
srnaMapper | This tool maps reads produced by sRNA-Seq to a genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sslHiC | sslHiC is a computational framework for comparative analyses of Hi-C data, including reproducibility measurement and differential chromatin interaction (DCI) detection. |
Genobioinfo Cluster: How to use |
SSPACE | SSPACE standard is a stand-alone program for scaffolding pre-assembled contigs using NGS paired-read data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SSPACE-LongRead | SSPACE-LongRead is a stand-alone program for scaffolding pre-assembled contigs using long reads (e.g. PacBio RS reads). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Stacks | Stacks is a software suite for analysing RAD Sequencing data by Julian Catchen at the University of Oregon. It will process raw Illumina RAD data or RAD data aligned to a reference genome, and produce genotypes that can be viewed and filtered via a web interface. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
stairway_plot | The stairway plot is a method for inferring detailed population demographic history using the site frequency spectrum (SFS) from DNA sequence data. It does not need a pre-defined population model and can be applied to hundreds of unphased sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
STAR | RNA-seq aligner |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
STAR-Fusion | STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set (using a GTF file, ideally the same annotation file used during the STAR genome index building process during the intial STAR setup). |
Genobioinfo Cluster: How to use |
STITCH | STITCH is an R program for reference panel free, read aware, low coverage sequencing genotype imputation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Strelka | Strelka2 is a fast and accurate small variant caller optimized for analysis of germline variation in small cohorts and somatic variation in tumor/normal sample pairs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
StringTie | StringTie is a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts. It uses a novel network flow algorithm as well as an optional de novo assembly step to assemble and quantitate full-length transcripts representing multiple splice variants for each gene locus. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
StrobeAlign | Aligns short reads using dynamic seed size with strobemers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Structure | The program structure is a free software package for using multi-locus genotype data to investigate population structure. Its uses include inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. It can be applied to most of the commonly-used genetic markers, including SNPS, microsatellites, RFLPs and AFLPs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Sturgeon | Sturgeon is a CNS neural network classifier for tumour classification |
Genobioinfo Cluster: How to use |
Subread | A tool kit for processing next-gen sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
subsampler | Small tool to subsample fasta and fastq files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sumaclust | Fast and exact clustering of sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Sumatra | Sumatra was developed by the LECA and aims to compute a great deal of sequence similarities in a fast and exact way, based on the length of the Longest Common Subsequence (LCS) between two sequences. Sequence clustering based on similarities is also available through Sumaclust. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SUPER-FOCUS | A tool for agile functional analysis of metagenomic data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Supernova | Supernova is a software package for de novo assembly from Chromium Linked-Reads that are made from a single whole-genome library from an individual DNA source. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
superstring | Greedy approximation of the shortest common superstring | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SuperTAD | SuperTAD is an open-source command-line TAD detection package written in C++. It takes either raw or normalized Hi-C contact maps as inputs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SUPPA | Fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SURVIVOR | SURVIVOR is a tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SvABA | Structural variation and indel detection by local assembly |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SVanalyzer | Tools for the analysis of structural variation in genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVDetect | A tool to detect genomic structural variations from paired-end and mate-pair sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVIM | SVIM is a structural variant caller for long reads. It is able to detect, classify and genotype five different classes of structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SVIM-asm | SVIM-asm (pronounced SWIM-assem) is a structural variant caller for haploid or diploid genome-genome alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
svimmer | Merges similar SVs from multiple single sample VCF files. The tool was written for merging SVs discovered using Manta calls, but should support (almost) any SV VCFs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi | SVJedi is a structural variation (SV) genotyper for long read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi-graph | SVJedi-graph is a structural variation (SV) genotyper for long read data. |
Genobioinfo Cluster: How to use |
SVMerge | A pipeline to detect structural variants (SVs) by integrating calls from several existing SV callers, which are then validated and the breakpoints refined using local de novo assembly. |
Genobioinfo Cluster: How to use |
svtyper | Bayesian genotyper for structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
swarm | A robust and fast clustering method for amplicon-based studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SweeD | A parallel and checkpointable tool that implements a composite likelihood ratio test for detecting selective sweeps. SweeD is based on the SweepFinder algorithm (Nielsen et al. 2005). SweeD can calculate the theoretical SFS of a given demographic model (stepwise changes or with an exponential growth phase + stepwise changes) by using the method by Živković and Stephan (2011). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SyRI | SyRI is a comprehensive tool for predicting genomic differences between related genomes using whole-genome assemblies (WGA). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
T-Coffee | T-Coffee is a multiple sequence alignment package. You can use T-Coffee to align sequences or to combine the output of your favorite alignment methods (Clustal, Mafft, Probcons, Muscle...) into one unique alignmen. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
T-lex | T-lex is a computational pipeline that detects presence and/or absence of annotated individual transposable elements (TEs) using next-generation sequencing (NGS) data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
tabix | TAB-delimited file IndeXer. Useful for vcfTools. |
Genologin Cluster: How to use Genobioinfo Cluster: in bcftools and samtools |
TACO | Multi-sample transcriptome assembly from RNA-Seq. |
Genobioinfo Cluster: How to use |
TACT | Adds tips to a backbone phylogeny using taxonomy simulated with birth-death models | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tandem Repeats Finder | Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences. A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TAPAS | Tool for Alternative Polyadenylation Site Analysis. |
Genobioinfo Cluster: How to use |
Tapestry | Tapestry is a tool to validate and edit small eukaryotic genome assemblies using long sequence reads. It is designed to help identify complete chromosomes, symbionts, haplotypes, complex features and errors in close-to-complete genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TargetP | TargetP-2.0 server predicts the presence of N-terminal presequences: signal peptide (SP), mitochondrial transit peptide (mTP), chloroplast transit peptide (cTP) or thylakoid luminal transit peptide (lTP). For the sequences predicted to contain an N-terminal presequence a potential cleavage site is also predicted. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TASSEL | Trait Analysis by aSSociation, Evolution and Linkage. TASSEL has multiple functions, including association study, evaluating evolutionary relationships, analysis of linkage disequilibrium, principal component analysis, cluster analysis, missing data imputation and data visualization for large sets of data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TBtools-II | GUI/CommandLine Tool Box for biologistists to utilize NGS data. |
Genobioinfo Cluster: How to use |
TEsorter | It is coded for LTR_retriever to classify long terminal repeat retrotransposons (LTR-RTs) at first. It can also be used to classify any other TE sequences, including Class I and Class II elements which are covered by the REXdb database. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TexLive | TeX Live is intended to be a straightforward way to get up and running with the TeX document production system. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TGICL | This package automates clustering and assembly of a large EST/mRNA dataset. The clustering is performed by a slightly modified version of NCBI's megablast , and the resulting clusters are then assembled using CAP3 assembly program. TGICL starts with a large multi-FASTA file (and an optional peer quality values file) and outputs the assembly files as produced by CAP3. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TGSGapFiller | A gap filling tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tiberius | Tiberius is a deep learning-based ab initio gene structure prediction tool that end-to-end integrates convolutional and long short-term memory layers with a differentiable HMM layer. |
Genobioinfo Cluster: How to use |
Tigmint | Tigmint identifies and corrects misassemblies using linked reads from 10x Genomics Chromium. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TKGWV2 | TKGWV2 is a pipeline to estimate biological relatedness (1st, 2nd, and unrelated degrees) between individuals specifically aimed at ultra-low coverage ancient DNA data obtained from whole genome sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TMHMM | Prediction of transmembrane helices in proteins. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Tn3+TA_finder | Tn3 Transposon/Toxin Finder (Tn3+TA_finder) is a program for the automatic prediction of transposable elements of the Tn3 family associated with type II toxin and antitoxin pairs in bacteria and archaea. |
Genobioinfo Cluster: How to use |
TnComp_finder | Composite Transposon Finder (TnComp_finder) is a program for the prediction of putative composite transposons in bacterial and archaeal genomes based on insertion sequence replicas in a relatively short span. |
Genobioinfo Cluster: How to use |
TOBIAS | TOBIAS is a collection of command-line bioinformatics tools for performing footprinting analysis on ATAC-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TOGA | TOGA is a new method that integrates gene annotation, inferring orthologs and classifying genes as intact or lost. |
Genobioinfo Cluster: How to use |
Tombo | Tombo is a suite of tools primarily for the identification of modified nucleotides from raw nanopore sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TOPALI-v2 | A rich graphical interface for evolutionary analyses of multiple alignments on HPC clusters and multi-core desktops. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tophat | TopHat is a fast splice junction mapper for RNA-Seq reads. It aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
toulbar2 | toulbar2 is an open-source black-box C++ optimizer for cost function networks and discrete additive graphical models. It can read a variety of formats. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TPMCalculator | TPMCalculator quantifies mRNA abundance directly from the alignments by parsing BAM files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Tracy | Tracy is an efficient and versatile command-line application to basecall, align, assemble and deconvolute Sanger Chromatogram trace files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TransDecoder | TransDecoder identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Transposome | Transposome is a command line application to annotate transposable elements from paired-end whole genome shotgun data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
transposon_annotation_tools | A set of bioconda packages for transposon annotation and transposon feature annotation in nucleotide sequences. transposon_annotation_tools is part of TransposonUltimate. The package includes a series of transposable element discovery tools, such as: MUSTv2, HelitronScanner, SineFinder, MiteTracker, MiteFinderII, SineScan, TirVish, LtrHarvest, RepeatModeler, TransposonPSI, and TransposonProteinNCBICDD1000. You can then use these tools independently. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
transposon_classifier_rfsb | Transposon classification tool for nucleotide sequence classification, providing classification, model training and prediction evaluation. RFSB is part of TransposonUltimate. |
Genobioinfo Cluster: How to use |
Transrate | Transrate is software for de-novo transcriptome assembly quality analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TRASH | Tandem Repeat Annotation and Structural Hierarchy: a package to identify and extract tandem repeats in genome sequences and investigate their higher order structures. |
Genobioinfo Cluster: How to use |
TreeBeST | TreeBeST, which stands for (gene) Tree Building guided by Species Tree, is a versatile program that builds, manipulates and displays phylogenetic trees. It is particularly designed for building gene trees with a known species tree and is highly efficient and accurate. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TreeMix | TreeMix is a method for inferring the patterns of population splits and mixtures in the history of a set of populations. In the underlying model, the modern-day populations in a species are related to a common ancestor via a graph of ancestral populations. We use the allele frequencies in the modern populations to infer the structure of this graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
treePL | treePL is a phylogenetic penalized likelihood program. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TreeShrink | TreeShrink is an algorithm for detecting abnormally long branches in one or more phylogenetic trees. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TreeTime | Maximum likelihood inference of time stamped phylogenies and ancestral reconstruction. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Trim Galore | A wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files, with some extra functionality for MspI-digested RRBS-type (Reduced Representation Bisufite-Seq) libraries. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
trimAl | trimAl: a tool for automated alignment trimmin | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trimmomatic | Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data.The selection of trimming steps and their associated parameters are supplied on the command line. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
trinityrnaseq | Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules: Inchworm, Chrysalis, and Butterfly, applied sequentially to process large volumes of RNA-seq reads. Trinity partitions the sequence data into many individual de Bruijn graphs, each representing the transcriptional complexity at at a given gene or locus, and then processes each graph independently to extract full-length splicing isoforms and to tease apart transcripts derived from paralogous genes.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trinotate | Trinotate is a comprehensive annotation suite designed for automatic functional annotation of transcriptomes, particularly de novo assembled transcriptomes, from model or non-model organisms. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
tRNAscan-SE | Search for tRNA genes in genomic sequence. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Truvari | Structural variant comparison tool for VCFs |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trycycler | Trycycler is a tool for generating consensus long-read assemblies for bacterial genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TSEBRA | TSEBRA is a combiner tool that selects transcripts from gene predictions based on the support by extrisic evidence in form of introns and start/stop codons. It was developed to combine BRAKER11 and BRAKER22 predicitons to increase their accuracies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Twisst | Topology weighting by iterative sampling of sub-trees. |
Genobioinfo Cluster: How to use |
Umap | The free umap software package efficiently identifies uniquely mappable regions of any genome. Its Bismap extension identifies mappability of the bisulfite converted genome (methylome). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
UMI-tools | Tools for handling Unique Molecular Identifiers in NGS data sets |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Unicycler | Unicycler is an assembly pipeline for bacterial genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Unimap | A EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment. |
Genobioinfo Cluster: How to use |
unique-kmer-counts | This program calculates the number of distinct k-mers for each sequence record in a fasta file and divides it by the total number of k-mers in that record. |
Genobioinfo Cluster: How to use |
USEARCH | USEARCH is a unique sequence analysis tool with thousands of users world-wide. USEARCH offers search and clustering algorithms that are often orders of magnitude faster than BLAST. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
vAMPirus | Automated virus amplicon sequencing analysis program integrated with Nextflow pipeline manager. |
Genobioinfo Cluster: How to use |
VarScan | VarScan is a platform-independent software tool developed at the Genome Institute at Washington University to detect variants in NGS data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
vawk | An awk-like VCF parser |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VCF-kit | Assorted utilities for the variant call format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcf2maf | Convert a VCF into a MAF, where each variant is annotated to only one of all possible gene isoforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcflib | C++ library and cmdline tools for parsing and manipulating VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VCFtools | VCFtools is a program package designed for working with VCF files, such as those generated by the 1000 Genomes Project. The aim of VCFtools is to provide methods for working with VCF files: validating, merging, comparing and calculate some basic population genetic statistics. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Velocyto | Package for the analysis of expression dynamics in single cell RNA seq data. |
Genologin Cluster: In Python3.6.3 and R-3.6.2_gcc-7.2.0 (with module load compiler/gcc-7.2.0 libraries/hdf5-1.10.5) Genobioinfo Cluster: How to use |
Velvet | Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
vg | Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ViennaRNA | Vienna RNA package allows RNA Secondary Structure Prediction and Comparison | Genologin Cluster: How to use < Genobioinfo Cluster: How to use |
ViewBS | A powerful toolkit for visualization of high-throughput bisulfite sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VinaLC | A parallel molecular docking program based on AutoDock Vina. |
Genobioinfo Cluster: How to use |
ViQuaS | An improved reconstruction pipeline for viral quasispecies spectra generated by next-generation sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VIRify | VIRify is a pipeline for the detection, annotation, and taxonomic classification of viral contigs in metagenomic and metatranscriptomic assemblies. |
Genobioinfo Cluster: How to use |
VirSorter2 | Customizable pipeline to identify viral sequences from (meta)genomic data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VirulenceFinder | VirulenceFinder identifies viruelnce genes in total or partial sequenced isolates of bacteria - at the moment only E. coli, Enterococcus, S. aureus and Listeria are available. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
viu | A small command-line application to view images from the terminal written in Rust. |
Genobioinfo Cluster: How to use |
Vmatch | A versatile software tool for efficiently solving large scale sequence matching tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VSEARCH | Versatile open-source tool for metagenomics |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Vt | A tool set for short variant discovery in genetic sequence data. |
Genobioinfo Cluster: How to use |
Wengan | An accurate and ultra-fast genome assembler | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WFA | The wavefront alignment (WFA) algorithm is an exact gap-affine algorithm that takes advantage of homologous regions between the sequences to accelerate the alignment process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wgd | Python package and CLI for whole genome duplication analyse |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
WGDI | WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidizations and cross-species genome alignments. | Genobioinfo Cluster: How to use |
WGS-Assembler | Celera Assembler is a de novo whole-genome shotgun (WGS) DNA sequence assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Wgsim | Wgsim is a small tool for simulating sequence reads from a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WhatsHap | WhatsHap is a software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. It is especially suitable for long reads, but works also well with short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Whippet | Lightweight and Fast RNA-seq quantification at the event-level | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Whokaryote | Classification of metagenomic contigs as eukaryotic/prokaryotic using biology-based features. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WiggleTools | The WiggleTools package allows genomewide data files to be manipulated as numerical functions, equipped with all the standard functional analysis operators (sum, product, product by a scalar, comparators), and derived statistics (mean, median, variance, stddev, t-test, Wilcoxon's rank sum test, etc). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Winnowmap | Winnowmap is a long-read mapping algorithm, and a result of our exploration into superior minimizer sampling techniques. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wLogDate | Molecular Dating using logarithmic penalty function. wLogDate is a method for dating phylogenetic trees. Given a phylogeny and either sampling times for leaves or calibration points for internal nodes, wLogDate outputs a "dated" tree that conforms to the sampling times or calibration points. It can also work with no sampling time or calibration points where it would simply turn the tree into ultrametric, fixing its height to a given value. Its optimization criterion is to minimize the variance of the mutation rates in log scale (hence the term logDate). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WoLFPSort | WoLF PSORT is an extension of the PSORT II program for protein subcellular localization prediction, which is based on the PSORT principle. WoLF PSORT converts a protein's amino acid sequences into numerical localization features; based on sorting signals, amino acid composition and functional motifs. |
Genobioinfo Cluster: How to use |
wtdbg | Wtdbg2 is a de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Technologies (ONT). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
wu-blast | Similarity search against databanks, Washington University Blast.(OBSOLETE) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
yacrd | Yet Another Chimeric Read Detector for long reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Yak | Yak is initially developed for two specific use cases: 1) to robustly estimate the base accuracy of CCS reads and assembly contigs, and 2) to investigate the systematic error rate of CCS reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Yleaf | Software for human Y-chromosomal haplogroup inference from next generation sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
AGAT | Another Gff Analysis Toolkit: suite of tools to handle gene annotations in any GTF/GFF format. Some examples what AGAT can do: standardise any GTF/GFF file into a comprehensive GFF3 format (script with agat_sp prefix): add missing parent features (e.g. gene and mRNA if only CDS/exon exist). add missing features (e.g. exon and UTR). add missing mandatory attributes (i.e. ID, Parent). fix identifier to be uniq. fix feature location. remove duplicated features. group related features (if spread in different places in the file). sort features. merge overlapping loci into one single locus (only if option activated). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AnnotSV | An integrated tool for Structural Variations annotation and ranking. |
Genobioinfo Cluster: How to use |
ApoplastP | ApoplastP is a machine learning method for predicting localization of proteins to the plant apoplast. ApoplastP can distinguish non-apoplastic proteins from apoplastic proteins for both plant proteins and pathogen proteins. In particular, ApoplastP can predict if an effector localizes to the plant apoplast. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ARAGORN | ARAGORN is a program to detect tRNA genes and tmRNA genes in nucleotide sequence | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Augustus | Augustus is a program that predicts genes in eukaryotic genomic sequences |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AvP | AvP performs automatic detection of HGT candidates within a phylogenetic framework. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bakta | Rapid & standardized annotation of bacterial genomes, MAGs & plasmids. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Barrnap | Barrnap predicts the location of ribosomal RNA genes in genomes. It supports bacteria (5S,23S,16S), archaea (5S,5.8S,23S,16S), mitochondria (12S,16S) and eukaryotes (5S,5.8S,28S,18S). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Braker | BRAKER(1,2,3) is a tool for fully automated genome annotation with GeneMark-ET and AUGUSTUS |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAMISIM | CAMISIM is a software to model abundance distributions of microbial communities and to simulate corresponding shotgun metagenome datasets. |
Genobioinfo Cluster: How to use |
CAT | This project aims to provide a straightforward end-to-end pipeline that takes as input a HAL-format multiple whole genome alignment as well as a GFF3 file representing annotations on one high quality assembly in the HAL alignment, and produces a output GFF3 annotation on all target genomes chosen. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CEGMA | CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CENSOR | CENSOR compares and masks protein or nucleotide sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CIRCexplorer2 | CIRCexplorer2 is a comprehensive and integrative circular RNA analysis toolset |
Genobioinfo Cluster: How to use |
Conterminator | Conterminator is an efficient method for detecting incorrectly labeled sequences across kingdoms by an exhaustive all-against-all sequence comparison. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CONTRAST | CONTRAST predicts protein-coding genes from a multiple genomic alignment using a combination of discriminative machine learning techniques. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
dbCAN3 | run_dbcan (dbCAN3) is the standalone version of the dbCAN3 annotation tool for automated CAZyme annotation. |
Genobioinfo Cluster: How to use |
DIAMOND | Accelerated BLAST compatible local sequence aligner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DIAMOND2GO | Diamond2GO is a set of tools that can rapidly assign gene ontology and perform enrichment for functional genomics. | Genobioinfo Cluster: How to use |
DRAM | DRAM (Distilled and Refined Annotation of Metabolism) is a tool for annotating metagenomic assembled genomes and VirSorter identified viral contigs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
E2P2 | Ensemble Enzyme Prediction Pipeline. |
Genobioinfo Cluster: How to use |
EDTA | This package is developed for automated whole-genome de-novo TE annotation and benchmarking the annotation performance of TE libraries. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EffectorP | EffectorP is a machine learning method for fungal effector prediction in secretomes and has been trained to distinguish secreted proteins from secreted effectors in plant-pathogenic fungi. |
Genobioinfo Cluster: How to use |
eggNog-mapper | eggnog-mapper is a tool for fast functional annotation of novel sequences. It uses precomputed orthologous groups and phylogenies from the eggNOG database to transfer functional information from fine-grained orthologs only. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EnTAP | EnTAP is an eukaryotic non-model annotation pipeline developed by Alexander Hart and Dr. Jill Wegrzyn of the Plant Computational Genomics Lab at the University of Connecticut. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EuGeneEP | EuGene is an open integrative gene finder for eukaryotic and prokaryotic genomes. EuGene-EP (Eukaryote Pipeline) facilitates the application of EuGene on eukaryote genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EVidenceModeler (EVM) | The EVidenceModeler (aka EVM) software combines ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. EVM provides a flexible and intuitive framework for combining diverse evidence types into a single automated gene structure annotation system. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Finder | Finder is a gene annotator pipeline which automates the process of downloading short reads, aligning them and using the assembled transcripts to generate gene annotations. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fixchr | This package selects homologous chromosomes between two genomes by comparing whole-genome alignments between them. Additionally, it generates dotplots for quick checking of the output. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fpma | Fast Plant Mito Annotation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FragGeneScan | FragGeneScan is an application for finding (fragmented) genes in short reads. It can also be applied to predict prokaryotic genes in incomplete assemblies or complete genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FragGeneScanRs | FragGeneScanRs is an application for finding (fragmented) genes in short reads. It can also be applied to predict prokaryotic genes in incomplete assemblies or complete genomes. It is a re-implementation of FragGeneScan in Rust. |
Genobioinfo Cluster: How to use |
FrameDP | Sensitive peptide detection on noisy matured sequences. Available with command line interface on the cluster. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Funannotate | Funannotate is a genome prediction, annotation, and comparison software package. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GAAS | Genome Assembly Annotation Service: Suite of tools related to Genome Assembly Annotation Service tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GALBA | GALBA is a pipeline for fully automated prediction of protein coding gene structures with AUGUSTUS in novel eukaryotic genomes for the scenario where high quality proteins from a closely related species are available. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GeMoMa | Gene Model Mapper (GeMoMa) is a homology-based gene prediction program. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
geneid | geneid is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMark-ES | Unsupervised training is an important feature of the GeneMark-ES algorithm that identifies protein coding genes in eukaryotic genomes. This is the only eukaryotic gene finder that can perform gene prediction without curated training sets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMark-ET | a semi-supervised version of GeneMark-ES, called GeneMark-ET that uses RNA-Seq reads to improve training. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMark-ETP | Gene finding in eukaryotic genomes supported by transcriptome sequencing and protein homology. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GeneMarkS | A self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeThreader | GenomeThreader is a software tool to compute gene structure predictions. The gene structure predictions are calculated using a similarity-based approach where additional cDNA/EST and/or protein sequences are used to predict gene structures via spliced alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gff3toembl | Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ggCaller | A de Bruijn graph-based gene-caller and pangenome analysis tool. | Genobioinfo Cluster: How to use |
GlimmerHMM | GlimmerHMM is a new gene finder based on a Generalized Hidden Markov Model (GHMM). Although the gene finder conforms to the overall mathematical framework of a GHMM, additionally it incorporates splice site models adapted from the GeneSplicer program and a decision tree adapted from GlimmerM. It also utilizes Interpolated Markov Models for the coding and noncoding models . Currently, GlimmerHMM's GHMM structure includes introns of each phase, intergenic regions, and four types of exons (initial, internal, final, and single). |
Genobioinfo Cluster: How to use |
Gmove | Gmove is a genome annotation tool. This combiner takes as input mapping of RNA-seq or protein or ab initio data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GraffiTE | GraffiTE is a pipeline that finds polymorphic transposable elements in genome assemblies and/or long reads, and genotypes the discovered polymorphisms in read sets using genome-graphs. |
Genobioinfo Cluster: How to use |
GUSHR | Assembly-free construction of UTRs from short read RNA-Seq data on the basis of coding sequence annotation. This tool has been adapted to the format needs of AUGUSTUS/BRAKER and employs GeMoMa for generating UTRs from RNA-Seq coverage data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HANNO | Efficient High-throughput ANNOtation of protein coding genes in eukaryote genomes. |
Genobioinfo Cluster: How to use |
Helixer | Using Deep Learning to predict gene annotations. |
Genobioinfo Cluster: How to use |
HelixerPost | HelixerPost uses a sliding window assessment to determine regions of the genome which are likely gene containing. |
Genobioinfo Cluster: How to use |
hexamer | Find likely coding segments in DNA using composition-normalised hexamer tables. |
Genobioinfo Cluster: How to use |
ICEscreen | ICEscreen is a bioinformatic pipeline for the detection and annotation of ICEs (Integrative and Conjugative Elements) and IMEs (Integrative and Mobilizable Elements) in Bacillota genomes. |
Genobioinfo Cluster: How to use |
Infernal | Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a special case of profile stochastic context-free grammars called covariance models (CMs). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Integron Finder | Bioinformatics tool to find integrons in bacterial genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iREAD | iREAD (intron REtention Analysis and Detector)is a tool to detect intron retention(IR) events from RNA-seq datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KofamScan | KofamScan is a gene function annotation tool based on KEGG Orthology and hidden Markov model |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Liftoff | Liftoff is a tool that accurately maps annotations in GFF or GTF between assemblies of the same, or closely-related species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LiftOn | Accurate annotation mapping for GFF/GTF across assemblies. |
Genobioinfo Cluster: How to use |
LOCALIZER | LOCALIZER is a machine learning method for predicting the subcellular localization of both plant proteins and pathogen effectors in the plant cell. |
Genobioinfo Cluster: How to use |
Loctree3 | Protein Subcelullar Localization Sequenced-Based Predictor |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Look4TRs | A de-novo tool for detecting simple tandem repeats using self-supervised hidden Markov models. |
Genobioinfo Cluster: How to use |
LTR_FINDER_parallel | A parallel wrapper for LTR_FINDER (LTR_Finder is an efficient program for finding full-length LTR retrotranspsons in genome sequences.) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LTR_retriever | LTR_retriever is a highly accurate and sensitive program for identification of LTR retrotransposons. |
Genobioinfo Cluster: How to use |
MAKER | MAKER is a portable and easily configurable genome annotation pipeline. Its purpose is to allow smaller eukaryotic and prokaryotic genome projects to independently annotate their genomes and to create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab-initio gene predictions and automatically synthesizes these data into gene annotations having evidence-based quality values. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaEuk | MetaEuk is a modular toolkit designed for large-scale gene discovery and annotation in eukaryotic metagenomic contigs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Mfannot | MFannot is a program for the annotation of mitochondrial and plastid genomes. It is a PERL wrapper around a set of diverse, external independent tools. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mgatk | A mitochondrial genome analysis toolkit. |
Genobioinfo Cluster: How to use |
MinCED | MinCED is a program to find Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) in full genomes or environmental datasets such as metagenomes, in which sequence size can be anywhere from 100 to 800 bp. MinCED runs from the command-line and was derived from CRT |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MitoFinder | Mitofinder is a pipeline to assemble mitochondrial genomes and annotate mitochondrial genes from trimmed read sequencing data. MitoFinder is also designed to find and annotate mitochondrial sequences in existing genomic assemblies (generated from Hifi/PacBio/Nanopore/Illumina sequencing data...) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MitoHPC | MitoHPC : Mitochondrial High Performance Caller. For Calling Mitochondrial Homoplasmies and Heteroplasmies. |
Genobioinfo Cluster: How to use |
MitoSeek | MitoSeek is an open-source software tool to reliably and easily extract mitochondrial genome information from exome sequencing data. MitoSeek evaluates mitochondrial genome alignment quality, estimates relative mitochondrial copy numbers, and detects heteroplasmy, somatic mutation, and structural variance of the mitochondrial genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MitoZ | MitoZ is a Python3-based toolkit which aims to automatically filter pair-end raw data (fastq files), assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome (genbank file as result), and mitogenome visualization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiXCR | MiXCR is a universal software for fast and accurate analysis of raw T- or B- cell receptor repertoire sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MobileElementFinder | MobileElementFinder is a tool for identifying Mobile Genetic Elements (MGEs) in Whole Genome Shotgun sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MSIsensor-pro | MSIsensor-pro is an updated version of msisensor. MSIsensor-pro evaluates Microsatellite Instability (MSI) for cancer patients with next generation sequencing data. |
Genobioinfo Cluster: How to use |
NLR-Annotator | Disease resistance genes encoding nucleotide-binding and leucine-rich repeat (NLR) intracellular immune receptor proteins detect pathogens by the presence of pathogen effectors. Although developed for wheat, we demonstrate the universal applicability of NLR-Annotator across diverse plant taxa. NLR-Annotator is a tool to annotate loci associated with NLRs in large sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NLRtracker | NLRtracker extracts and functionally annotates NLRs from protein or transcript files based on the core features found in the RefPlantNLR dataset. |
Genobioinfo Cluster: How to use |
ORA | Bio::ORA is a featherweight object for identifying mammalian olfactory receptor genes. The sequences should not be longer than 40kb. The returned array include location, sequence and statistic for the putative olfactory receptor gene. Fully functional with DNA and EST sequence, no intron supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ORFfinder | ORFfinder searches for open reading frames (ORFs) in the DNA sequence you enter. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
orfipy | Fast and flexible ORF finder. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OrthoFinder | OrthoFinder is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers rooted gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanTools | PanTools is a toolkit for comparative analysis of large number of genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PASApipeline | PASA, acronym for Program to Assemble Spliced Alignments, is a eukaryotic genome annotation tool that exploits spliced alignments of expressed transcript sequences to automatically model gene structures, and to maintain gene structure annotation consistent with the most recently available experimental sequence data. PASA also identifies and classifies all splicing variations supported by the transcript alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PathoFact | PathoFact is an easy-to-use modular pipeline for the metagenomic analyses of toxins, virulence factors and antimicrobial resistance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PGAP | The NCBI Prokaryotic Genome Annotation Pipeline is designed to annotate bacterial and archaeal genomes (chromosomes and plasmids). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PHASTER_scripts | Small utility scripts to query the PHASTER API endpoint, to identify and annotate prophage sequences within bacterial genomes and plasmids. |
Genobioinfo Cluster: How to use |
PhiSpy | PhiSpy identifies prophages in Bacterial (and probably Archaeal) genomes. Given an annotated genome it will use several approaches to identify the most likely prophage regions. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Phobius | A combined transmembrane topology and signal peptide predictor. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PlasFlow | Software for prediction of plasmid sequences in metagenomic assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PlasmidFinder | The service identifies plasmids in total or partial sequenced isolates of bacteria. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Portcullis | Splice junction analysis and filtering from BAM files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PredGPI | Prediction of glycosylphosphatidylinositol-anchors in proteins based on HMMs and SVMs. |
Genobioinfo Cluster: How to use |
Prodigal | Prodigal (Prokaryotic Dynamic Programming Genefinding Algorithm) is a microbial (bacterial and archaeal) gene finding program developed at Oak Ridge National Laboratory and the University of Tennessee. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PROKKA | Prokka is a software tool for the rapid annotation of prokaryotic genomes. A typical 4 Mbp genome can be fully annotated in less than 10 minutes on a quad-core computer, and scales well to 32 core SMP systems. It produces GFF3, GBK and SQN files that are ready for editing in Sequin and ultimately submitted to Genbank/DDJB/ENA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PSAURON | PSAURON is a machine learning model for rapid assessment of protein coding gene annotation. |
Genobioinfo Cluster: How to use |
Pseudofinder | Detection of pseudogene candidates in bacterial and archaeal genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RabbitV | RabbitV is a highly optimized and practical toolkit for the detection of viruses and microorganisms in sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RATT | RATT is software to transfer annotation from a reference (annotated) genome to an unannotated query genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REPET | The REPET package (t Flutre et al, 2011 ) integrates bioinformatics programs in order to tackle biological issues at the genomic scale. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RFPlasmid | RFPlasmid predicts plasmid contigs from assemblies using single copy marker genes, plasmid genes, and kmers. |
Genobioinfo Cluster: How to use |
RGAugury | A pipeline consisted of a couple of scripts for genome-wide RGAs prediction, most of single script in this package can work independently or together. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RGI | Software to predict resistomes from protein or nucleotide data, including metagenomics data, based on homology and SNP models. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G | Sorting Intolerant From Tolerant For Genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G_Annotator | Annotating VCF files using the SIFT4G databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SNAP | Gene prediction tool |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
snoGPS | Search for H/ACA snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
Snoscan | Search for C/D box methylation guide snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
SOBAcl | The Sequence Ontology Bioinformatics Analysis command line tool (SOBAcl) will generate a variety of tables, graphs and reports from the data in GFF3 files and format the output in a variety of ways. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sturgeon | Sturgeon is a CNS neural network classifier for tumour classification |
Genobioinfo Cluster: How to use |
T-lex | T-lex is a computational pipeline that detects presence and/or absence of annotated individual transposable elements (TEs) using next-generation sequencing (NGS) data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TAMA | Transcriptome Annotation by Modular Algorithms: this software was designed for processing Iso-Seq data and other long read transcriptome data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TargetP | TargetP-2.0 server predicts the presence of N-terminal presequences: signal peptide (SP), mitochondrial transit peptide (mTP), chloroplast transit peptide (cTP) or thylakoid luminal transit peptide (lTP). For the sequences predicted to contain an N-terminal presequence a potential cleavage site is also predicted. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TEnest | TEnest is a tool for finding and annotating transposable element (TE) insertions. |
Genobioinfo Cluster: How to use |
Tiberius | Tiberius is a deep learning-based ab initio gene structure prediction tool that end-to-end integrates convolutional and long short-term memory layers with a differentiable HMM layer. |
Genobioinfo Cluster: How to use |
Tn3+TA_finder | Tn3 Transposon/Toxin Finder (Tn3+TA_finder) is a program for the automatic prediction of transposable elements of the Tn3 family associated with type II toxin and antitoxin pairs in bacteria and archaea. |
Genobioinfo Cluster: How to use |
TnComp_finder | Composite Transposon Finder (TnComp_finder) is a program for the prediction of putative composite transposons in bacterial and archaeal genomes based on insertion sequence replicas in a relatively short span. |
Genobioinfo Cluster: How to use |
TOGA | TOGA is a new method that integrates gene annotation, inferring orthologs and classifying genes as intact or lost. |
Genobioinfo Cluster: How to use |
TransDecoder | TransDecoder identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Transposome | Transposome is a command line application to annotate transposable elements from paired-end whole genome shotgun data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
transposon_annotation_tools | A set of bioconda packages for transposon annotation and transposon feature annotation in nucleotide sequences. transposon_annotation_tools is part of TransposonUltimate. The package includes a series of transposable element discovery tools, such as: MUSTv2, HelitronScanner, SineFinder, MiteTracker, MiteFinderII, SineScan, TirVish, LtrHarvest, RepeatModeler, TransposonPSI, and TransposonProteinNCBICDD1000. You can then use these tools independently. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
transposon_classifier_rfsb | Transposon classification tool for nucleotide sequence classification, providing classification, model training and prediction evaluation. RFSB is part of TransposonUltimate. |
Genobioinfo Cluster: How to use |
TRASH | Tandem Repeat Annotation and Structural Hierarchy: a package to identify and extract tandem repeats in genome sequences and investigate their higher order structures. |
Genobioinfo Cluster: How to use |
Trinotate | Trinotate is a comprehensive annotation suite designed for automatic functional annotation of transcriptomes, particularly de novo assembled transcriptomes, from model or non-model organisms. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
tRNAscan-SE | Search for tRNA genes in genomic sequence. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TSEBRA | TSEBRA is a combiner tool that selects transcripts from gene predictions based on the support by extrisic evidence in form of introns and start/stop codons. It was developed to combine BRAKER11 and BRAKER22 predicitons to increase their accuracies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VARUS | VARUS automates the selection and download of a limited number of RNA-seq reads from at NCBI's Sequence Read Archive (SRA) targeting a sufficiently high coverage for many genes for the purpose of gene-finder training and genome annotation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcf2maf | Convert a VCF into a MAF, where each variant is annotated to only one of all possible gene isoforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VIRify | VIRify is a pipeline for the detection, annotation, and taxonomic classification of viral contigs in metagenomic and metatranscriptomic assemblies. |
Genobioinfo Cluster: How to use |
VirulenceFinder | VirulenceFinder identifies viruelnce genes in total or partial sequenced isolates of bacteria - at the moment only E. coli, Enterococcus, S. aureus and Listeria are available. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WoLFPSort | WoLF PSORT is an extension of the PSORT II program for protein subcellular localization prediction, which is based on the PSORT principle. WoLF PSORT converts a protein's amino acid sequences into numerical localization features; based on sorting signals, amino acid composition and functional motifs. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
3D-DNA | 3D de novo assembly (3D-DNA) pipeline. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ABySS | ABySS (Assembly By Short Sequences) is a de novo, parallel, paired-end sequence assembler that is designed for short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ALLHiC | Phasing and scaffolding polyploid genomes based on Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AllPaths-LG | ALLPATHS-LG is a whole genome shotgun assembler that can generate high quality genome assemblies using short reads (~100bp) such as those produced by the new generation of sequencers. The significant difference between ALLPATHS and traditional assemblers such as Arachne is that ALLPATHS assemblies are not necessarily linear, but instead are presented in the form of a graph. This graph representation retains ambiguities, such as those arising from polymorphism, uncorrected read errors, and unresolved repeats, thereby providing information that has been absent from previous genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AMOS | A Modular, Open-Source whole genome assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AnchorWave | AnchorWave (Anchored Wavefront Alignment) identifies collinear regions via conserved anchors (full-length CDS and full-length exon have been implemented currently) and breaks collinear regions into shorter fragments, i.e., anchor and inter-anchor intervals. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Aquila | Diploid personal genome assembly and comprehensive variant detection based on linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARBitR | ARBitR is an overlap aware genome assembly scaffolder for linked sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARC | ARC is a pipeline which facilitates iterative, reference guided de novo assemblies with the intent of: - Reducing time in analysis and increasing accuracy of results by only considering those reads which should assemble together. - Reducing/removing reference bias as compared to mapping based approaches. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARCS | Scaffolding genome sequence assemblies using 10X Genomics GemCode/Chromium data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ARKS | Scaffolding genome sequence assemblies using 10X Genomics GemCode/Chromium data. This project is a new kmer-based (alignment free) implementation of ARCS. It provides improved runtime performance over the original ARCS implementation by removing the requirement to perform alignments with bwa mem. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASGART | ASGART (A Segmental duplications Gathering and Refinement Tool) is a multiplatform (GNU/Linux, macOS, Windows) tool designed to search for large duplications amongst one or two DNA strands. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
assemblathon2 | This repo contains a motley assortment of unpublished scripts and commands used by Ian Korf, Keith Bradnam, and Joe Fass in the analysis of Assemblathon 2 competition entries (assemblies). |
Genobioinfo Cluster: How to use |
assembly-stats | Get assembly statistics from FASTA and FASTQ files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Assemblytics | Assemblytics is a bioinformatics tool to detect and analyze structural variants from a genome assembly by comparing it to a reference genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Assexon | Assembling Exon Using Gene Capture Data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
aTRAM | aTRAM ("automated target restricted assembly method") is an iterative assembler that performs reference-guided local de novo assemblies using a variety of available methods. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AutoHiC | AutoHiC is a deep learning tool that uses Hi-C data to support genome assembly. It can automatically correct errors during genome assembly and generate genomes at the chromosome level. |
Genobioinfo Cluster: How to use |
Bandage-NG | Bandage-NG is a GUI program that allows users to interact with the assembly graphs made by de novo assemblers such as SPAdes, MEGAHIT and others. |
Genobioinfo Cluster: How to use |
BCALM2 | A bioinformatics tool for constructing the compacted de Bruijn graph from sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bifrost | Highly parallel construction and indexing of colored and compacted de Bruijn graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BiSCoT | BiSCoT is a tool that aims to improve the contiguity of scaffolds and contigs generated after a Bionano scaffolding. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bridger | Bridger is an efficient de novo trascriptome assembler for RNA-Seq data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BUSCO | BUSCO v2 provides quantitative measures for the assessment of genome assembly, gene set, and transcriptome completeness, based on evolutionarily-informed expectations of gene content from near-universal single-copy orthologs selected from OrthoDB v9. BUSCO assessments are implemented in open-source software, with a large selection of lineage-specific sets of Benchmarking Universal Single-Copy Orthologs. These conserved orthologs are ideal candidates for large-scale phylogenomics studies, and the annotated BUSCO gene models built during genome assessments provide a comprehensive gene predictor training set for use as part of genome annotation pipelines. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BUSCO_phylogenomics | This is a Python pipeline to construct species phylogenies using BUSCO proteins. It works directly from BUSCO output and can generate concatenated supermatrix alignments and also gene trees of BUSCO families. The pipeline identifies BUSCO proteins that are complete and single-copy in all input samples. Alternatively, you can account for missing data and choose to include BUSCO proteins that are complete and single-copy in a certain percentage of input samples. Each BUSCO family is individually aligned, trimmed, and then concatenated together to generate a supermatrix alignment. The pipeline also identifies BUSCO proteins that are complete and single-copy in at least 4 input samples, and generates gene trees for each of these families. |
Genobioinfo Cluster: How to use |
BWISE | Software for genome assembly using short-reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAMI-AMBER | AMBER is an evaluation package for the comparative assessment of genome reconstructions and taxonomic assignments from metagenome benchmark datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAMISIM | CAMISIM is a software to model abundance distributions of microbial communities and to simulate corresponding shotgun metagenome datasets. |
Genobioinfo Cluster: How to use |
canu | A single molecule sequence assembler for genomes large and small. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAP3 | A DNA Sequence Assembly Program | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CARNAC-LR | Clustering coefficient-based Acquisition of RNA Communities in Long Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CheckM | Assess the quality of microbial genomes recovered from isolates, single cells, and metagenomes. |
Genobioinfo Cluster: How to use |
chromeister | A dotplot generator for large chromosomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Chromonomer | Chromonomer is a program designed to integrate a genome assembly with a genetic map. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Circlator | A tool to circularize genome assemblies | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CliqueSNV | Scalable Reconstruction of Intra-Host Viral Populations from NGS Reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
COMEBin | COMEBin allows effective binning of metagenomic contigs using COntrastive Multi-viEw representation learning. |
Genobioinfo Cluster: How to use |
compleasm | A genome completeness evaluation tool based on miniprot. |
Genobioinfo Cluster: How to use |
CulebrONT | An open-source, scalable, modular and traceable Snakemake pipeline, able to launch multiple assembly tools in parallel, giving you the possibility of circularise, polish, and correct assemblies, checking quality. CulebrONT can help to choose the best assembly between all possibilities. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DAmar | Long read QC, assembly and scaffolding pipeline for PacBio or Oxford Nanopore long-read sequencing data. T he pipeline produces a number of QC metrics at various stages as well as incorporating further technologies including Bionano, 10x and HiC data to scaffold the created contigs. DAmar, is a hybrid of the earlier Marvel, Dazzler, and Daccord systems of the Eugene Myers lab. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DAZZ_DB | To facilitate the multiple phases of the dazzler assembler, we organize all the read data into what is effectively a "database" of the reads and their meta-information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DBG2OLC | The genome assembler that reduces the computational time of human genome assembly from 400,000 CPU hours to 2,000 CPU hours, utilizing long erroneous 3GS sequencing reads and short accurate NGS sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DENTIST | DENTIST is a sensitive, highly-accurate and automated pipeline method to close gaps in (short read) assemblies with long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Discovar | Assemble genomes and find variants with DISCOVAR & DISCOVAR de novo | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
drap | De novo RNA-seq Assembly Pipeline |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
dRep | dRep is a python program for rapidly comparing large numbers of genomes. dRep can also "de-replicate" a genome set by identifying groups of highly similar genomes and choosing the best representative genome for each genome set. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EUPAN | Toolkit that integrates various software in order to build eukaryotic pangenomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FABuLOUS | A gap-closing software tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. Initially called TGS-GapCloser. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON | Falcon: a set of tools for fast aligning long reads for consensus and assembly |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON_unzip | Making diploid assembly becomes common practice for genomic study | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON-Phase | FALCON-Phase integrates PacBio long-read assemblies with Phase Genomics Hi-C data to create phased, diploid, chromosome-scale scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Fast-Plast | Fast-Plast is a pipeline that leverages existing and novel programs to quickly assemble, orient, and verify whole chloroplast genome sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FLASH | FLASH, Fast Length Adjustment of SHort reads, is a very accurate fast tool to merge paired-end reads from fragments that are shorter than twice the length of reads. The extended length of reads has a significant positive impact on improvement of genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Flye | Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FMLRC2 | FMLRC2 performs error correction/polishing of long erroneous sequences with accurate short reads. As such, it can be used as both an error-correction tool <1> for raw long reads (ex. Oxford Nanopore) and a polishing tool <2> for de novo assemblies. |
Genobioinfo Cluster: How to use |
GAAS | Genome Assembly Annotation Service: Suite of tools related to Genome Assembly Annotation Service tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GapCloser | The GapCloser is designed to close the gaps emerging during the scaffolding process by SOAPdenovo, using the abundant pair relationships of short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
GenomeScope2.0 | Reference-free profiling of polyploid genomes | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GetOrganelle | This toolkit assemblies organelle genome from genomic skimming data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gfastats | A single fast and exhaustive tool for summary statistics and simultaneous *fa* (fasta, fastq, gfa <.gz>) genome assembly file manipulation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GLIMPSE | GLIMPSE is a phasing and imputation method for large-scale low-coverage sequencing studies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphUnzip | Unzip assembly graphs with Hi-C data and/or long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Hap10 | The goal is to reconstruct accurate and long haplotypes polyploid genome using linked reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HAPHPIPE | NGS viral assembly and population genetics. |
Genobioinfo Cluster: How to use |
HaploHiC | Comprehensive haplotype division of Hi-C PE-reads based on local contacts ratio. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HaploMerger2 | <HaploMerger2 (HM2) is an important upgrade over HaploMerger. HM2 is an easy-to-use automated pipeline for improving genome assembly in the post-assembly stage. It consists of a set of executables as well as wrappers for several third-part software. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Hapo-G | Hapo-G is a tool that aims to improve the quality of genome assemblies by polishing the consensus with accurate reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HASLR | HASLR is a tool for rapid genome assembly of long sequencing reads. HASLR is a hybrid tool which means it requires long reads generated by Third Generation Sequencing technologies (such as PacBio or Oxford Nanopore) together with Next Generation Sequencing reads (such as Illumina) from the same sample. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
hifiasm | Hifiasm is a fast haplotype-resolved de novo assembler for PacBio Hifi reads. Unlike most existing assemblers, hifiasm starts from uncollapsed genome. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hifiasm-meta | De novo metagenome assembler, based on hifiasm, a haplotype-resolved de novo assembler for PacBio Hifi reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IBA | Python script to assemble AHE (Anchored Hybrid Enrichment) data loci by loci. To summarize, raw sequencing reads for each species were filtered for quality using Trim Galore! v. 0.4.0 (Krueger, 2015), and assembled using the iterative baited assembly (IBA) Python script which employs USEARCH v. 7.0 (Edgar, 2010) and Bridger v. 2014-12-01 (Chang et al., 2015) to assemble loci for each taxon. MAFFT v. 7.245 (Katoh and Standley, 2013) was used to align assembled sequences, and the probe and flanking regions were separated with the Python script Extract_probe_region.py (Breinholt et al., 2018). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Inspector | A tool for evaluate long-read de novo assembly results. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IPA | Improved Phased Assembler (IPA) is the official PacBio software for HiFi genome assembly. IPA was designed to utilize the accuracy of PacBio HiFi reads to produce high-quality phased genome assemblies. IPA is an end-to-end solution, starting with input reads and resulting in a polished assembly. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IRMA | IRMA was designed for the robust assembly, variant calling, and phasing of highly variable RNA viruses. Currently IRMA is deployed with modules for influenza and ebolavirus. IRMA is free to use and parallelizes computations for both cluster computing and single computer multi-core setups. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IsoLasso | IsoLasso is an algorithm to assemble transcripts and estimate their expression levels from RNA-Seq reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IVA | Iterative Virus Assembler is a de novo assembler designed to assemble virus genomes that have no repeat sequences, using Illumina read pairs sequenced from mixed populations at extremely high and variable depth | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
JASPER | JASPER (Jellyfish based Assembly Sequence Polisher for Error Reduction) is an efficient polishing tool for draft genomes. It uses accurate reads (PacBio HiFi or Illumina) to evaluate consensus quality and correct consensus errors in genome assemblies. JASPER is substantially faster than polishing methods based on sequence alignment, and more accurate than currently available k-mer based methods. The efficiency and scalability of JASPER allows one to use it to create personalized reference genomes for specific populations very efficiently, even for large sequenced populations, by polishing the reference genome, such as GRCh38 or chm13v2.0 for human, with Illumina reads sequenced from many individuals from the population. Please see this manuscript for more details: Guo A, Salzberg SL, Zimin AV. JASPER: A fast genome polishing tool that improves accuracy of genome assemblies. PLoS Comput Biol. 2023 Mar 31;19(3):e1011032. doi: 10.1371/journal.pcbi.1011032. PMID: 37000853; PMCID: PMC10096238. |
Genobioinfo Cluster: How to use |
JTK | A regional diploid genome assembler. |
Genobioinfo Cluster: How to use |
KAD | KAD is designed for evaluating the accuracy of nucleotide base quality of genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KMC | Genologin Cluster: How to use Genobioinfo Cluster: How to use | |
KmerGenie | KmerGenie estimates the best k-mer length for genome de novo assembly. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LACHESIS | Software that uses Hi-C data for ultra-long-range scaffolding of de novo genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
lamassemble | Merge overlapping "long" DNA reads into a consensus sequence. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Linker | Linker is a suite of C++ tools useful for interpreting long and linked read sequencing of cancer genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LINKS | LINKS is a scalable genomics application for scaffolding or re-scaffolding genome assembly drafts with long reads, such as those produced by Oxford Nanopore Technologies Ltd and Pacific Biosciences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LJA | La Jolla Assembler (LJA) is a tool for genome assembly from HiFI reads based on de Bruijn graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LongStitch | A genome assembly correction and scaffolding pipeline using long reads. Basically runs Tigmint, ntLink, ARKS. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LR_Gapcloser | LR_Gapcloser is a gap closing tool using uncorrected or corrected long reads generated from Pacbio platform or Nanopore platform. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRScaf | TGS scaffolding . Improving draft genomes using long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRSIM | Simulator for Linked Reads: this package simulates whole genome sequencing using 10X Genomics Linked Read technology. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LTR_retriever | LTR_retriever is a highly accurate and sensitive program for identification of LTR retrotransposons. |
Genobioinfo Cluster: How to use |
Mapsembler2 | Mapsembler2 is a targeted assembly software. It takes as input any number of NGS raw read set(s) (fasta or fastq, gzipped or not) and a set of input sequences (starters). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MARVEL | MARVEL consists of a set of tools that facilitate the overlapping, patching, correction and assembly of noisy (not so noisy ones as well) long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MaSuRCA | MaSuRCA is whole genome assembly software. It combines the efficiency of the de Bruijn graph and Overlap-Layout-Consensus (OLC) approaches. MaSuRCA can assemble data sets containing only short reads from Illumina sequencing or a mixture of short reads and long reads (Sanger, 454) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MAtCHap | An ultra fast algorithm for solving the single individual haplotype assembly problem. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MECAT | MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MEGAHIT | An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MeGAMerge | A tool to merge assembled contigs, long reads from metagenomic sequencing runs |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Merqury | Evaluate genome assemblies with k-mers and more |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Meryl | A genomic k-mer counter (and sequence utility) with nice features. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaMDBG | A lightweight assembler for long and accurate metagenomics reads. |
Genobioinfo Cluster: How to use |
MetaWRAP | A flexible pipeline for genome-resolved metagenomic data analysis. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MGSE | MGSE can harness the power of files generated in genome sequencing projects to predict the genome size. Required are the FASTA file containing a high continuity assembly and a BAM file with all available reads mapped to this assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Miniasm | Ultrafast de novo assembly for long noisy reads (though having no consensus step). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiniBUSCO | A genome completeness evaluation tool based on miniprot. |
Genobioinfo Cluster: How to use |
Minipolish | A tool for Racon polishing of miniasm assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MIRA | Whole genome shotgun and EST sequence assembler for Sanger, 454, and Solexa / Illumina. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MITGARD | MITGARD (Mitochondrial Genome Assembly from RNA-seq Data) is a computational tool designed to recover the mitochondrial genome from RNA-seq data of any Eukaryote species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MITObim | The MITObim procedure (mitochondrial baiting and iterative mapping) represents a highly efficient approach to assembling novel mitochondrial genomes of non-model organisms directly from total genomic DNA derived NGS reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MitoFinder | Mitofinder is a pipeline to assemble mitochondrial genomes and annotate mitochondrial genes from trimmed read sequencing data. MitoFinder is also designed to find and annotate mitochondrial sequences in existing genomic assemblies (generated from Hifi/PacBio/Nanopore/Illumina sequencing data...) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MitoHiFi | MitoHiFi circularises, cuts and annotates the mitogenome from contigs assembled with PacBio HiFi reads and softwares such as HiCanu or Hifiasm. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mitoVGP | The Vertebrate Genomes Project Mitogenome Assembly Pipeline. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MitoZ | MitoZ is a Python3-based toolkit which aims to automatically filter pair-end raw data (fastq files), assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome (genbank file as result), and mitogenome visualization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiXCR | MiXCR is a universal software for fast and accurate analysis of raw T- or B- cell receptor repertoire sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MSI | MSI was designed for sequencing reads with higher error rates (e.g., as the ones produced by Nanopore's sequencers) but also works with reads with lower error rates (e.g., Illumina). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MTG-Link | MTG-Link is a novel gap-filling tool for draft genome assemblies, dedicated to linked read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Nanopolish | A nanopore consensus algorithm using a signal-level hidden Markov model. Signal-level algorithms for MinION data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NaS | NaS is a hybrid approach developed to take advantage of data generated using MinION device. It combines Illumina and Oxford Nanopore technologies to produce NaS (Nanopore Synthetic-long) reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NECAT | NECAT is an error correction and de-novo assembly tool for Nanopore long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Newbler | Newbler is a software package for de novo DNA sequence assembly. It is designed specifically for assembling sequence data generated by the 454 GS-series of pyrosequencing platforms sold by 454 Life Science, a Roche diagnostic. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NextDenovo | NextDenovo is a string graph-based de novo assembler for TGS long reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NOVOPlasty | NOVOPlasty is a de novo assembler and variance caller for short circular genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nQuire | A statistical framework for ploidy estimation using NGS short-read data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ntEdit | ntEdit is a fast and scalable genomics application for polishing genome assembly drafts. It simplifies polishing and "haploidization" of gene and genome sequences with its re-usable Bloom filter design. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ntJoin | Scaffolding draft assemblies using reference assemblies and minimizer graphs | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Oases | Oases is a de novo transcriptome assembler designed to produce transcripts from short read sequencing technologies, such as Illumina, SOLiD, or 454 in the absence of any genomic assembly. It was developed by Marcel Schulz (MPI for Molecular Genomics) and Daniel Zerbino (previously at the European Bioinformatics Institute (EMBL-EBI), now at UC Santa Cruz). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ORG.asm | The ORGanelle ASseMbler aims to target the assembling of small sequences over-represented in a whole genome shotgun sequence dataset. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Organelle_PBA | OrganelleRef_PBA is a script to perform a de-novo PacBio assemblies of any organelle (chloroplast or mitochondrial genomes) using several programs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PASApipeline | PASA, acronym for Program to Assemble Spliced Alignments, is a eukaryotic genome annotation tool that exploits spliced alignments of expressed transcript sequences to automatically model gene structures, and to maintain gene structure annotation consistent with the most recently available experimental sequence data. PASA also identifies and classifies all splicing variations supported by the transcript alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pb-assembly | PacBio® tools : everything needed to run Falcon and Unzip | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PECAT | PECAT is a phased error correction and assembly tool for long reads. It includes a haplotype-aware correction method and an efficient diploid assembly method. |
Genobioinfo Cluster: How to use |
Peregrine | Peregrine is a fast genome assembler for accurate long reads (length > 10kb, accuraccy > 99%). It can assemble a human genome from 30x reads within 20 cpu hours from reads to polished consensus. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
phasebook | phasebook is a novel approach for reconstructing the haplotypes of diploid genomes from long reads de novo, that is without the need for a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pilon | Pilon is an automated genome assembly improvement and variant detection tool. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PlasForest | A random forest classifier of contigs to identify contigs of plasmid origin in contig and scaffold genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Plass | Plass (Protein-Level ASSembler) is a software to assemble short read sequencing data on a protein level. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
plassembler | plassembler is a program that is designed for automated & fast assembly of plasmids in bacterial genomes that have been hybrid sequenced with long read & paired-end short read sequencing. |
Genobioinfo Cluster: How to use |
Platanus | Platanus is a novel de novo sequence assembler that can reconstruct genomic sequences of highly heterozygous diploids from massively parallel shotgun sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Platanus2 | Platanus-allee (Platanus2) is a de novo haplotype assembler (phasing tool), which assembles each haplotype sequence in a diploid genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
plotsr | Tool to plot synteny and structural rearrangements between genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Polypolish | Polypolish is a tool for polishing genome assemblies with short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Purge_Dups | purge_dups is designed to remove haplotigs and contig overlaps in a de novo assembly based on read depth. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Purge_Haplotigs | Pipeline to help with curating heterozygous diploid genome assemblies (for instance when assembling using FALCON or FALCON-unzip). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pypolca | pypolca is a Standalone Python re-implementation of the POLCA polisher from the MaSuRCA genome assembly and analysis toolkit. |
Genobioinfo Cluster: How to use |
quarTeT | quarTeT is a collection of tools for T2T genome assembly and basic analysis in automatic workflow. |
Genobioinfo Cluster: How to use |
QUAST | QUAST evaluates genome assemblies by computing various metrics |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
quickmerge | A simple and fast metassembler and assembly gap filler designed for long molecule based assemblies | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ra | Ra is as a fast and easy to use assembler for raw reads generated by third generation sequencing. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Rabaler | Rebaler is a program for conducting reference-based assemblies using long reads. It relies mainly on minimap2 for alignment and Racon for making consensus sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Racon | Consensus module for raw de novo DNA assembly of long uncorrected reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAFT | RAFT (Repeat Aware Fragmentation Tool) is an algorithm designed to improve assembly quality by rescuing contained reads. RAFT breaks long reads into smaller sub-reads by following an algorithm described in our preprint. The read fragmentation allows an OLC assembler to retain contained reads during string graph construction. When input reads have non-uniform lengths, retaining contained reads improves assembly contiguity and base-level accuracy. The inputs to RAFT include an error-corrected read file in FASTA/FASTQ format and an all-vs-all alignment file in PAF format. It performs read fragmentation and outputs the fragmented reads in FASTA format. We recommend users to use hifiasm for the initial steps (read error correction, all-vs-all overlap computation) and also for the final step (assembly of fragmented reads). The assembly output format of hifiasm is described here. The RAFT-hifiasm workflow is recommended for long accurate reads with non-uniform length distribution (e.g., ONT Duplex, or a mixture of ONT Duplex and HiFi reads). ONT UL reads can optionally be integrated during the final assembly step. |
Genobioinfo Cluster: How to use |
RaGOO | A tool to order and orient genome assembly contigs via Minimap2 alignments to a reference genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ragout | Ragout (Reference-Assisted Genome Ordering UTility) is a tool for chromosome assembly using multiple references. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RagTag | RagTag, the successor to RaGOO, is a command line tool for reference-guided genome assembly improvement. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAMPART | RAMPART is a configurable pipeline for de novo assembly of DNA sequence data. RAMPART is not a de novo assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ranbow | Ranbow is a haplotype assembler for polyploid genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ratatosk | Ratatosk is a phased error correction tool for erroneous long reads based on compacted and colored de Bruijn graphs built from accurate short reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Raven | Raven is a de novo genome assembler for long uncorrected reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ray | Assemble genomes in parallel using the message-passing interface | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REAPR | REAPR is a tool that evaluates the accuracy of a genome assembly using mapped paired end reads, without the use of a reference genome for comparison. It can be used in any stage of an assembly pipeline to automatically break incorrect scaffolds and flag other errors in an assembly for manual inspection. It reports mis-assemblies and other warnings, and produces a new broken assembly based on the error calls. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Redundans | Redundans is a pipeline that assists an assembly of heterozygous/polymorphic genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REFMAKER | REFMAKER is a command-line and user-friendly pipeline providing different tools to create nuclear references from genomic assemblies of shotgun libraries. |
Genobioinfo Cluster: How to use |
REPdenovo | REPdenovo is designed for constructing repeats directly from sequence (paired-end) reads. It based on the idea of frequent k-mer assembly. REPdenovo provides many functionalities, and can generate much longer repeats than existing tools. Internally, REPdenovo uses Jellyfish for k-mer counting, Velvet for assembly, and bwa to map reads on the Transposable Elements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
riboSeed | Pipeline for using ribosomal flanking regions to improve bacterial genome assembly. |
Genobioinfo Cluster: How to use |
runBNG | An easy way to run BioNano genomic analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
rust-mdbg | rust-mdbg is an ultra-fast minimizer-space de Bruijn graph (mdBG) implementation, geared towards the assembly of long and accurate reads such as PacBio HiFi. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SALSA | A tool to scaffold long read assemblies with Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Scaff10X | Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Scallop | Scallop is an accurate reference-based transcript assembler. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SDA | Segmental Duplication Assembler |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SECAPR | Used to process targeted sequencing (or Gene capture) data by applying assembly and subsequent mapping algorithms (reducing paralogs unlike Hybpiper). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SGA | SGA is a de novo genome assembler based on the concept of string graphs. The major goal of SGA is to be very memory efficient, which is achieved by using a compressed representation of DNA sequence reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Shasta | The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
shiver | shiver is a tool for mapping paired-end short reads to a custom reference sequence constructed using do novo assembled contigs, in order to minimise the biased loss of information that occurs from mapping to a reference that differs from the sample. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLICEMBLER | SLICEMBLER is a meta-assembler designed for ultra-deep sequencing data/ | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLR-superscaffolder | This is a scaffold assembler designed for stLFR reads. It uses the link-reads information from stLFR reads to assemble contigs to scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
smartdenovo | SMARTdenovo is a de novo assembler for PacBio and Oxford Nanopore (ONT) data. It produces an assembly from all-vs-all raw read alignments without an error correction stage. It also provides tools to generate accurate consensus sequences, though a platform dependent consensus polish tools (e.g. Quiver for PacBio or Nanopolish for ONT) are still required for higher accuracy. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SMRTLink | SMRT Link is the web-based end-to-end workflow manager for the Sequel™ System. (installed in mode command line on our cluster) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
soap.coverage | Can calculate sequencing coverage or physical coverage as well as duplication rate and details of specific block for each segments and whole genome by using SOAP, Blat, Blast, BlastZ, mummer and MAQ aligement results with multi-thread. Gzip file supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SOAPdenovo | ttSOAPdenovo is a novel short-read assembly method that can build a de novo draft assembly for the human-sized genomes. The program is specially designed to assemble Illumina GA short reads. It creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost effective way.
|
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SOAPdenovo-Trans | SOAPdenovo-Trans is a de novo transcriptome assembler basing on the SOAPdenovo framework, adapt to alternative splicing and different expression level among transcripts. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SPAdes | SPAdes ヨ St. Petersburg genome assembler ヨ is intended for both standard isolates and single-cell MDA bacteria assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SRAsembler | SRAssembler (Selective Recursive local Assembler) is a modular pipeline program that can assemble genomic DNA reads into contigs that are homologous to a query DNA or protein sequence. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SSPACE | SSPACE standard is a stand-alone program for scaffolding pre-assembled contigs using NGS paired-read data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SSPACE-LongRead | SSPACE-LongRead is a stand-alone program for scaffolding pre-assembled contigs using long reads (e.g. PacBio RS reads). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
StringTie | StringTie is a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts. It uses a novel network flow algorithm as well as an optional de novo assembly step to assemble and quantitate full-length transcripts representing multiple splice variants for each gene locus. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Supernova | Supernova is a software package for de novo assembly from Chromium Linked-Reads that are made from a single whole-genome library from an individual DNA source. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SuperTAD | SuperTAD is an open-source command-line TAD detection package written in C++. It takes either raw or normalized Hi-C contact maps as inputs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TACO | Multi-sample transcriptome assembly from RNA-Seq. |
Genobioinfo Cluster: How to use |
Tapestry | Tapestry is a tool to validate and edit small eukaryotic genome assemblies using long sequence reads. It is designed to help identify complete chromosomes, symbionts, haplotypes, complex features and errors in close-to-complete genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TGICL | This package automates clustering and assembly of a large EST/mRNA dataset. The clustering is performed by a slightly modified version of NCBI's megablast , and the resulting clusters are then assembled using CAP3 assembly program. TGICL starts with a large multi-FASTA file (and an optional peer quality values file) and outputs the assembly files as produced by CAP3. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TGSGapFiller | A gap filling tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tigmint | Tigmint identifies and corrects misassemblies using linked reads from 10x Genomics Chromium. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Transrate | Transrate is software for de-novo transcriptome assembly quality analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
trinityrnaseq | Trinity, developed at the Broad Institute and the Hebrew University of Jerusalem, represents a novel method for the efficient and robust de novo reconstruction of transcriptomes from RNA-seq data. Trinity combines three independent software modules: Inchworm, Chrysalis, and Butterfly, applied sequentially to process large volumes of RNA-seq reads. Trinity partitions the sequence data into many individual de Bruijn graphs, each representing the transcriptional complexity at at a given gene or locus, and then processes each graph independently to extract full-length splicing isoforms and to tease apart transcripts derived from paralogous genes.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trycycler | Trycycler is a tool for generating consensus long-read assemblies for bacterial genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Unicycler | Unicycler is an assembly pipeline for bacterial genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Unimap | A EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment. |
Genobioinfo Cluster: How to use |
Velvet | Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Verkko | Verkko is a hybrid genome assembly pipeline developed for telomere-to-telomere assembly of PacBio HiFi and Oxford Nanopore reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ViennaRNA | Vienna RNA package allows RNA Secondary Structure Prediction and Comparison | Genologin Cluster: How to use < Genobioinfo Cluster: How to use |
ViQuaS | An improved reconstruction pipeline for viral quasispecies spectra generated by next-generation sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Wengan | An accurate and ultra-fast genome assembler | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wgd | Python package and CLI for whole genome duplication analyse |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
WGS-Assembler | Celera Assembler is a de novo whole-genome shotgun (WGS) DNA sequence assembler. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wtdbg | Wtdbg2 is a de novo sequence assembler for long noisy reads produced by PacBio or Oxford Nanopore Technologies (ONT). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
YaHS | YaHS is a scaffolding tool using Hi-C data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
CellRanger ARC | Cell Ranger ARC's pipelines analyze sequencing data produced from Chromium Single Cell Multiome ATAC + Gene Expression. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger ATAC | Cell Ranger ATAC is a set of analysis pipelines that process Chromium Single Cell ATAC data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
diffTF | Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressor. |
Genobioinfo Cluster: How to use |
NucleoATAC | Python package for calling nucleosomes using ATAC-seq data. Also includes general scripts for working with paired-end ATAC-seq data (or potentially other paired-end data). |
Genobioinfo Cluster: How to use |
ROSE | To create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TOBIAS | TOBIAS is a collection of command-line bioinformatics tools for performing footprinting analysis on ATAC-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
BamBam | several simple-to-use tools to facilitate NGS analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BISCUIT | BISulfite-seq CUI Toolkit (BISCUIT) is a utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data. |
Genobioinfo Cluster: How to use |
Bismark | A tool to map bisulfite converted sequence reads and determine cytosine methylation states |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BisSNP | Accurate combined SNP/Methylation calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BS-Seeker2- | BS-Seeker2 is a seamless and versatile pipeline for accurately and fast mapping the bisulfite-treated reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BS-SNPer | BS-SNPer is an ultrafast and memory-efficient package, a program for BS-Seq variation detection from alignments in standard BAM/SAM format using approximate Bayesian modeling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BSMAP | BSMAP is a short reads mapping software for bisulfite sequencing reads. Bisulfite treatment converts unmethylated Cytosines into Uracils (sequenced as Thymine) and leave methylated Cytosines unchanged, hence provides a way to study DNA cytosine methylation at single nucleotide resolution. BSMAP aligns the Ts in the reads to both Cs and Ts in the reference | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bwa-meth | Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MethylScore | Identification of differentially methylated regions between multiple epigenomes from BS-treated read mappings via methylated region calling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nf-core workflows | This module provide access to workflows nf-core, there are automatically downloaded into your home. More info at nf-core/config page.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
rastair | Fast processing of TET-assisted pyridine borane sequencing (TAPS)-based sequencing data. |
Genobioinfo Cluster: How to use |
segemehl | segemehl is a software to map short sequencer reads to reference genomes. segemehl implements a matching strategy based on enhanced suffix arrays (ESA). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Trim Galore | A wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files, with some extra functionality for MspI-digested RRBS-type (Reduced Representation Bisufite-Seq) libraries. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ViewBS | A powerful toolkit for visualization of high-throughput bisulfite sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
Alfred | BAM Statistics, Feature Counting and Annotation | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AlleleSeq | pipeline which constructs a diploid personal genome from genomic sequence variants of a family trio, including SNPs, indels and structural variants and maps functional genomic data onto this personal genome. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
BroadPeak | BroadPeak broad peak calling algorithm for diffuse ChIP-seq datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
csem | CSEM is a ChIP-Seq multi-read allocator. CSEM stands for ChIP-Seq multi-read allocation using Expectation-Maximization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DANPOS2 | A toolkit for Dynamic Analysis of Nucleosome and Protein Occupancy by Sequencing, version 2 | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
diffTF | Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressor. |
Genobioinfo Cluster: How to use |
GEM | GEM is a scientific software for studying protein-DNA interaction at high resolution using ChIP-seq/ChIP-exo data. It can also be applied to CLIP-seq and Branch-seq data. |
Genobioinfo Cluster: How to use |
IDR | The IDR (Irreproducible Discovery Rate) framework is a unified approach to measure the reproducibility of findings identified from replicate experiments and provide highly stable thresholds based on reproducibility. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MACS | We present Model-based Analysis of ChIP-Seq (MACS) on short reads sequencers such as Genome Analyzer (Illumina / Solexa). MACS empirically models the length of the sequenced ChIP fragments, which tends to be shorter than sonication or library construction size estimates, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome sequence, allowing for more sensitive and robust prediction. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, is publicly available open source, and can be used for ChIP-Seq with or without control samples. | Genologin Cluster: in Python-2.7.2 Genobioinfo Cluster: How to use |
MACS2 | Model-based Analysis of ChIP-Seq |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MAGIC | A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MuMRescueLite | MuMRescueLite is the software that enable to use the tag sequencies of mapped to multiple loci to the genome, for the expression analysis. At the mapping of short sequence tags of CAGE or ChIP-Seq to the genome, sequence tags that map to multiple genomic loci (multi-mapping tags or MuMs), are routinely omitted from further analysis, leading to experimental bias and reduced coverage. MuMRescueLite probabilistically reincorporates multi-mapping tags into mapped short read data with acceptable computational requirements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ROSE | To create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RSEG | The RSEG software package is aimed to analyze ChIP-Seq data, especially for identifying genomic regions and their boundaries marked by diffusive histone modification markers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SICER2 | Redesigned and improved version of the original ChIP-seq broad peak calling tool SICER. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
Back_to_sequences | Given a set of kmers (fasta / fastq <.gz> format) and a set of sequences (fasta / fastq <.gz> format), this tool will extract the sequences containing some of those kmers. |
Genobioinfo Cluster: How to use |
BOLDigger2 | An even better Python program to query .fasta files against the COI database of www.boldsystems.org |
Genobioinfo Cluster: How to use |
DIAMOND2GO | Diamond2GO is a set of tools that can rapidly assign gene ontology and perform enrichment for functional genomics. | Genobioinfo Cluster: How to use |
ganon | ganon classifies DNA sequences against large sets of genomic reference sequences efficiently. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gget | gget enables efficient querying of genomic reference databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Infernal | Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a special case of profile stochastic context-free grammars called covariance models (CMs). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_datasets | NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases. You can use it to find and download sequence, annotation, and metadata for genes and genomes using our command-line interface (CLI) tools or NCBI Datasets web interface. |
Genobioinfo Cluster: How to use |
OMA | The OMA (Orthologous MAtrix) database is a well-established resource for identifying orthologs among publicly available complete genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OrthoLoger | Standalone pipeline for delineation of orthologs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SIFT4G_Annotator | Annotating VCF files using the SIFT4G databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
BisSNP | Accurate combined SNP/Methylation calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BS-SNPer | BS-SNPer is an ultrafast and memory-efficient package, a program for BS-Seq variation detection from alignments in standard BAM/SAM format using approximate Bayesian modeling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cgmaptools | Command-line Toolset for Bisulfite Sequencing Data Analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CHEUI | CHEUI (Methylation (CH3) Estimation Using Ionic current) is an RNA modification detection software for Oxford Nanopore direct RNA sequencing data. CHEUI can be used to detect m6A and m5C in individual reads at single-nucleotide resolution from any sample (e.g. single condition), or detect differential m6A or m5C between any two conditions. CHEUI uses a two-stage deep learning method to detect m6A and m5C transcriptome-wide at single-read and single-site resolution in any sequence context (i.e. without any sequence constrains). |
Genobioinfo Cluster: How to use |
DeepSignal | Detecting methylation using signal-level features from Nanopore sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mCaller | This program is designed to call m6A from nanopore data using the differences between measured and expected currents. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MethyLasso | A segmentation approach to analyze DNA methylation patterns and identify differentially methylation regions from whole-genome datasets. |
Genobioinfo Cluster: How to use |
MethylDackel | MethylDackel (formerly named PileOMeth, which was a temporary name derived due to it using a PILEup to extract METHylation metrics) will process a coordinate-sorted and indexed BAM or CRAM file containing some form of BS-seq alignments and extract per-base methylation metrics from them. MethylDackel requires an indexed fasta file containing the reference genome as well. |
Genobioinfo Cluster: How to use |
MethylScore | Identification of differentially methylated regions between multiple epigenomes from BS-treated read mappings via methylated region calling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
modbam2bed | A program to aggregate modified base counts stored in a modified-base BAM file to a bedMethyl file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nf-core workflows | This module provide access to workflows nf-core, there are automatically downloaded into your home. More info at nf-core/config page.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
rastair | Fast processing of TET-assisted pyridine borane sequencing (TAPS)-based sequencing data. |
Genobioinfo Cluster: How to use |
S3V2_IDEAS_ESMP | A package for normalizing, denoising and integrating epigenomic datasets across different cell types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tombo | Tombo is a suite of tools primarily for the identification of modified nucleotides from raw nanopore sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
ACFS | Accurate CircRNA Finder Suite. Discovering circRNAs from RNA-Seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Alfred | BAM Statistics, Feature Counting and Annotation | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AlleleSeq | pipeline which constructs a diploid personal genome from genomic sequence variants of a family trio, including SNPs, indels and structural variants and maps functional genomic data onto this personal genome. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
Arriba | Arriba is a command-line tool for the detection of gene fusions from RNA-Seq data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BlockClust | BlockClust is an efficient approach to detect transcripts with similar processing patterns. We propose a novel way to encode expression profiles in compact discrete structures, which can then be processed using fast graph-kernel techniques. BlockClust allows both clustering and classification of small non-coding RNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bustools | bustools is a program for manipulating BUS files for single cell RNA-Seq datasets. It can be used to error correct barcodes, collapse UMIs, produce gene count or transcript compatibility count matrices, and is useful for many other tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
canu | A single molecule sequence assembler for genomes large and small. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellBender | CellBender is a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger | Cell Ranger is a set of analysis pipelines that processes Chromium single cell 3’ RNA-seq output to align reads, generate gene-cell matrices and perform clustering and gene expression analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CellRanger ARC | Cell Ranger ARC's pipelines analyze sequencing data produced from Chromium Single Cell Multiome ATAC + Gene Expression. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
chimerascan | chimerascan is a software package that detects gene fusions in paired-end RNA sequencing (RNA-Seq) datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ChimPipe | ChimPipe is a computational method for the detection of novel transcription-induced chimeric transcripts and fusion genes from Illumina Paired-End RNA-seq data. It combines junction spanning and paired-end read information to accurately detect chimeric splice junctions at base-pair resolution. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ChopStitch | Exon annotation and splice graph construction using transcriptome assembly and whole genome sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
circfull | A tool to detect and quantify full-length circRNA isoforms from circFL-seq. |
Genobioinfo Cluster: How to use |
circtools | A modular, python-based framework for circRNA-related tools that unifies several functionalities in a single, command line driven software. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CIRI | CIRI (circRNA identifier) is a novel chiastic clipping signal based algorithm, which can unbiasedly and accurately detect circRNAs from transcriptome data by employing multiple filtration strategies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CleaveLand4 | Analysis of degradome data to find sliced miRNA and siRNA targets | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ContextMap2 | Fast and accurate context-based RNA-seq mapping. ContextMap determines the most likely origin of a read by evaluating the context of the read in the form of alignments of other reads to the same genomic region. In the original implementation, the focus was on improving initial mappings provided by other mapping tools. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CroCo | A program to detect potential cross contaminations in HTS assembled transcriptomes using expression level quantification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cufflinks | Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. It accepts aligned RNA-Seq reads and assembles the alignments into a parsimonious set of transcripts. Cufflinks then estimates the relative abundances of these transcripts based on how many reads support each one, taking into account biases in library preparation protocols. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
devCellPy | devCellPy is a Python package designed for hierarchical multilayered classification of cells based on single-cell RNA-sequencing (scRNA-seq). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
diffTF | Quantification of Differential Transcription Factor Activity and Multiomics-Based Classification into Activators and Repressor. |
Genobioinfo Cluster: How to use |
drap | De novo RNA-seq Assembly Pipeline |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ErmineJ | ErmineJ performs analyses of gene sets in high-throughput genomics data such as gene expression profiling studies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ERVmap | ERVmap is one part curated database of human proviral ERV loci and one part a stringent algorithm to determine which ERVs are transcribed in their RNA seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
eXpress | eXpress is a streaming tool for quantifying the abundances of a set of target sequences from sampled subsequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FEELnc | FlExible Extraction of LncRNA. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FLAIR | FLAIR (Full-Length Alternative Isoform analysis of RNA) for the correction, isoform definition, and alternative splicing analysis of noisy reads. FLAIR has primarily been used for nanopore cDNA, native RNA, and PacBio sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GUSHR | Assembly-free construction of UTRs from short read RNA-Seq data on the basis of coding sequence annotation. This tool has been adapted to the format needs of AUGUSTUS/BRAKER and employs GeMoMa for generating UTRs from RNA-Seq coverage data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IRFinder | Detecting intron retention from RNA-Seq experiments |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IsoLasso | IsoLasso is an algorithm to assemble transcripts and estimate their expression levels from RNA-Seq reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kallisto | kallisto is a program for quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LIQA | Long-read Isoform Quantification and Analysis) is an Expectation-Maximization based statistical method to quantify isoform expression and detect differential alternative splicing (DAS) events using long-read RNA-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mmannot | mmannot annotates reads, or quantifies the features. For instance, suppose that you have sequenced your organism of interest with sRNA-Seq (RNA-Seq works too), and you want to know how many times you have sequenced miRNAs, rRNAs, tRNAs, etc. This is what mmannot does. A huge proportion of the reads may actually map at several locations. These multi-mapping reads are usually handled poorly by similar quantification tools. In our methods, when a read maps at several locations, all these locations are inspected: If all these locations belong to the same feature (e.g. miRNAs, in case of a duplicated gene family), the read is still annotated as a miRNA. If the location belong to different features (e.g. 3'UTR and miRNA), the read is ambiguous, and is flagged as 3'UTR--miRNA. In case 1, we say when have rescued a read. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mmquant | This tool counts the number of reads (produced by RNA-Seq) per gene, much like HTSeq-count and featureCounts. The main difference with other tools is that multi-mapping reads are counted differently: if a read is mapped to gene A, gene B, and gene C, the tool will create a new feature, "geneA--geneB--geneC", that will be counted once. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MuMRescueLite | MuMRescueLite is the software that enable to use the tag sequencies of mapped to multiple loci to the genome, for the expression analysis. At the mapping of short sequence tags of CAGE or ChIP-Seq to the genome, sequence tags that map to multiple genomic loci (multi-mapping tags or MuMs), are routinely omitted from further analysis, leading to experimental bias and reduced coverage. MuMRescueLite probabilistically reincorporates multi-mapping tags into mapped short read data with acceptable computational requirements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoCount | NanoCount estimates transcripts abundance from Oxford Nanopore direct-RNA sequencing datasets, using an expectation-maximization approach like RSEM, Kallisto, salmon, etc to handle the uncertainty of multi-mapping reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
nf-core workflows | This module provide access to workflows nf-core, there are automatically downloaded into your home. More info at nf-core/config page.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pizzly | A program for detecting gene fusions from RNA-Seq data of cancer samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Portcullis | Splice junction analysis and filtering from BAM files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PSGInfer | PSGInfer is a software package for the analysis of RNA-Seq data with probabilistic splice graph (PSG) models of gene alternative processing (splicing, transcription initiation, and polyadenylation) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PSI-Sigma | Percent Spliced-In (PSI) values are commonly used to report alternative pre-mRNA splicing (AS) changes. However, previous PSI-detection methods are limited to specific types of AS events. PSI-Sigma is using a new splicing index (PSIΣ) that is more flexible, can incoporate novel junctions, and can compute PSI values of individual exons in complex splicing events. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REDItools | REDItools are python scripts developed with the aim to study RNA editing at genomic scale by next generation sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REINDEER | Efficient indexing of k-mer presence and abundance in sequencing datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
rMATS | MATS is a computational tool to detect differential alternative splicing events from RNA-Seq data. From the RNA-Seq data, MATS can automatically detect and analyze alternative splicing events corresponding to all major types of alternative splicing patterns. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
rMATS turbo | rMATS turbo is the C/Cython version of rMATS (refer to http://rnaseq-mats.sourceforge.net) : Multivariate Analysis of Transcript Splicing (MATS). The major difference between rMATS turbo and rMATS is speed and space usage. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RSEM | RSEM (RNA-Seq by Expectation-Maximization) is a software package for estimating gene and isoform expression levels from RNA-Seq data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RSeQC | RSeQC package provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Salmon | Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using lightweight alignments |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SAMap | Mapping single-cell RNA sequencing datasets from evolutionarily distant organisms. |
Genobioinfo Cluster: How to use |
SARTools | SARTools is a R package dedicated to the differential analysis of RNA-seq data. It provides tools to generate descriptive and diagnostic graphs, to run the differential analysis with one of the well known DESeq2 or edgeR packages and to export the results into easily readable tab-delimited files. |
Genologin Cluster: In R-3.4.3 Genobioinfo Cluster: Ask for Install |
sleuth | sleuth is a program for analysis of RNA-Seq experiments for which transcript abundances have been quantified with kallisto. |
Genologin Cluster: in R-3.5.3 (with module load compiler/gcc-7.2.0) Genobioinfo Cluster: Ask for Install |
SOAPdenovo-Trans | SOAPdenovo-Trans is a de novo transcriptome assembler basing on the SOAPdenovo framework, adapt to alternative splicing and different expression level among transcripts. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SortMeRNA | SortMeRNA is a software designed to rapidly filter ribosomal RNA fragments from metatransriptomic data produced by next-generation sequencers. It is capable of handling large RNA databases and sorting out all fragments matching to the database with high accuracy and specificity | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
souporcell | souporcell is a method for clustering mixed-genotype scRNAseq experiments by individual. |
Genobioinfo Cluster: How to use |
SpaceRanger | Space Ranger is a set of analysis pipelines that process Visium spatial RNA-seq output and brightfield and fluorescence microscope images in order to detect tissue, align reads, generate feature-spot matrices, perform clustering and gene expression analysis, and place spots in spatial context on the slide image. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SpliceGrapher | SpliceGrapher predicts alternative splicing patterns and produces splice graphs that capture in a single structure the ways a gene's exons may be assembled. It enhances gene models using evidence from next-generation sequencing and EST alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SpliceTools | A suite of downstream RNA splicing analysis tools to investigate mechanisms and impact of alternative splicing, Nucleic Acids Research, 2023. |
Genobioinfo Cluster: How to use |
SQANTI3 | SQANTI3 is the newest version of the SQANTI tool (publication) that merges features from SQANTI, (code repository) and SQANTI2 (code repository), together with new additions. SQANTI3 will continue as an integrated development aiming to providing you the best characterization possible for your new long read-defined transcriptome. SQANTI3 is the first module of the Functional IsoTranscriptomics (FIT) framework, that also includes IsoAnnot and tappAS. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SQuIRE | SQuIRE reveals locus-specific regulation of interspersed repeat expression, Nucleic Acids Research | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
STAR | RNA-seq aligner |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
STAR-Fusion | STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set (using a GTF file, ideally the same annotation file used during the STAR genome index building process during the intial STAR setup). |
Genobioinfo Cluster: How to use |
StringTie | StringTie is a fast and highly efficient assembler of RNA-Seq alignments into potential transcripts. It uses a novel network flow algorithm as well as an optional de novo assembly step to assemble and quantitate full-length transcripts representing multiple splice variants for each gene locus. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Subread | A tool kit for processing next-gen sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SUPPA | Fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TACO | Multi-sample transcriptome assembly from RNA-Seq. |
Genobioinfo Cluster: How to use |
TAMA | Transcriptome Annotation by Modular Algorithms: this software was designed for processing Iso-Seq data and other long read transcriptome data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TAPAS | Tool for Alternative Polyadenylation Site Analysis. |
Genobioinfo Cluster: How to use |
TPMCalculator | TPMCalculator quantifies mRNA abundance directly from the alignments by parsing BAM files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Transrate | Transrate is software for de-novo transcriptome assembly quality analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trinotate | Trinotate is a comprehensive annotation suite designed for automatic functional annotation of transcriptomes, particularly de novo assembled transcriptomes, from model or non-model organisms. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Whippet | Lightweight and Fast RNA-seq quantification at the event-level | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
chimerascan | chimerascan is a software package that detects gene fusions in paired-end RNA sequencing (RNA-Seq) datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pizzly | A program for detecting gene fusions from RNA-Seq data of cancer samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
STAR-Fusion | STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set (using a GTF file, ideally the same annotation file used during the STAR genome index building process during the intial STAR setup). |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
Carthagene | CarthaGene is a genetic/radiated hybrid mapping software. CarthaGene looks for multiple populations maximum likelihood consensus maps using a fast EM algorithm for maximum likelihood estimation and powerful ordering algorithms. CarthaGene can handle data made up of several distinct populations which t may each be either F2 backcross, recombinant inbred lines, F2 t intercross, phase known outbreds and/or radiated hybrids (haploid t and diploid data). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Chromonomer | Chromonomer is a program designed to integrate a genome assembly with a genetic map. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OMBlast | An alignment tool for optical mapping data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
ASMC | Ascertained Sequentially Markovian Coalescent (contains ASMC and an extension, FastSMC, together with python bindings for both) |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BAMscorer | BAMscorer can be used to conduct genomic assignment tests from BAM files. Assignments can be done on genomic regions, inversions, and whole-genome datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Beagle_Utilities | Simple utility programs for manipulating text files, especially VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BeXY | BeXY is a tool to jointly infer sex karyotypes and sex-linked scaffolds from read count data. It can also be used to genetically sex single individuals. BeXY is a command-line tool, and we provide an easy-to-use R package to visualize and parse the results. |
Genobioinfo Cluster: How to use |
demuxlet | Genetic multiplexing of barcoded single cell RNA-seq |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EMMAX | EMMAX is a statistical test for large scale human or model organism association mapping accounting for the sample structure. In addition to the computational efficiency obtained by EMMA algorithm, EMMAX takes advantage of the fact that each loci explains only a small fraction of complex traits, which allows us to avoid repetitive variance component estimation procedure, resulting in a significant amount of increase in computational time of association mapping using mixed model. |
Genobioinfo Cluster: How to use |
evalAdmix | evalAdmix allows to evaluate the results of an admixture analysis (i.e. the result of applying ADMIXTURE, STRUCTURE, NGSadmix and similar). |
Genobioinfo Cluster: How to use |
gamevar.f90 | A software package for calculating individual genetic diversity. |
Genobioinfo Cluster: How to use |
GONE | This program calculates and uses linkage disequilibrium at genomic marker loci to infer the effective population size trajectories over a period of about 100-200 hundred generations back in time. |
Genobioinfo Cluster: How to use |
hap-ibd | The hap-ibd program detects identity-by-descent (IBD) segments and homozygosity-by-descent (HBD) segments in phased genotype data. |
Genobioinfo Cluster: How to use |
moments | moments implements methods for inferring demographic history and patterns of selection from genetic data, based on solutions to the diffusion approximations to the site-frequency spectrum (SFS). |
Genobioinfo Cluster: How to use |
PoPoolation | PoPoolation is a pipeline for analysing pooled next generation sequencing data. Currently PoPoolation allows to calculate Tajima’s Pi, Watterson’s Theta and Tajima’s D with a sliding window approach for chromosomes or for set of genes. |
Genobioinfo Cluster: How to use |
POPS | The POPS program performs inference of ancestry distribution models. |
Genobioinfo Cluster: How to use |
pyrho | Fast demography-aware inference of fine-scale recombination rates based on fused-LASSO. |
Genobioinfo Cluster: How to use |
quickLD | High-performance Computation of Linkage Disequilibrium on CPUs and GPUs. |
Genobioinfo Cluster: How to use |
READv2 | Relationship Estimation from Ancient DNA version 2. |
Genobioinfo Cluster: How to use |
Vt | A tool set for short variant discovery in genetic sequence data. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
AlphaImpute | AlphaImpute is a software package for imputing and phasing genotype data in diploid populations with pedigree information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bgc | bgc implements Bayesian estimation of genomic clines to quantify introgression at many loci. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Eagle | The Eagle software estimates haplotype phase either within a genotyped cohort or using a phased reference panel. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMMAX | EMMAX is a statistical test for large scale human or model organism association mapping accounting for the sample structure. In addition to the computational efficiency obtained by EMMA algorithm, EMMAX takes advantage of the fact that each loci explains only a small fraction of complex traits, which allows us to avoid repetitive variance component estimation procedure, resulting in a significant amount of increase in computational time of association mapping using mixed model. |
Genobioinfo Cluster: How to use |
FaST-LMM | FaST-LMM (Factored Spectrally Transformed Linear Mixed Models) is a program for performing genome-wide association studies (GWAS) on large data sets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fineRADstructure | A complete, easy to use, and fast population inference package for RAD-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GCTA | GCTA (Genome-wide Complex Trait Analysis) was originally designed to estimate the proportion of phenotypic variance explained by genome- or chromosome-wide SNPs for complex traits (the GREML method), and has subsequently extended for many other analyses to better understand the genetic architecture of complex traits. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GEMMA | GEMMA is the software implementing the Genome-wide Efficient Mixed Model Association algorithm for a standard linear mixed model and some of its close relatives for genome-wide association studies (GWAS). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KING | KING is a toolset that makes use of high-throughput SNP data typically seen in a genome-wide association study (GWAS) or a sequencing project. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmersGWAS | A library for running k-mers based GWAS. |
Genobioinfo Cluster: How to use |
lcMLkin | lcMLkin is a C++ program that allows users to infer biological relatedness from low coverage 2nd generation sequencing data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Lep-MAP3 | Lep-MAP3 is a novel and free software for linkage mapping. It can construct linkage maps on very large number of markers and individuals on single or multiple families. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MACH | MACH is a Markov Chain based haplotyper that can resolve long haplotypes or infer missing genotypes in samples of unrelated individuals. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MapThin | Reduce the number of SNPs in a gene marker dense map computed by PLINK. First, by eliminating linked SNPs. Then, by applying different criteria. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Minimac4 | Minimac4 is a latest version in the series of genotype imputation software - preceded by Minimac3 (2015), Minimac2 (2014), minimac (2012) and MaCH (2010). Minimac4 is a lower memory and more computationally efficient implementation of the original algorithms with comparable imputation quality. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
piMASS | posterior inference via Model Averaging and Subset Selection: performs genome-wide joint analysis of all SNPs in association with a phenotype. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PLINK | PLINK is a free, open-source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Scoary | Scoary is designed to take the gene_presence_absence.csv file from Roary as well as a traits file created by the user and calculate the assocations between all genes in the accessory genome and the traits. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TASSEL | Trait Analysis by aSSociation, Evolution and Linkage. TASSEL has multiple functions, including association study, evaluating evolutionary relationships, analysis of linkage disequilibrium, principal component analysis, cluster analysis, missing data imputation and data visualization for large sets of data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
ACACIA | Allele CAlling proCedure for Illumina Amplicon sequencing data: This workflow aims at extracting allele information out of paired-end Illumina FASTQC files. |
Genobioinfo Cluster: How to use |
AmpliSAT | AmpliSAT (Amplicon Sequencing Analysis Tools) are a set of online tools that make easy the analysis of Amplicon Sequencing experiments. |
Genobioinfo Cluster: How to use |
Beagle_Utilities | Simple utility programs for manipulating text files, especially VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cgMLSTFinder | Core genome Multi-Locus Sequence Typing cgMLSTFinder runs KMA <1> against a chosen core genome MLST (cgMLST) database and outputs the detected alleles in a matrix file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
evalAdmix | evalAdmix allows to evaluate the results of an admixture analysis (i.e. the result of applying ADMIXTURE, STRUCTURE, NGSadmix and similar). |
Genobioinfo Cluster: How to use |
fastPHASE | A tool for genotype imputation and estimating missing haplotypes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
flare | The flare program uses a set of reference haplotypes to infer the ancestry of each allele in a set of admixed study samples. The flare program is fast, accurate, and memory-efficient. | Genobioinfo Cluster: How to use |
GLIMPSE | GLIMPSE is a phasing and imputation method for large-scale low-coverage sequencing studies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
graphtyper | graphtyper is a graph-based variant caller capable of genotyping population-scale short read data sets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
locator | A supervised machine learning method for predicting the geographic origin of a sample from genotype or sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LRez | Standalone tool and library allowing to work with barcoded linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mgatk | A mitochondrial genome analysis toolkit. |
Genobioinfo Cluster: How to use |
PanGenie | A short-read genotyper for various types of genetic variants (such as SNPs, indels and structural variants) represented in a pangenome graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pixy | pixy is a command-line tool for painlessly and correctly estimating average nucleotide diversity within (π) and between (dxy) populations from a VCF. | Genobioinfo Cluster: How to use |
READ | READ is a method to infer the degree of relationship (up to second degree, i.e. nephew/niece-uncle/aunt, grandparent-grandchild or half-siblings) for a pair of low-coverage individuals. |
Genobioinfo Cluster: How to use |
STITCH | STITCH is an R program for reference panel free, read aware, low coverage sequencing genotype imputation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi | SVJedi is a structural variation (SV) genotyper for long read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi-graph | SVJedi-graph is a structural variation (SV) genotyper for long read data. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
Beagle_Utilities | Simple utility programs for manipulating text files, especially VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ChromoPainter | ChromoPainter is a tool for finding haplotypes in sequence data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastPHASE | A tool for genotype imputation and estimating missing haplotypes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
flare | The flare program uses a set of reference haplotypes to infer the ancestry of each allele in a set of admixed study samples. The flare program is fast, accurate, and memory-efficient. | Genobioinfo Cluster: How to use |
hap.py | Genobioinfo Cluster: How to use | |
HaploHiC | Comprehensive haplotype division of Hi-C PE-reads based on local contacts ratio. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Haplostrips | Haplostrips produce plots that depict variants in a genomic window among different samples. Visualize similarities between haplotypes with respect to a reference haplotype through haplotype clustering and sorting, useful for revealing hidden population structure. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Haploview | Haploview is designed to simplify and expedite the process of haplotype analysis by providing a common interface to several tasks relating to such analyses. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HAT-phasing | HAT is a haplotype assembly tool that use NGS and TGS data along a reference genome to reconstruct haplotypes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Migrate | Migrate estimates effective population sizes,past migration rates between n population assuming a migration matrix model with asymmetric migration rates and different subpopulation sizes, and population divergences or admixture. | Genobioinfo Cluster: How to use |
nPhase | nPhase is a ploidy agnostic tool developed in python which predicts the haplotypes of a sample that was sequenced by both long and short reads by aligning them to a reference. It should work with any ploidy. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PredictHaplo | This software aims at reconstructing haplotypes from next-generation sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
READ | READ is a method to infer the degree of relationship (up to second degree, i.e. nephew/niece-uncle/aunt, grandparent-grandchild or half-siblings) for a pair of low-coverage individuals. |
Genobioinfo Cluster: How to use |
SNPsplit | SNPsplit is an allele-specific alignment sorter which is designed to read alignment files in SAM/BAM format and determine the allelic origin of reads that cover known SNP positions. |
Genobioinfo Cluster: How to use |
WGDI | WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidizations and cross-species genome alignments. | Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
ALLHiC | Phasing and scaffolding polyploid genomes based on Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Armatus | Multiresolution domain calling software for chromosome conformation capture interaction matrices. Armatus is a Topologically Associated Domain caller. Follow the Web page to know more about Armatus. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AutoHiC | AutoHiC is a deep learning tool that uses Hi-C data to support genome assembly. It can automatically correct errors during genome assembly and generate genomes at the chromosome level. |
Genobioinfo Cluster: How to use |
Chromosight | Python package to detect chromatin loops (and other patterns) in Hi-C contact maps. |
Genobioinfo Cluster: How to use |
Cooler | Cooler is a support library for a sparse, compressed, binary persistent storage format, also called cooler, used to store genomic interaction data, such as Hi-C contact matrices. |
Genobioinfo Cluster: How to use |
coolpuppy | A versatile tool to perform pile-up analysis on Hi-C data in .cool format. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DARIC | A complete framework for identifying quantitatively differential compartments from Hi-C and Micro-C data.
|
Genobioinfo Cluster: How to use |
FALCON-Phase | FALCON-Phase integrates PacBio long-read assemblies with Phase Genomics Hi-C data to create phased, diploid, chromosome-scale scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FAN-C | Framework for the ANalysis of C-like data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphUnzip | Unzip assembly graphs with Hi-C data and/or long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HiCExplorer | HiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Hickit | Hickit is a set of tools initially developed to process diploid single-cell Hi-C data. It extracts contact pairs from read alignment, identifies phases of contacts overlapping with SNPs of known phases, imputes missing phases, infers the 3D structure of a single cell and visualizes the structure. |
Genobioinfo Cluster: How to use |
HiCLift | A fast and efficient tool for converting chromatin interaction data between genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
juicebox_scripts | A collection of scripts for working with Hi-C data, Juicebox, and other genomic file formats | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Linker | Linker is a suite of C++ tools useful for interpreting long and linked read sequencing of cancer genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Orca | A deep learning sequence modeling framework for multiscale genome structure prediction. |
Genobioinfo Cluster: How to use |
PretextMap | Paired REad TEXTure Mapper. Converts SAM formatted read pairs into genome contact maps. Full suite of Pretext tools. |
Genobioinfo Cluster: How to use |
PretextView | OpenGL Powered Pretext Contact Map Viewer. |
Genobioinfo Cluster: How to use |
SALSA | A tool to scaffold long read assemblies with Hi-C data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SELFISH | SELFISH is a tool for finding differential chromatin interactions between two Hi-C contact maps. |
Genobioinfo Cluster: How to use |
sslHiC | sslHiC is a computational framework for comparative analyses of Hi-C data, including reproducibility measurement and differential chromatin interaction (DCI) detection. |
Genobioinfo Cluster: How to use |
SuperTAD | SuperTAD is an open-source command-line TAD detection package written in C++. It takes either raw or normalized Hi-C contact maps as inputs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
YaHS | YaHS is a scaffolding tool using Hi-C data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
ANGEL | Robust Open Reading Frame prediction (ANGLE re-implementation) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CARNAC-LR | Clustering coefficient-based Acquisition of RNA Communities in Long Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cDNA_Cupcake | cDNA_Cupcake is a miscellaneous collection of Python and R scripts used for analyzing sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cogent | Cogent is a tool for reconstructing the coding genome using high-quality full-length transcriptome sequences. It is designed to be used on Iso-Seq data and in cases where there is no reference genome or the ref genome is highly incomplete. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IsoSeq3 | Scalable De Novo Isoform Discovery from Single-Molecule PacBio Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SQANTI3 | SQANTI3 is the newest version of the SQANTI tool (publication) that merges features from SQANTI, (code repository) and SQANTI2 (code repository), together with new additions. SQANTI3 will continue as an integrated development aiming to providing you the best characterization possible for your new long read-defined transcriptome. SQANTI3 is the first module of the Functional IsoTranscriptomics (FIT) framework, that also includes IsoAnnot and tappAS. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TAMA | Transcriptome Annotation by Modular Algorithms: this software was designed for processing Iso-Seq data and other long read transcriptome data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
AGAT | Another Gff Analysis Toolkit: suite of tools to handle gene annotations in any GTF/GFF format. Some examples what AGAT can do: standardise any GTF/GFF file into a comprehensive GFF3 format (script with agat_sp prefix): add missing parent features (e.g. gene and mRNA if only CDS/exon exist). add missing features (e.g. exon and UTR). add missing mandatory attributes (i.e. ID, Parent). fix identifier to be uniq. fix feature location. remove duplicated features. group related features (if spread in different places in the file). sort features. merge overlapping loci into one single locus (only if option activated). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Alfred | BAM Statistics, Feature Counting and Annotation | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AMAS | Calculate summary statistics and manipulate multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Apptainer | Apptainer is an open source container platform designed to be simple, fast, and secure. | Genobioinfo Cluster: How to use |
ArrowGrid | The distribution is a parallel wrapper around the Arrow consensus framework within the SMRT Analysis Software |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ART | ART is a set of simulation tools to generate synthetic next-generation sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
atac dnase pipelines | ATAC-seq and DNase-seq processing pipeline. This pipeline is designed for automated end-to-end quality control and processing of ATAC-seq or DNase-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
awscli | The AWS Command Line Interface (AWS CLI) is an open source tool that enables you to interact with AWS services using commands in your command-line shell. | Genobioinfo Cluster: How to use |
bamaddrg | Adds read groups to input BAM files, streams BAM output on stdout. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BamBam | several simple-to-use tools to facilitate NGS analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bamstats (notsame as BAMstats) | Bamstats is a command line tool written in Go for computing mapping statistics from a BAM file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bamtofastq | Tool for converting 10x BAMs produced by Cell Ranger, Space Ranger, Cell Ranger ATAC, Cell Ranger DNA, and Long Ranger back to FASTQ files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bamtools | BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bamUtil | bamUtil is a repository that contains several programs that perform operations on SAM/BAM files. All of these programs are built into a single executable, bam. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BBMap | a short read aligner, as well as various other bioinformatic tools. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bcl2fastq | The Bcl2FastQ conversion software is a new tool to handle bcl conversion and demultiplexing of both unzipped and zipped bcl files, which have reduced footprint and were introduced as an optional output of the HCS Software version 2.0 |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BEDOPS | BEDOPS is an open-source command-line toolkit that performs highly efficient and scalable Boolean and other set operations, statistical calculations, archiving, conversion and other management of genomic data of arbitrary scale. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bedtools | The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BigDataScript | BigDataScript is intended as a scripting language for big data pipeline | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bioawk | Bioawk is an extension to Brian Kernighan's awk, adding the support of several common biological data formats, including optionally gzip'ed BED, GFF, SAM, VCF, FASTA/Q and TAB-delimited formats with column names. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
biohazard-tools | This is a collection of command line utilities that do useful stuff involving BAM files for Next Generation Sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Biopieces | The Biopieces are a collection of bioinformatics tools that can be pieced together in a very easy and flexible manner to perform both simple and complex tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BIOPYTHON | Biopython is a set of freely available tools for biological computation written in Python by an international team of developers. | Genologin Cluster: In Python-3.6.3 Genobioinfo Cluster: Ask for Install |
busco2fasta | A script to turn a set of BUSCO results into a directory of multisequence FASTA files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bwtools | bwtool is a command-line utility for bigWig files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cabal | Cabal is the standard package system for Haskell software. It helps people to configure, build and install Haskell software and to distribute it easily to other users and developers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cctools | The Cooperative Computing Tools (cctools) enable large scale distributed computations to harness hundreds to thousands of machines from clusters, clouds, and grids. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cd-hit | CD-HIT stands for Cluster Database at High Identity with Tolerance. The program (cd-hit) takes a fasta format sequence database as input and produces a set of 'non-redundant' (nr) representative sequences as output. In addition cd-hit outputs a cluster file, documenting the sequence 'groupies' for each nr sequence representative. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cdbfasta | This is a brief introduction to a couple of platform independent file-based hashing tools (cdbfasta and cdbyank) that can be used for creating indices for quick retrieval of any particular sequences from large multi-FASTA files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
circos | Circos is a software package for visualizing data and information. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
clustermq | ClusterMQ: send R function calls as cluster job | Genologin Cluster: In R-3.6.0 => module load system/R-3.6.0 Genobioinfo Cluster: Ask for Install |
Concrete Autoencoders | The concrete autoencoder is an end-to-end differentiable method for global feature selection, which efficiently identifies a subset of the most informative features and simultaneously learns a neural network to reconstruct the input data from the selected features. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
csvtk | A cross-platform, efficient and practical CSV/TSV toolkit in Golang. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DamageProfiler | A Java based tool to determine damage patterns on ancient DNA as a replacement for mapDamage. DamageProfiler calculates damage profiles of mapped reads and provides a graphical as well as text based representation. It creates
|
Genobioinfo Cluster: How to use |
Dasel | Dasel (short for data-selector) allows you to query and modify data structures using selector strings. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
datamash | GNU datamash is a command-line program which performs basic numeric, textual and statistical operations on input textual data files. |
Genologin Cluster: How to use Genobioinfo Cluster: default system |
DAZZ_DB | To facilitate the multiple phases of the dazzler assembler, we organize all the read data into what is effectively a "database" of the reads and their meta-information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DBSCAN-SWA | An integrated tool for rapid prophage detection and annotation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
deepTools | Tools to process and analyze deep sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DSK | DSK is a k-mer counting software, similar to Jellyfish. DSK supports large values of k, and runs with (almost-)arbitrarily low memory usage and reasonably low temporary disk usage. DSK can count k-mers of large Illumina datasets on laptops and desktop computers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ecoPCR | ecoPCR is an electronic PCR software developed by LECAand Helix-Project . It helps you to estimate Barcode primers quality. In conjunction with OBItools, you can postprocess ecoPCR output to compute barcode coverage and barcode speci?city. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EDirect | Entrez Direct (EDirect) provides access to the NCBI's suite of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EGA_download_client | The EgaDemoClient is a JAVA based data streamer that enables EGA account holders to securely download files and datasets, either through an interactive shell (IS) or using direct command line mode (DCLM). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EggLib | EggLib is a C++/Python library and program package for evolutionary genetics and genomics. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Eigen | Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMBOSS | EMBOSS is "The European Molecular Biology Open Software Suite". EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ensembl-API | Ensembl uses MySQL relational databases to store its information. A comprehensive set of Application Programme Interfaces (APIs) serve as a middle-layer between underlying database schemes and more specific application programmes. The APIs aim to encapsulate the database layout by providing efficient high-level access to data tables and isolate applications from data layout changes. Ensembl's API is written in Perl | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ERPIN | ERPIN (Easy RNA Profile IdentificatioN) is an RNA motif search program developped by Daniel Gautheret and André Lambert. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EUPAN | Toolkit that integrates various software in order to build eukaryotic pangenomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FASTA Composition | finds the overall composition of sequences in a FASTA file |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FASTA_Length | FASTA Length finds the lengths of sequences in a FASTA file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fasta_validator | C code to validate a fasta file. |
Genobioinfo Cluster: How to use |
fastk-medians | A set of utilities to calculate the median number of times the k-mers in a sequence of interest occur across the whole set. |
Genobioinfo Cluster: How to use |
fastprofkernel | fastprofkernel is a Debian package that uses an accelerated version of the original profile kernel <1> to automatically train SVM based classification models. It can assign user-defined classes to so far uncharacterized proteins. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastQC | A Quality Control application for FastQ files. FastQC is an application which takes a FastQ file and runs a series of tests on it to generate a comprehensive QC report. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FASTX-Toolkit | The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fgbio | A set of tools to analyze genomic data with a focus on Next Generation Sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FindingOverCovRegions | FindOverCovRegions.py search for genomic regions with abnormal read coverage (e.g. depth). To do so, this program requieres begraph-like file (e.g. bedtools genomecov per-base reports) where for each position of the genome, the coverage depth is reported (even 0 values). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fqtools | fqtools is a software suite for fast processing of FASTQ files; Various file manipulations are supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Galaxy | Galaxy is a scientific workflow, data integration, and data and analysis persistence and publishing platform that aims to make computational biology accessible to research scientists that do not have computer programming experience. Although it was initially developed for genomics research, it is largely domain agnostic and is now used as a general bioinformatics workflow management system | Genologin Cluster: soon SGE Cluster: Link to Galaxy instance Genobioinfo Cluster: Ask for Install |
gargammel | gargammel is an ancient DNA simulator | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gCluster | The gCluster algorithm is a general clustering method that predicts clusters of any biological word or combination of them, relying only on the DNA sequence and the statistical significance. When using CG as word, gCluster works similarly to CpGcluster, our method to predict CpG islands. More broadly, gCluster has much in common with wordCluster but uses an improved distance model. |
Genobioinfo Cluster: How to use |
GDAL | a translator library for raster and vector geospatial data formats that is released under an X/MIT style Open Source license by the Open Source Geospatial Foundation. | Genologin Cluster: /tools/libraries/gdal Genobioinfo Cluster: Ask for Install |
GEM-library | A set of very optimized tools for indexing/querying huge genomes/files. |
Genobioinfo Cluster: How to use |
GEM-Tools | GEM-Tools is a C API and a Python module to support and simplify usage of the GEM Mapper. |
Genobioinfo Cluster: How to use |
GenomeScope | Fast genome analysis from unassembled short reads |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeTools | Collection of bioinformatics tools (in the realm of genome informatics) combined into a single binary named "gt". |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gerbil | A basic task in bioinformatics is the counting of k-mers in genome strings. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gfatools | gfatools is a set of tools for manipulating sequence graphs in the GFA or the rGFA format. It has implemented parsing, subgraph and conversion to FASTA/BED. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gff3sort | A Perl Script to sort gff3 files and produce suitable results for tabix tools | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gff3toembl | Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gffcompare | gffcompare can be used to compare, merge, annotate and estimate accuracy of one or more GFF files (the “query” files), when compared with a reference annotation (also provided as GFF). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gffread | GFF/GTF utility providing format conversions, region filtering, FASTA sequence extraction and more. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gget | gget enables efficient querying of genomic reference databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gh-cli | gh is GitHub on the command line. It brings pull requests, issues, and other GitHub concepts to the terminal next to where you are already working with git and your code. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GHC | GHC is a state-of-the-art, open source, compiler and interactive environment for the functional language Haskell | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Grinder | Grinder is a versatile open-source bioinformatic tool to create simulated omic shotgun and amplicon sequence libraries for all main sequencing platforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GTFtools | GTFtools provides a set of functions to analyze various modes of gene models. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hal2vg | Convert HAL to vg-compatible sequence graph. |
Genobioinfo Cluster: How to use |
Hclust2 | Hclust2 is a handy tool for plotting heat-maps with several useful options to produce high quality figures that can be used in publication. |
Genobioinfo Cluster: How to use |
hcluster_sg | A hierarchical clustering software for sparse graphs | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HDFView | HDFView is a visual tool for browsing and editing HDF4 and HDF5 files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HGT-ID | An efficient and sensitive program for detecting viral insertion sequences from known viral reference genome in the genome of human cancers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HiCExplorer | HiCExplorer is a powerful and easy to use set of tools to process, normalize and visualize Hi-C data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HiCLift | A fast and efficient tool for converting chromatin interaction data between genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IDR | The IDR (Irreproducible Discovery Rate) framework is a unified approach to measure the reproducibility of findings identified from replicate experiments and provide highly stable thresholds based on reproducibility. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iVar | Var is a computational package that contains functions broadly useful for viral amplicon-based sequencing. Additional tools for metagenomic sequencing are actively being incorporated into iVar. While each of these functions can be accomplished using existing tools, iVar contains an intersection of functionality from multiple tools that are required to call iSNVs and consensus sequences from viral sequencing data across multiple replicates. We implemented the following functions in iVar: (1) trimming of primers and low-quality bases, (2) consensus calling, (3) variant calling - both iSNVs and insertions/deletions, and (4) identifying mismatches to primer sequences and excluding the corresponding reads from alignment files. |
Genobioinfo Cluster: How to use |
JAGS | JAGS is Just Another Gibbs Sampler. It is a program for analysis of Bayesian hierarchical models using Markov Chain Monte Carlo (MCMC) simulation not wholly unlike BUGS. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
JCVI | Collection of Python libraries to parse bioinformatics files, or perform computation related to assembly, annotation, and comparative genomics. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Jellyfish | JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Julia | Julia is a high-level, high-performance dynamic programming language for technical computing, with syntax that is familiar to users of other technical computing environments. It provides a sophisticated compiler, distributed parallel execution, numerical accuracy, and an extensive mathematical function library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Jvarkit | Java utilities for Bioinformatics (only requested tools are compiling) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KAT | KAT (The K-mer Analysis Toolkit) is a suite of tools that generate, analyse and compare k-mer spectra produced from sequence files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KCOSS | A fast and space-saving multi-threaded k-mer frequency statistics algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kentUtils | UCSC command line bioinformatic utilities |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
klocate | Standalone tool based on the bwa index to locate a set of kmers along a reference genome. klocate searches each kmer (full and perfect match) in the index and outputs all positions the kmer maps to (output to sdtout in bed format). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kmap | Standalone tool based on the bwa index to locate a set of kmers along a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kmdiif | kmdiff provides differential k-mers analysis between two populations (control and case). Each population is represented by a set of short-read sequencing. Outputs are differentially represented k-mers between controls and cases. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmer-counter | A fast k-mer counter written in Rust. |
Genobioinfo Cluster: How to use |
KrakenTools | KrakenTools provides individual scripts to analyze Kraken/Kraken2/Bracken/KrakenUniq output files. |
Genobioinfo Cluster: How to use |
Krona | Krona allows hierarchical data to be explored with zoomable pie charts. Krona charts can be created using an Excel template or KronaTools, which includes support for several bioinformatics tools and raw data formats. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lastp_aai | A simple Python script for calculating pairwise amino acid identity (AAI) between protein files (extension .faa) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
libplinkio | This is a small C and Python library for reading Plink genotype files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
libstree | libstree is a generic suffix tree implementation, written in C. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Liftoff | Liftoff is a tool that accurately maps annotations in GFF or GTF between assemblies of the same, or closely-related species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
llvm | The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRez | Standalone tool and library allowing to work with barcoded linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mash | Fast genome and metagenome distance estimation using MinHash. documentation : Publications — Mash 2.0 documentation |
Genobioinfo Cluster: How to use |
MEGA-CC | Software suite for analyzing DNA and protein sequence data from species and populations. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Met4j | Met4J is an open-source Java library dedicated to the structural analysis of metabolic networks. It also came with a toolbox gathering CLI for several analyses relevant to metabolism-related research. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mosdepth | Fast BAM/CRAM depth calculation for WGS, exome, or targeted sequencing. mosdepth can output: per-base depth about 2x as fast samtools depth--about 25 minutes of CPU time for a 30X genome. mean per-window depth given a window size--as would be used for CNV calling. the mean per-region given a BED file of regions. a distribution of proportion of bases covered at or above a given threshhold for each chromosome and genome-wide. quantized output that merges adjacent bases as long as they fall in the same coverage bins e.g. (10-20) threshold output to indicate how many bases in each region are covered at the given thresholds. when appropriate, the output files are bgzipped and indexed for ease of use. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MultiQC | Aggregate results from bioinformatics analyses across many samples into a single report. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NAMD | NAMD, recipient of a 2002 Gordon Bell Award and a 2012 Sidney Fernbach Award, is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoPlot | Plotting tool for Oxford Nanopore sequencing data and alignments. |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: How to use |
natsort | Simple yet flexible natural sorting in Python | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NCBI_tools | NCBI portable software toolkit |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_tools++ | NCBI C++ Toolkit provides free, portable, public domain libraries. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NetLogo | NetLogo is a multi-agent programmable modeling environment. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NextCloudcmd | A command line client that can be used to synchronize Nextcloud files to client machines. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Nextflow | Nextflow enables scalable and reproducible scientific workflows using software containers. It allows the adaptation of pipelines written in the most common scripting languages. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ngsutils | Tools for next-generation sequencing analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
numpy | NumPy is a package needed for scientific computing with Python. | Genologin Cluster: In Python (all versions) Genobioinfo Cluster: Ask for Install |
OBITools | OBITools is a set of python programs developed to simplify the manipulation of sequence files in our labs. They were mainly designed to help us for analyzing Next Generation Sequencer outputs (454 or Illumina) in the context of DNA Metabarcoding. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pandoc | Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
parallel | GNU parallel is a shell tool for executing jobs in parallel using one or more computers. |
Genologin Cluster: How to use Genobioinfo Cluster: default system |
parallel-fastq-dump | NCBI fastq-dump can be very slow sometimes, even if you have the resources (network, IO, CPU) to go faster, even if you already downloaded the sra file (see the protip below). This tool speeds up the process by dividing the work into multiple threads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pbtk | PacBio BAM toolkit |
Genobioinfo Cluster: How to use |
PEAR | PEAR is an ultrafast, memory-efficient and highly accurate pair-end read merger. It is fully parallelized and can run with as low as just a few kilobytes of memory. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PfamScan | A program that searches a FASTA file against a library of Pfam HMMs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pomoxis | Pomoxis comprises a set of basic bioinformatic tools tailored to nanopore sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pong | pong is a freely available software package, released by Behr et al. (2016, Bioinformatics), for post-processing output from clustering inference using population genetic data. |
Genologin Cluster: In Python-2.7.15 Genobioinfo Cluster: Ask for Install |
PoPoolation2 | PoPoolation2 allows to compare allele frequencies for SNPs between two or more populations and to identify significant differences. PoPoolation2 requires next generation sequencing data of pooled genomic DNA (Pool-Seq). It may be used for measuring differentiation between populations, for genome wide association studies and for experimental evolution. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pp-popularity-contest | The pp-popularity-contest package sets up a cron job that periodically submits the developers anonymous statistics on the usage of Rost Lab prediction methods installed on this system. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
preseq | Software for predicting library complexity and genome coverage in high-throughput sequencing. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Primer3 | Primer3 is a widely used program for designing PCR primers (PCR = "Polymerase Chain Reaction"). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PRINSEQ | PRINSEQ is a tool that generates summary statistics of sequence and quality data and that is used to filter, reformat and trim next-generation sequence data. The standalone version is primarily designed for data preprocessing and does not generate summary statistics in graphical form. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PROJ4 | Cartographic Projections Library | Genologin Cluster: /tools/libraries/PROJ/ Genobioinfo Cluster: Ask for Install |
pybam | Very simple, pure python, BAM file reader. If you do not need to use BAM indexes, pybam is probably the fastest and simplest BAM parser out there, particularly if run under PyPy. |
Genobioinfo Cluster: How to use |
PyCharm | PyCharm is a dedicated Python and Django IDE providing a wide range of essential tools for Python developers, tightly integrated together to create a convenient environment for productive Python development and Web development. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PyPy | A fast, compliant alternative implementation of Python. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PySlurm | This module provides a low-level Python wrapper around the Slurm C-API using Cython. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Quake | t Quake is a package to correct substitution sequencing errors in experiments with deep coverage (e.g. >15X), specifically intended for Illumina sequencing reads. Quake adopts the k-mer error correction framework, first introduced by the EULER genome assembly package. Unlike EULER and similar progams, Quake utilizes a robust mixture model of erroneous and genuine k-mer distributions to determine where errors are located. Then Quake uses read quality values and learns the nucleotide to nucleotide error rates to determine what types of errors are most likely. This leads to more corrections and greater accuracy, especially with respect to avoiding mis-corrections, which create false sequence unsimilar to anything in the original genome sequence from which the read was taken.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Quarto | Quarto is a software that compiles a markdown code to html, pdf, or many other formats. It is a successor of pandoc. |
Genobioinfo Cluster: How to use |
R | R is "GNU S", a freely available language and environment for statistical computing and graphics which provides a wide variety of statistical and graphical techniques: linear and nonlinear modelling, statistical tests, time series analysis, classification, clustering, etc. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RabbitUniq | Compute unique k-mer faster. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RabbitV | RabbitV is a highly optimized and practical toolkit for the detection of viruses and microorganisms in sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RetroScan | RetroScan is an easy-to-use tool for retrocopy identification that integrates a series of bioinformatics tools (LAST, BEDtools, ClustalW2, KaKs_Calculator, HISAT2, StringTie, SAMtools and Shiny) and scripts. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Roary | Roary is a high speed stand alone pan genome pipeline, which takes annotated assemblies in GFF3 format (produced by Prokka (Seemann, 2014)) and calculates the pan genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RopeBWT2 | RopeBWT2 is an tool for constructing the FM-index for a collection of DNA sequences. |
Genobioinfo Cluster: How to use |
ROSE | To create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ruby | A dynamic, open source programming language. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
sambamba | Sambamba is a high performance modern robust and fast tool (and library), written in the D programming language, for working with SAM and BAM files. Current functionality is an important subset of samtools functionality, including view, index, sort, markdup, and depth. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
samclip | Filter SAM file for soft and hard clipped alignments | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Saturn | A tool for assessing the library saturation without any reference genome. . |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sbt | sbt is a build tool for Scala, Java, and more. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
scipy | SciPy (pronounced "Sigh Pie") is open-source software for mathematics, science, and engineering. The SciPy library depends on Numpy, which provides convenient and fast N-dimensional array manipulation. | Genologin Cluster: In Python modules Genobioinfo Cluster: Ask for Install |
selscan | A program to calculate EHH-based scans for positive selection in genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seq | Seq is a programming language for computational genomics and bioinformatics. With a Python-compatible syntax and a host of domain-specific features and optimizations, Seq makes writing high-performance genomics software as easy as writing Python code, and achieves performance comparable to (and in many cases better than) C/C++. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SeqAn | SeqAn is an open source C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
seqfilter | Filter fasta/fastq(.gz) files by ID and/or sequence length |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SeqFu | A general-purpose program to manipulate and parse information from FASTA/FASTQ files, supporting gzipped input files. Includes functions to interleave and de-interleave FASTQ files, to rename sequences and to count and print statistics on sequence lengths. |
Genobioinfo Cluster: How to use |
SeqKit | A cross-platform and ultrafast toolkit for FASTA/Q file manipulation. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seqtk | Toolkit for processing sequences in FASTA/Q formats |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SequenceTools | Tools for population genetics on sequencing datas |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Singularity | Singularity enables users to have full control of their environment. Singularity containers can be used to package entire scientific workflows, software and libraries, and even data. |
Genologin Cluster: How to use Genobioinfo Cluster: module load containers/singularity/3.9.9 |
Smudgeplots | Inference of ploidy and heterozygosity structure using whole genome sequencing data. This tool extracts heterozygous kmer pairs from kmer dump files (from jellyfish or KMC) and performs gymnastics with them. We are able to disentangle genome structure by comparing the sum of kmer pair coverages (CovA + CovB) to their relative coverage (CovA / (CovA + CovB)). Smudgeplots are computed from raw/trimmed reads and show the haplotype structure using heterozygous kmer pairs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Snakemake | Snakemake is a workflow management system that aims to reduce the complexity of creating workflows by providing a fast and comfortable execution environment, together with a clean and modern specification language in python style. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
soap.coverage | Can calculate sequencing coverage or physical coverage as well as duplication rate and details of specific block for each segments and whole genome by using SOAP, Blat, Blast, BlastZ, mummer and MAQ aligement results with multi-thread. Gzip file supported. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sourmash | sourmash is a command-line tool and Python library for computing hash sketches from DNA sequences, comparing them to each other, and plotting the results. | Genologin Cluster: In Python-3.7.4 Genobioinfo Cluster: How to use |
squeakr | Squeakr is a k-mer-counting and multiset-representation system using the recently-introduced counting quotient filter (CQF) Pandey et al. (2017), a feature-rich approximate membership query (AMQ) data structure. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
squid | A C library that is bundled with much of the above software. C function library for sequence analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SRAToolkit | Toolkit to query Short Reads Archive at NCBI |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
subsampler | Small tool to subsample fasta and fastq files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sumaclust | Fast and exact clustering of sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Sumatra | Sumatra was developed by the LECA and aims to compute a great deal of sequence similarities in a fast and exact way, based on the length of the Longest Common Subsequence (LCS) between two sequences. Sequence clustering based on similarities is also available through Sumaclust. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
superstring | Greedy approximation of the shortest common superstring | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
tabix | TAB-delimited file IndeXer. Useful for vcfTools. |
Genologin Cluster: How to use Genobioinfo Cluster: in bcftools and samtools |
TexLive | TeX Live is intended to be a straightforward way to get up and running with the TeX document production system. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
toulbar2 | toulbar2 is an open-source black-box C++ optimizer for cost function networks and discrete additive graphical models. It can read a variety of formats. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Umap | The free umap software package efficiently identifies uniquely mappable regions of any genome. Its Bismap extension identifies mappability of the bisulfite converted genome (methylome). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
unique-kmer-counts | This program calculates the number of distinct k-mers for each sequence record in a fasta file and divides it by the total number of k-mers in that record. |
Genobioinfo Cluster: How to use |
vawk | An awk-like VCF parser |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VCF-kit | Assorted utilities for the variant call format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcflib | C++ library and cmdline tools for parsing and manipulating VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VCFtools | VCFtools is a program package designed for working with VCF files, such as those generated by the 1000 Genomes Project. The aim of VCFtools is to provide methods for working with VCF files: validating, merging, comparing and calculate some basic population genetic statistics. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Vmatch | A versatile software tool for efficiently solving large scale sequence matching tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VSEARCH | Versatile open-source tool for metagenomics |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Wgsim | Wgsim is a small tool for simulating sequence reads from a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WiggleTools | The WiggleTools package allows genomewide data files to be manipulated as numerical functions, equipped with all the standard functional analysis operators (sum, product, product by a scalar, comparators), and derived statistics (mean, median, variance, stddev, t-test, Wilcoxon's rank sum test, etc). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
3rdChimeraMiner | Exploration of whole genome amplification generated chimeric sequences in long-read sequencing data. | Genobioinfo Cluster: How to use |
ANGEL | Robust Open Reading Frame prediction (ANGLE re-implementation) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ArrowGrid | The distribution is a parallel wrapper around the Arrow consensus framework within the SMRT Analysis Software |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BELLA | A computationally-efficient and highly-accurate long-read to long-read aligner and overlapper. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
chopper | This tool, intended for long read sequencing such as PacBio or ONT, filters and trims a fastq file. chopper is a tool that reunites the now outdated softwares NanoFilt and NanoLyse. It permits to filter QC files and has a faster execution time than NanoFilt and NanoLyse.
|
Genobioinfo Cluster: How to use |
CIRI-long | Circular RNA Identification for Long-Reads Nanopore Sequencing Data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Clair3 | Clair3 is a germline small variant caller for long-reads. Clair3 makes the best of two major method categories: pileup calling handles most variant candidates with speed, and full-alignment tackles complicated candidates to maximize precision and recall. Clair3 runs fast and has superior performance, especially at lower coverage. Clair3 is simple and modular for easy deployment and integration. |
Genobioinfo Cluster: How to use |
CONSENT | CONSENT (sCalable self-cOrrectioN of long reads with multiple SEquence alignmeNT) is a self-correction method for long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cuteSV | Long read based human genomic structural variation detection with cuteSV. |
Genobioinfo Cluster: How to use |
DAmar | Long read QC, assembly and scaffolding pipeline for PacBio or Oxford Nanopore long-read sequencing data. T he pipeline produces a number of QC metrics at various stages as well as incorporating further technologies including Bionano, 10x and HiC data to scaffold the created contigs. DAmar, is a hybrid of the earlier Marvel, Dazzler, and Daccord systems of the Eugene Myers lab. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DAZZ_DB | To facilitate the multiple phases of the dazzler assembler, we organize all the read data into what is effectively a "database" of the reads and their meta-information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DBG2OLC | The genome assembler that reduces the computational time of human genome assembly from 400,000 CPU hours to 2,000 CPU hours, utilizing long erroneous 3GS sequencing reads and short accurate NGS sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeChat | Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat. |
Genobioinfo Cluster: How to use |
Deepbinner | Deepbinner is a tool for demultiplexing barcoded Oxford Nanopore sequencing reads. It does this with a deep convolutional neural network classifier, using many of the architectural advances that have proven successful in image classification. Unlike other demultiplexers (e.g. Albacore and Porechop), Deepbinner identifies barcodes from the raw signal (a.k.a. squiggle) which gives it greater sensitivity and fewer unclassified reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeepSignal | Detecting methylation using signal-level features from Nanopore sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DENTIST | DENTIST is a sensitive, highly-accurate and automated pipeline method to close gaps in (short read) assemblies with long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ELECTOR | EvaLuation of Error Correction Tools for lOng Reads. |
|
FABuLOUS | A gap-closing software tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. Initially called TGS-GapCloser. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON | Falcon: a set of tools for fast aligning long reads for consensus and assembly |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FALCON-Phase | FALCON-Phase integrates PacBio long-read assemblies with Phase Genomics Hi-C data to create phased, diploid, chromosome-scale scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastplong | Ultra-fast preprocessing and quality control for long-read sequencing data. |
Genobioinfo Cluster: How to use |
Filtlong | Filtlong is a tool for filtering long reads by quality. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FLAIR | FLAIR (Full-Length Alternative Isoform analysis of RNA) for the correction, isoform definition, and alternative splicing analysis of noisy reads. FLAIR has primarily been used for nanopore cDNA, native RNA, and PacBio sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FLAS | FLAS is software that makes self-correction for PacBio long reads with fast speed and high throughput. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Flye | Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FMLRC | FMLRC, or FM-index Long Read Corrector, is a tool for performing hybrid correction of long read sequencing using the BWT and FM-index of short-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FMLRC2 | FMLRC2 performs error correction/polishing of long erroneous sequences with accurate short reads. As such, it can be used as both an error-correction tool <1> for raw long reads (ex. Oxford Nanopore) and a polishing tool <2> for de novo assemblies. |
Genobioinfo Cluster: How to use |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
GraphAligner | Seed-and-extend program for aligning long error-prone reads to genome graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphMap | A highly sensitive and accurate mapper for long, error-prone reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Hap10 | The goal is to reconstruct accurate and long haplotypes polyploid genome using linked reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Hapo-G | Hapo-G is a tool that aims to improve the quality of genome assemblies by polishing the consensus with accurate reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HASLR | HASLR is a tool for rapid genome assembly of long sequencing reads. HASLR is a hybrid tool which means it requires long reads generated by Third Generation Sequencing technologies (such as PacBio or Oxford Nanopore) together with Next Generation Sequencing reads (such as Illumina) from the same sample. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HECIL | Hybrid Error Correction of Long Reads using Iterative Learning | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HELEN | HELEN (Homopolymer Encoded Long-read Error-corrector for Nanopore) uses a Recurrent-Neural-Network (RNN) based Multi-Task Learning (MTL) model that can predict a base and a run-length for each genomic position using the weights generated by MarginPolish. This installation includes MarginPolish. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HG-CoLoR | HG-CoLoR (Hybrid method based on a variable-order de bruijn Graph for the error Correction of Long Reads) is a hybrid method for the error correction of long reads that both aligns the short reads to the long reads, and uses a variable-order de Bruijn graph, in a seed-and-extend approach. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HiFiAdapterFilt | Convert .bam to .fastq and remove reads with remnant PacBio adapter sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hifiasm | Hifiasm is a fast haplotype-resolved de novo assembler for PacBio Hifi reads. Unlike most existing assemblers, hifiasm starts from uncollapsed genome. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hifiasm-meta | De novo metagenome assembler, based on hifiasm, a haplotype-resolved de novo assembler for PacBio Hifi reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
IPA | Improved Phased Assembler (IPA) is the official PacBio software for HiFi genome assembly. IPA was designed to utilize the accuracy of PacBio HiFi reads to produce high-quality phased genome assemblies. IPA is an end-to-end solution, starting with input reads and resulting in a polished assembly. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IsoSeq3 | Scalable De Novo Isoform Discovery from Single-Molecule PacBio Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Jabba | A hybrid error correction tool for sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
lamassemble | Merge overlapping "long" DNA reads into a consensus sequence. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
lima | Demultiplex Barcoded PacBio Samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Linker | Linker is a suite of C++ tools useful for interpreting long and linked read sequencing of cancer genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LIQA | Long-read Isoform Quantification and Analysis) is an Expectation-Maximization based statistical method to quantify isoform expression and detect differential alternative splicing (DAS) events using long-read RNA-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LongQC | LongQC is a tool for the data quality control of the PacBio and ONT long reads, and it has two functionalities: sample qc and platform qc. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
longshot | Longshot is a variant calling tool for diploid genomes using long error prone reads such as Pacific Biosciences (PacBio) SMRT and Oxford Nanopore Technologies (ONT). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LoRDEC | LoRDEC is a program to correct sequencing errors in long reads from 3rd generation sequencing with high error rate, and is especially intended for PacBio reads. It uses a hybrid strategy, meaning that it uses two sets of reads: the reference read set, whose error rate is assumed to be small, and the PacBio read set, which is then corrected using the reference set. Typically, the reference set contains Illumina reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LR_Gapcloser | LR_Gapcloser is a gap closing tool using uncorrected or corrected long reads generated from Pacbio platform or Nanopore platform. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LRScaf | TGS scaffolding . Improving draft genomes using long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MARVEL | MARVEL consists of a set of tools that facilitate the overlapping, patching, correction and assembly of noisy (not so noisy ones as well) long reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MashMap | MashMap implements a fast and approximate algorithm for computing local alignment boundaries between long DNA sequences. It can be useful for mapping genome assembly or long reads (PacBio/ONT) to reference genome(s). Given a minimum alignment length and an identity threshold for the desired local alignments, Mashmap computes alignment boundaries and identity estimates using k-mers. It does not compute the alignments explicitly, but rather estimates a k-mer based Jaccard similarity using a combination of Minimizers and MinHash. This is then converted to an estimate of sequence identity using the Mash distance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MaSuRCA | MaSuRCA is whole genome assembly software. It combines the efficiency of the de Bruijn graph and Overlap-Layout-Consensus (OLC) approaches. MaSuRCA can assemble data sets containing only short reads from Illumina sequencing or a mixture of short reads and long reads (Sanger, 454) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mCaller | This program is designed to call m6A from nanopore data using the differences between measured and expected currents. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Megalodon | Megalodon provides "basecalling augmentation" for raw nanopore sequencing reads, including direct, reference-guided SNP and modified base calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MeGAMerge | A tool to merge assembled contigs, long reads from metagenomic sequencing runs |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaMaps | MetaMaps is tool specifically developed for the analysis of long-read (PacBio/ONT) metagenomic datasets. It simultaenously carries out read assignment and sample composition estimation. It is faster than classical exact alignment-based approaches, and its output is more information-rich than that of kmer-spectra-based methods. For example, each MetaMaps alignment comes with an approximate alignment location, an estimated alignment identity and a mapping quality. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Miniasm | Ultrafast de novo assembly for long noisy reads (though having no consensus step). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MinIONQC | Fast and effective quality control for MinION and PromethION sequencing data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Minipolish | A tool for Racon polishing of miniasm assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiniScrub | MiniScrub is a de novo long sequencing read preprocessing method that improves read quality by predicting and removing ("scrubbing") read segments that have a high concentration of errors. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
modbam2bed | A program to aggregate modified base counts stored in a modified-base BAM file to a bedMethyl file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoCaller | NanoCaller is a computational method that integrates long reads in deep convolutional neural network for the detection of SNPs/indels from long-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoComp | Compare multiple runs of long read sequencing data and alignments. |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: in Python-3.11.1 |
NanoSimm | NanoSim is a fast and scalable read simulator that captures the technology-specific features of ONT data, and allows for adjustments upon improvement of nanopore sequencing technology. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoSPC | NanoSPC is a scalable, portable and cloud compatible pipeline for analyzing Nanopore sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NaS | NaS is a hybrid approach developed to take advantage of data generated using MinION device. It combines Illumina and Oxford Nanopore technologies to produce NaS (Nanopore Synthetic-long) reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NECAT | NECAT is an error correction and de-novo assembly tool for Nanopore long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NextDenovo | NextDenovo is a string graph-based de novo assembler for TGS long reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NGMLR | NGMLR is a long-read mapper designed to align PacBio or Oxford Nanopore (standard and ultra-long) to a reference genome with a focus on reads that span structural variations SV detection from paired end reads mapping |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Oases | Oases is a de novo transcriptome assembler designed to produce transcripts from short read sequencing technologies, such as Illumina, SOLiD, or 454 in the absence of any genomic assembly. It was developed by Marcel Schulz (MPI for Molecular Genomics) and Daniel Zerbino (previously at the European Bioinformatics Institute (EMBL-EBI), now at UC Santa Cruz). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ont_fast5_api | ont_fast5_api is a simple interface to HDF5 files of the Oxford Nanopore fast5 file format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Organelle_PBA | OrganelleRef_PBA is a script to perform a de-novo PacBio assemblies of any organelle (chloroplast or mitochondrial genomes) using several programs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Pacasus | Tool for detecting and cleaning PacBio / Nanopore long reads after whole genome amplification. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Peregrine | Peregrine is a fast genome assembler for accurate long reads (length > 10kb, accuraccy > 99%). It can assemble a human genome from 30x reads within 20 cpu hours from reads to polished consensus. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pomoxis | Pomoxis comprises a set of basic bioinformatic tools tailored to nanopore sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Porechop_ABI | Porechop_abi (ab initio) is an extension of Porechop that is able to infer the adapter sequence from the Oxford Nanopore reads. It discovers the adapter sequence from the reads using approximate k-mers and assembly, and add the sequence found to the adapter list (adapters.py file). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pycoQC | pycoQC computes metrics and generates Interactive QC plots from the sequencing summary report generated by Oxford Nanopore technologies basecaller (Albacore/Guppy) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Rabaler | Rebaler is a program for conducting reference-based assemblies using long reads. It relies mainly on minimap2 for alignment and Racon for making consensus sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Racon | Consensus module for raw de novo DNA assembly of long uncorrected reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAFT | RAFT (Repeat Aware Fragmentation Tool) is an algorithm designed to improve assembly quality by rescuing contained reads. RAFT breaks long reads into smaller sub-reads by following an algorithm described in our preprint. The read fragmentation allows an OLC assembler to retain contained reads during string graph construction. When input reads have non-uniform lengths, retaining contained reads improves assembly contiguity and base-level accuracy. The inputs to RAFT include an error-corrected read file in FASTA/FASTQ format and an all-vs-all alignment file in PAF format. It performs read fragmentation and outputs the fragmented reads in FASTA format. We recommend users to use hifiasm for the initial steps (read error correction, all-vs-all overlap computation) and also for the final step (assembly of fragmented reads). The assembly output format of hifiasm is described here. The RAFT-hifiasm workflow is recommended for long accurate reads with non-uniform length distribution (e.g., ONT Duplex, or a mixture of ONT Duplex and HiFi reads). ONT UL reads can optionally be integrated during the final assembly step. |
Genobioinfo Cluster: How to use |
Ratatosk | Ratatosk is a phased error correction tool for erroneous long reads based on compacted and colored de Bruijn graphs built from accurate short reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
rust-mdbg | rust-mdbg is an ultra-fast minimizer-space de Bruijn graph (mdBG) implementation, geared towards the assembly of long and accurate reads such as PacBio HiFi. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Shasta | The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SLR-superscaffolder | This is a scaffold assembler designed for stLFR reads. It uses the link-reads information from stLFR reads to assemble contigs to scaffolds. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
spliced_bam2gff | A tool to convert spliced BAM alignments into GFF2 format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SQANTI3 | SQANTI3 is the newest version of the SQANTI tool (publication) that merges features from SQANTI, (code repository) and SQANTI2 (code repository), together with new additions. SQANTI3 will continue as an integrated development aiming to providing you the best characterization possible for your new long read-defined transcriptome. SQANTI3 is the first module of the Functional IsoTranscriptomics (FIT) framework, that also includes IsoAnnot and tappAS. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SquiggleKit | A toolkit for manipulating nanopore signal data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SSPACE-LongRead | SSPACE-LongRead is a stand-alone program for scaffolding pre-assembled contigs using long reads (e.g. PacBio RS reads). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sturgeon | Sturgeon is a CNS neural network classifier for tumour classification |
Genobioinfo Cluster: How to use |
SVIM | SVIM is a structural variant caller for long reads. It is able to detect, classify and genotype five different classes of structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SVJedi | SVJedi is a structural variation (SV) genotyper for long read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi-graph | SVJedi-graph is a structural variation (SV) genotyper for long read data. |
Genobioinfo Cluster: How to use |
Tapestry | Tapestry is a tool to validate and edit small eukaryotic genome assemblies using long sequence reads. It is designed to help identify complete chromosomes, symbionts, haplotypes, complex features and errors in close-to-complete genome assemblies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TGSGapFiller | A gap filling tool that uses error-prone long reads generated by third-generation-sequence techniques (Pacbio, Oxford Nanopore, etc.) or preassembled contigs to fill N-gap in the genome assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tombo | Tombo is a suite of tools primarily for the identification of modified nucleotides from raw nanopore sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Trycycler | Trycycler is a tool for generating consensus long-read assemblies for bacterial genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
uLTRA | uLTRA is a tool for splice alignment of long transcriptomic reads to a genome, guided by a database of exon annotations. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VeChat | Correcting errors in noisy long reads using variation graphs. |
Genobioinfo Cluster: How to use |
Velvet | Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
WhatsHap | WhatsHap is a software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. It is especially suitable for long reads, but works also well with short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
yacrd | Yet Another Chimeric Read Detector for long reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Yak | Yak is initially developed for two specific use cases: 1) to robustly estimate the base accuracy of CCS reads and assembly contigs, and 2) to investigate the systematic error rate of CCS reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
Beagle-lib | BEAGLE-lib is a high-performance library that can perform the core calculations at the heart of most Bayesian and Maximum Likelihood phylogenetics packages |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
eLSA | Extended Local Similarity Analysis -- Finding Time-Dependent Associations in Time Series Datasets | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GROMACS | GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles. It is primarily designed for biochemical molecules like proteins, lipids and nucleic acids that have a lot of complicated bonded interactions, but since GROMACS is extremely fast at calculating the nonbonded interactions (that usually dominate simulations) many groups are also using it for research on non-biological systems, e.g. polymers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
JAGS | JAGS is Just Another Gibbs Sampler. It is a program for analysis of Bayesian hierarchical models using Markov Chain Monte Carlo (MCMC) simulation not wholly unlike BUGS. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MCL | The MCL algorithm is short for the Markov Cluster Algorithm, a fast and scalable unsupervised cluster algorithm for graphs (also known as networks) based on simulation of (stochastic) flow in graphs. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ReLERNN | Recombination Landscape Estimation using Recurrent Neural Networks |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
snape-pooled | SNAPE-pooled computes the probability distribution for the frequency of the minor allele in a certain population, at a certain position in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sNMF | A fast and efficient program for estimating individual admixture coefficients based on sparse non-negative matrix factorization and population genetics. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
toulbar2 | toulbar2 is an open-source black-box C++ optimizer for cost function networks and discrete additive graphical models. It can read a variety of formats. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
HUMAnN | HUMAnN is a method for efficiently and accurately profiling the abundance of microbial metabolic pathways and other molecular functions from metagenomic or metatranscriptomic sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
ARAGORN | ARAGORN is a program to detect tRNA genes and tmRNA genes in nucleotide sequence | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Barrnap | Barrnap predicts the location of ribosomal RNA genes in genomes. It supports bacteria (5S,23S,16S), archaea (5S,5.8S,23S,16S), mitochondria (12S,16S) and eukaryotes (5S,5.8S,28S,18S). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BlockClust | BlockClust is an efficient approach to detect transcripts with similar processing patterns. We propose a novel way to encode expression profiles in compact discrete structures, which can then be processed using fast graph-kernel techniques. BlockClust allows both clustering and classification of small non-coding RNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
circtools | A modular, python-based framework for circRNA-related tools that unifies several functionalities in a single, command line driven software. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CIRI-long | Circular RNA Identification for Long-Reads Nanopore Sequencing Data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CleaveLand4 | Analysis of degradome data to find sliced miRNA and siRNA targets | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FEELnc | FlExible Extraction of LncRNA. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Infernal | Infernal ("INFERence of RNA ALignment") is for searching DNA sequence databases for RNA structure and sequence similarities. It is an implementation of a special case of profile stochastic context-free grammars called covariance models (CMs). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
miRDeep2 | miRDeep2 is a software package for identification of novel and known miRNAs in deep sequencing data. Furthermore, it can be used for miRNA expression profiling across samples. Last, a new module for preprocessing of raw Illumina sequencing data produces files for downstream analysis with the miRDeep2 or quantifier module. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MiRfold | MiRfold searches for a good miRNA-like folding in the sequence surrounding a putative miRNA. It was optimized on plant miRNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PHASE | PHASE is a package that performs molecular phylogenetic inference. The software seeks to accurately compare molecular sequences to determine the likely evolutionary relationships between a group of species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhaseTank | To systemically characterize phasiRNAs/tasiRNAs and their regulatory cascades 'miRNA/phasiRNA -> | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
phyloFlash | phyloFlash is a pipeline to rapidly reconstruct the SSU rRNAs and explore phylogenetic composition of an illumina (meta)genomic dataset. |
Genobioinfo Cluster: How to use |
PingPongPro | Find ping-pong signatures in piRNA-Seq data like a pro. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Prost | Prost! (PRocessing Of Short Transcripts) can analyze smallRNA sequencing data generated on any sequencing platform. Prost! does not rely on existing annotation to filter sequencing reads but instead starts by aligning all the reads on a user-provided genomic reference, allowing the study of miRNAs in any species. Additionally, any number of samples can be studied together in a single Prost! run, allowing an accurate analysis of an entire dataset. After grouping the processed reads by genomic location, Prost! then annotates them using a user-defined annotation database (public or personal annotation database). |
Genobioinfo Cluster: How to use |
RNAclust | RNAclust is a perl script summarizing all the single steps required for clustering of structured RNA motifs, i.e. identifying groups of RNA sequences sharing a secondary structure motif. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RNAmmer | Rnammer predicts 5s/8s, 16s/18s, and 23s/28s ribosomal RNA in tttfull genome sequences. The program uses hidden Markov models trained on data from the 5S ribosomal RNA database and the European ribosomal RNA database project. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RNAscClust | RNAscClust is a pipeline to cluster a set of structured RNAs taking their respective structural conservation into account. The aim of RNAscClust is to aid the discovery of families and classes of ncRNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ShortStack | ShortStack is a tool developed to process and analyze smallRNA-seq data with respect to a reference genome, and output a comprehensive and informative annotation of all discovered small RNA genes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
snoGPS | Search for H/ACA snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
Snoscan | Search for C/D box methylation guide snoRNA genes in a genomic sequence. |
Genobioinfo Cluster: How to use |
SortMeRNA | SortMeRNA is a software designed to rapidly filter ribosomal RNA fragments from metatransriptomic data produced by next-generation sequencers. It is capable of handling large RNA databases and sorting out all fragments matching to the database with high accuracy and specificity | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
srnaMapper | This tool maps reads produced by sRNA-Seq to a genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
tRNAscan-SE | Search for tRNA genes in genomic sequence. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
Juicebox | Software for visualizing data from Hi-C and other proximity mapping experiments | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Juicer | A One-Click System for Analyzing Loop-Resolution Hi-C Experiments |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LocARNA | LocARNA is a tool for multiple alignment of RNA molecules. LocARNA requires only RNA sequences as input and will simultaneously fold and align the input sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MiRfold | MiRfold searches for a good miRNA-like folding in the sequence surrounding a putative miRNA. It was optimized on plant miRNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
non-B_gfa | gfa programs for Non-B site at NCI/FNLCR. gfa is a Suite of programs developed at NCI-Frederick/Frederick National Lab to find sequences associated with non-B DNA forming motifs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PETfold | PETfold performs Probabilistic Evolutionary and Thermodynamic folding of a multiple alignment of RNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
R-scape | R-scape looks for evidence of a conserved RNA structure by measuring pairwise covariations observed in an input multiple sequence alignment. It analyzes all possible pairs, including those in your proposed structure (if you provide one). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
randfold | The software compute the probability that, for a given RNA sequence, the Minimum Free Energy (MFE) of the secondary structure is different from a distribution of MFE computed with random sequences.. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Rfold | Rfold computes local base pairing probabilities for long DNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RNAclust | RNAclust is a perl script summarizing all the single steps required for clustering of structured RNA motifs, i.e. identifying groups of RNA sequences sharing a secondary structure motif. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RNAz | RNAz detects stable and conserved RNA secondary structures in multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
ASHURE | Python-based pipeline for analyzing Nanopore sequencing metabarcoding data. ASHURE can take a reference database in order to improve accuracy. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CARNAC-LR | Clustering coefficient-based Acquisition of RNA Communities in Long Reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
chopper | This tool, intended for long read sequencing such as PacBio or ONT, filters and trims a fastq file. chopper is a tool that reunites the now outdated softwares NanoFilt and NanoLyse. It permits to filter QC files and has a faster execution time than NanoFilt and NanoLyse.
|
Genobioinfo Cluster: How to use |
CONSENT | CONSENT (sCalable self-cOrrectioN of long reads with multiple SEquence alignmeNT) is a self-correction method for long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DAmar | Long read QC, assembly and scaffolding pipeline for PacBio or Oxford Nanopore long-read sequencing data. T he pipeline produces a number of QC metrics at various stages as well as incorporating further technologies including Bionano, 10x and HiC data to scaffold the created contigs. DAmar, is a hybrid of the earlier Marvel, Dazzler, and Daccord systems of the Eugene Myers lab. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Deepbinner | Deepbinner is a tool for demultiplexing barcoded Oxford Nanopore sequencing reads. It does this with a deep convolutional neural network classifier, using many of the architectural advances that have proven successful in image classification. Unlike other demultiplexers (e.g. Albacore and Porechop), Deepbinner identifies barcodes from the raw signal (a.k.a. squiggle) which gives it greater sensitivity and fewer unclassified reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeepSignal | Detecting methylation using signal-level features from Nanopore sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Dorado | Dorado is a high-performance, easy-to-use, open source basecaller for Oxford Nanopore reads. |
Genobioinfo Cluster: How to use |
Flye | Flye is a de novo assembler for long and noisy reads, such as those produced by PacBio and Oxford Nanopore Technologies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
LR_Gapcloser | LR_Gapcloser is a gap closing tool using uncorrected or corrected long reads generated from Pacbio platform or Nanopore platform. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mCaller | This program is designed to call m6A from nanopore data using the differences between measured and expected currents. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
modbam2bed | A program to aggregate modified base counts stored in a modified-base BAM file to a bedMethyl file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Modkit | A bioinformatics tool for working with modified bases from Oxford Nanopore. Specifically for converting modBAM to bedMethyl files using best practices, but also manipulating modBAM files and generating summary statistics. |
Genobioinfo Cluster: How to use |
Nano-Q | Python script for conservatively cleaning ONT reads from bam files and estimate variant frequencies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoCLUST | NanoCLUST is an analysis pipeline for UMAP-based classification of amplicon-based full-length 16S rRNA nanopore reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NanoCount | NanoCount estimates transcripts abundance from Oxford Nanopore direct-RNA sequencing datasets, using an expectation-maximization approach like RSEM, Kallisto, salmon, etc to handle the uncertainty of multi-mapping reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoSimm | NanoSim is a fast and scalable read simulator that captures the technology-specific features of ONT data, and allows for adjustments upon improvement of nanopore sequencing technology. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NECAT | NECAT is an error correction and de-novo assembly tool for Nanopore long noisy reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ont_fast5_api | ont_fast5_api is a simple interface to HDF5 files of the Oxford Nanopore fast5 file format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Porechop_ABI | Porechop_abi (ab initio) is an extension of Porechop that is able to infer the adapter sequence from the Oxford Nanopore reads. It discovers the adapter sequence from the reads using approximate k-mers and assembly, and add the sequence found to the adapter list (adapters.py file). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ratatosk | Ratatosk is a phased error correction tool for erroneous long reads based on compacted and colored de Bruijn graphs built from accurate short reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Shasta | The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Sniffles | A fast structural variant caller for long-read sequencing, Sniffles2 accurately detect SVs on germline, somatic and population-level for PacBio and Oxford Nanopore read data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
spliced_bam2gff | A tool to convert spliced BAM alignments into GFF2 format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SquiggleKit | A toolkit for manipulating nanopore signal data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tombo | Tombo is a suite of tools primarily for the identification of modified nucleotides from raw nanopore sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Variabel | A novel approach and method for intrahost variant detection, which outperforms existing ONT variant callers. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
Bandage-NG | Bandage-NG is a GUI program that allows users to interact with the assembly graphs made by de novo assemblers such as SPAdes, MEGAHIT and others. |
Genobioinfo Cluster: How to use |
Bifrost | Highly parallel construction and indexing of colored and compacted de Bruijn graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Cactus | Cactus is a reference-free whole-genome alignment program, as well as a pagenome graph construction toolkit. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastix | A simple command line tool to add prefixes to FASTA headers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GFAffix | Genobioinfo Cluster: How to use | |
GfaViz | Graphical interactive tool for the visualization of sequence graphs in GFA format. |
Genobioinfo Cluster: How to use |
ggCaller | A de Bruijn graph-based gene-caller and pangenome analysis tool. | Genobioinfo Cluster: How to use |
hal2vg | Convert HAL to vg-compatible sequence graph. |
Genobioinfo Cluster: How to use |
Jasmine | JASMINE: Jointly Accurate Sv Merging with Intersample Network Edges. This tool is used to merge structural variants (SVs) across samples. Each sample has a number of SV calls, consisting of position information (chromosome, start, end, length), type and strand information, and a number of other values. Jasmine represents the set of all SVs across samples as a network, and uses a modified minimum spanning forest algorithm to determine the best way of merging the variants such that each merged variants represents a set of analogous variants occurring in different samples. Manual : Jasmine User Manual · mkirsche/Jasmine Wiki · GitHub Jasmine also includes a module for automating the creation of IGV screenshots of variants of interest. |
Genobioinfo Cluster: How to use |
kSNP4 | kSNP4 identifies the pan-genome SNPs in a set of genome sequences, and estimates phylogenetic trees based upon those SNPs. |
Genobioinfo Cluster: How to use |
odgi | odgi provides an efficient and succinct dynamic DNA sequence graph model, as well as a host of algorithms that allow the use of such graphs in bioinformatic analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
panacus | Genobioinfo Cluster: How to use | |
PanGenie | A short-read genotyper for various types of genetic variants (such as SNPs, indels and structural variants) represented in a pangenome graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanTools | PanTools is a toolkit for comparative analysis of large number of genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pggb | Pangenome graph builder. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PopIns2 | Population-scale detection of non-reference sequence variants using colored de Bruijn Graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
proteinortho_curves | Draw pan- and core-genome curves from proteinortho output |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
BlockClust | BlockClust is an efficient approach to detect transcripts with similar processing patterns. We propose a novel way to encode expression profiles in compact discrete structures, which can then be processed using fast graph-kernel techniques. BlockClust allows both clustering and classification of small non-coding RNAs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CMfinder | CMfinder is a RNA motif prediction tool. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DinuQ | The DinuQ (Dinucleotide Quantification) Python3 package provides a range of metrics for quantifying nucleotide, dinucleotide and synonymous codon representation in genetic sequences. | Genologin Cluster: Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMBOSS | EMBOSS is "The European Molecular Biology Open Software Suite". EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastaGrep | FastaGrep is a tool for searching oligonucleotide binding sites from FastA genomic sequences. It can do both match/mismatch based and thermodynamic binding energy searches. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gerbil | A basic task in bioinformatics is the counting of k-mers in genome strings. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Homer | HOMER (Hypergeometric Optimization of Motif EnRichment) is a suite of tools for Motif Discovery and next-gen sequencing analysis. It is a collection of command line programs for unix-style operating systems written in Perl and C++. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kmap | Standalone tool based on the bwa index to locate a set of kmers along a reference genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MEME | The MEME Suite allows you to: (1) discover motifs using MEME or GLAM2 on groups of related DNA or protein sequences, (2) search sequence databases using motifs, (3) compare a motif to all motifs in a database of motifs, and (3) associate motifs with Gene Ontology terms via their putative target genes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pftools3 | The pftools package contains all the software necessary to build protein and DNA generalized profiles and use them to scan and align sequences, and search databases | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
QmRLFS-finder | QmRLFS-finder, the first R-loop finding tool which uses (unsupervised) QmRLFS (Quantitative Models of RLFS) models to predict RLFSs. This command line tool generates locations and detailed information of RLFSs as well as standards-compliant output files for further analysis and visualization. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RNAclust | RNAclust is a perl script summarizing all the single steps required for clustering of structured RNA motifs, i.e. identifying groups of RNA sequences sharing a secondary structure motif. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Scan For Matches | scan_for_matches is a utility written in C for locating patterns in DNA or protein FASTA files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VAST-TOOLS | Vertebrate Alternative Splicing and Transcription Tools (VAST-TOOLS) is a toolset for profiling and comparing alternative splicing events in RNA-Seq data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
AAF | This is a package for constructing phylogeny without doing alignment or assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
adegenet | R package dedicated to the exploratory analysis of genetic data. It implements a set of tools ranging from multivariate methods to spatial genetics and genome-wise SNP data analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ALFATClust | ALignment-Free Adaptive Threshold Clustering:Biological sequence clustering tool with dynamic threshold for individual clusters. Suitable for clustering multiple groups of homologous sequences. | Genobioinfo Cluster: How to use |
AmpliSAT | AmpliSAT (Amplicon Sequencing Analysis Tools) are a set of online tools that make easy the analysis of Amplicon Sequencing experiments. |
Genobioinfo Cluster: How to use |
Anvio | Anvi’o is an analysis and visualization platform for ‘omics data. It brings together many aspects of today’s cutting-edge genomic, metagenomic, and metatranscriptomic analysis practices to address a wide array of needs. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Apscale | Advanced Pipeline for Simple yet Comprehensive AnaLysEs of DNA metabarcoding data |
Genobioinfo Cluster: How to use |
ASHURE | Python-based pipeline for analyzing Nanopore sequencing metabarcoding data. ASHURE can take a reference database in order to improve accuracy. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ASTER | A family of ASTRAL-like algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ASTRAL | ASTRAL is a tool for estimating an unrooted species tree given a set of unrooted gene trees. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ASTRAL-Pro | ASTRAL-Pro stands for ASTRAL for PaRalogs and Orthologs. ASTRAL is a tool for estimating an unrooted species tree given a set of unrooted gene trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Bakta | Rapid & standardized annotation of bacterial genomes, MAGs & plasmids. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BAli-Phy | BAli-Phy is software by Ben Redelings that estimates multiple sequence alignments and evolutionary trees from DNA, amino acid, or codon sequences. It uses likelihood-based evolutionary models of substitutions and insertions and deletions to place gaps. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayesTraits | BayesTraits is a computer package for performing analyses of trait evolution among groups of species for which a phylogeny or sample of phylogenies is available. This new package incoporates our earlier and separate programes Multistate, Discrete and Continuous. BayesTraits can be applied to the analysis of traits that adopt a finite number of discrete states, or to the analysis of continuously varying traits. Hypotheses can be tested about models of evolution, about ancestral states and about correlations among pairs of traits. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Beagle-lib | BEAGLE-lib is a high-performance library that can perform the core calculations at the heart of most Bayesian and Maximum Likelihood phylogenetics packages |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BEAST | BEAST is a software package for phylogenetic analysis with an emphasis on time-scaled trees. BEAST is a cross-platform program for Bayesian analysis of molecular sequences using MCMC. It is entirely orientated towards rooted, time-measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to average over tree space, so that each tree is weighted proportional to its posterior probability. We include a simple to use user-interface program for setting up standard analyses and a suit of programs for analysing the results. |
Genobioinfo Cluster: How to use |
BEAST2 | BEAST 2 is a cross-platform program for Bayesian phylogenetic analysis of molecular sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BIG-SCAPE | Biosynthetic Genes Similarity Clustering and Prospecting Engine. Defines a distance metric between Gene Clusters using a combination of three indices (Jaccard Index of domain types, Domain Sequence Similarity the Adjacency Index) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BlobTools | A modular command-line solution for visualisation, quality control and taxonomic partitioning of genome datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BOLDigger | Python program to query .fasta files against the different databases of www.boldsystems.org |
SLURM Cluster: How to use Genobioinfo Cluster: How to use |
Bracken | Bracken (Bayesian Reestimation of Abundance with KrakEN) is a highly accurate statistical method that computes the abundance of species in DNA sequences from a metagenomics sample. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BUCKy | BUCKy is a free program to combine molecular data from multiple loci. BUCKy estimates the dominant history of sampled individuals, and how much of the genome supports each relationship, using Bayesian concordance analysis. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAFE | Software for Computational Analysis of gene Family Evolution. The purpose of CAFE is to analyze changes in gene family size in a way that accounts for phylogenetic history and provides a statistical foundation for evolutionary inferences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CAMI-AMBER | AMBER is an evaluation package for the comparative assessment of genome reconstructions and taxonomic assignments from metagenome benchmark datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAMISIM | CAMISIM is a software to model abundance distributions of microbial communities and to simulate corresponding shotgun metagenome datasets. |
Genobioinfo Cluster: How to use |
CCMetagen | CCMetagen processes sequence alignments produced with KMA, which implements the ConClave sorting scheme to achieve highly accurate read mappings. CCMetagen processes sequence alignments produced with KMA, which implements the ConClave sorting scheme to achieve highly accurate read mappings. CCMetagen produces ranked taxonomic results in user-friendly formats that are ready for publication or downstream statistical analyses. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Centrifuge | Classifier for metagenomic sequences. Centrifuge is a novel microbial classification engine that enables rapid, accurate and sensitive labeling of reads and quantification of species on desktop computers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CheckM | Assess the quality of microbial genomes recovered from isolates, single cells, and metagenomes. |
Genobioinfo Cluster: How to use |
CheckM2 | Assessing the quality of metagenome-derived genome bins using machine learning. |
Genobioinfo Cluster: How to use |
ClonalFrameML | A software package that performs efficient inference of recombination in bacterial genomes. |
Genobioinfo Cluster: How to use |
COMEBin | COMEBin allows effective binning of metagenomic contigs using COntrastive Multi-viEw representation learning. |
Genobioinfo Cluster: How to use |
CONCOCT | A program for unsupervised binning of metagenomic contigs by using nucleotide composition, coverage data in multiple samples and linkage data from paired end reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CoverM | Read coverage calculator for metagenomics. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
CRABS | CRABS (Creating Reference databases for Amplicon-Based Sequencing) is a versatile software program that generates curated reference databases for metagenomic analysis. |
Genobioinfo Cluster: How to use |
d2SBin | Improving the binning of metagenomic contigs on d2S oligonucleotide frequency dissimilarity | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DAS_Tool | An automated method that integrates the results of a flexible number of binning algorithms to calculate an optimized, non-redundant set of bins from a single assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DATES | DATES (Distribution of Ancestry Tracts of Evolutionary Signals) is a method to estimate the time of admixture in ancient DNA samples described in Narasimhan, Patterson et al. 2018 | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
dbcAmplicons | Analysis of Double Barcoded Illumina Amplicon Data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
decOM | decOM is a high-accuracy microbial source tracking method that is suitable for contamination quantification in paleogenomics, namely the analysis of collections of possibly contaminated ancient oral metagenomic data sets. |
Genobioinfo Cluster: How to use |
DECX | This is the DECX (DEC eXtended) model for historical biogeographic inference | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DESMAN | De novo Extraction of Strains from MetAgeNomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DLCpar | DLCpar is a reconciliation method for inferring gene duplications, losses, and coalescence (accounting for incomplete lineage sorting). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DRAM | DRAM (Distilled and Refined Annotation of Metabolism) is a tool for annotating metagenomic assembled genomes and VirSorter identified viral contigs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EggLib | EggLib is a C++/Python library and program package for evolutionary genetics and genomics. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
eMPRess | eMPRess is a software tool for reconciling pairs of phylogenetic trees such as host-parasite, host-symbiont, and species-gene trees under the Duplication-Transfer-Loss (DTL) model. | Genologin Cluster: How to use Genobioinfo Cluster: Genobioinfo Cluster: How to use |
Emu | Emu is a relative abundance estimator for 16S genomic sequences. The method is optimized for error-prone full-length reads, but can also be utilized for short-read data. |
Genobioinfo Cluster: How to use |
EPIK | EPIK is a program for rapid alignment-free phylogenetic placement. |
Genobioinfo Cluster: How to use |
ETE | A Python framework for the analysis and visualization of trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EukCC | EukCC is a completeness and contamination estimator for metagenomic assembled microbial eukaryotic genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EukRep | Classification of Eukaryotic and Prokaryotic sequences from metagenomic datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Exabayes | ExaBayes is a software package for Bayesian tree inference. It is particularly suitable for large-scale analyses on computer clusters. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ExaML | Exascale Maximum Likelihood (ExaML) code for phylogenetic inference using MPI. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastME | FastME provides distance algorithms to infer phylogenies. FastME is based on balanced minimum evolution, which is the very principle of NJ. FastME improves over NJ by performing topological moves using fast, sophisticated algorithms. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastTree | FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FigTree | FigTree is designed as a graphical viewer of phylogenetic trees and as a program for producing publication-ready figures. |
Genobioinfo Cluster: How to use |
FROGS | FROGS is a CLI workflow designed to produce an OTU count matrix from high depth sequencing amplicon data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
G-PhoCS | G-PhoCS is a software package for inferring ancestral population sizes, population divergence times, and migration rates from individual genome sequences. | Genobioinfo Cluster: How to use |
ganon | ganon classifies DNA sequences against large sets of genomic reference sequences efficiently. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gappa | A toolkit for analyzing and visualizing phylogenetic (placement) data. | Genobioinfo Cluster: How to use |
GARLI | GARLI, Genetic Algorithm for Rapid Likelihood Inference is a program for inferring phylogenetic trees. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gblocks | Gblocks is a computer program written in ANSI C language that eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. These positions may not be homologous or may have been saturated by multiple substitutions and it is convenient to eliminate them prior to phylogenetic analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gotree | Gotree is a set of command line tools and an API to manipulate phylogenetic trees. |
Genobioinfo Cluster: How to use |
GrapeTree | GrapeTree is a fully interactive, tree visualization program within EnteroBase, which supports facile manipulations of both tree layout and metadata. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraPhlAn | GraPhlAn is a software tool for producing high-quality circular representations of taxonomic and phylogenetic trees. It focuses on concise, integrative, informative, and publication-ready representations of phylogenetically- and taxonomically-driven investigation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GTDB-Tk | GTDB-Tk is a software toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes based on the Genome Database Taxonomy GTDB. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gubbins | Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences. Gubbins (Genealogies Unbiased By recomBinations In Nucleotide Sequences) is an algorithm that iteratively identifies loci containing elevated densities of base substitutions while concurrently constructing a phylogeny based on the putative point mutations outside of these regions. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hapflk | hapflk is a software implementing the hapFLK <1> and FLK <2> tests for the detection of selection signatures based on multiple population genotyping data. | Genologin Cluster: In Python-2.7.15 (module load system/Python-2.7.15) Genobioinfo Cluster: How to use |
HAPHPIPE | NGS viral assembly and population genetics. |
Genobioinfo Cluster: How to use |
hifiasm-meta | De novo metagenome assembler, based on hifiasm, a haplotype-resolved de novo assembler for PacBio Hifi reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HyPhy | HyPhy is an open-source software package for the analysis of genetic sequences using techniques in phylogenetics, molecular evolution, and machine learning. It features a complete graphical user interface (GUI) and a rich scripting language for limitless customization of analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
IPK | IPK is a tool for computing phylo-k-mers for a fixed phylogeny. |
Genobioinfo Cluster: How to use |
IQ-TREE | Efficient phylogenomic software by maximum likelihood |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ITSx | Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for use in environmental sequencing | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iVar | Var is a computational package that contains functions broadly useful for viral amplicon-based sequencing. Additional tools for metagenomic sequencing are actively being incorporated into iVar. While each of these functions can be accomplished using existing tools, iVar contains an intersection of functionality from multiple tools that are required to call iSNVs and consensus sequences from viral sequencing data across multiple replicates. We implemented the following functions in iVar: (1) trimming of primers and low-quality bases, (2) consensus calling, (3) variant calling - both iSNVs and insertions/deletions, and (4) identifying mismatches to primer sequences and excluding the corresponding reads from alignment files. |
Genobioinfo Cluster: How to use |
jModeltest | jModelTest is a tool to carry out statistical selection of best-fit models of nucleotide substitution. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
JustOrthologs | A Fast, Accurate, and User-Friendly Ortholog-Finding Algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Kaiju | Fast taxonomic classification of metagenomic sequencing reads using a protein reference database |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Kraken | Kraken is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Kraken2 | Kraken2 is a system for assigning taxonomic labels to short DNA sequences, usually obtained through metagenomic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KrakenTools | KrakenTools provides individual scripts to analyze Kraken/Kraken2/Bracken/KrakenUniq output files. |
Genobioinfo Cluster: How to use |
KrakenUniq | KrakenUniq (formerly KrakenHLL) is a novel metagenomics classifier that combines the fast k-mer-based classification of Kraken with an efficient algorithm for assessing the coverage of unique k-mers found in each species in a dataset. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kSNP4 | kSNP4 identifies the pan-genome SNPs in a set of genome sequences, and estimates phylogenetic trees based upon those SNPs. |
Genobioinfo Cluster: How to use |
LEfSe | LEfSe (Linear discriminant analysis effect size) is a tool developed by the Huttenhower group to find biomarkers between 2 or more groups using relative abundances. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LSx | LSX is a script in R that runs the LS³ and LS⁴ algorithms of data subsampling for multigene phylogenetic inference. Both of these algorithms do a gene-by-gene inspection of the heterogeneity of evolutionary rates among user-defined lineages of interest (LOI). Then, using criteria that differ in both algorithms (see details here or in the papers), they try to find a subsample of sequences that evolve at a homogeneous rate across all LOIs. If this subset is found, an alignment of the gene is produced with only the sequences that evolve homogeneously. At the same time, a table is also produced showing which sequences were “flagged” (the sequences that were removed), and which sequences were kept. If a subset of sequences that evolve at a homogeneous rate is not found, the gene is flagged entirely. |
Genobioinfo Cluster: How to use |
MALT | MALT (MEGAN alignment tool) is an extension of MEGAN (metagenome analyzer). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MARVEL_bins | MARVEL (Metagenomic Analysis and Retrieval of Viral Elements) is a tool for recovery of draft phage genomes from whole community shotgun metagenomic sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mashtree | Create a tree using Mash distances. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MaxBin2 | MaxBin is a software for binning assembled metagenomic sequences based on an Expectation-Maximization algorithm. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MEGAHIT | An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MEGAN | MEtaGenome ANalyzer : Metagenomic data analysis : taxonomic and functionnal (SEED and KEGG classification) analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaBat | An adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
metaBIT | An integrative and automated metagenomic pipeline for analysing microbial profiles from high-throughput sequencing shotgun data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
metaDMG | A fast and accurate ancient DNA damage toolkit for metagenomic data. |
Genobioinfo Cluster: How to use |
MetaEuk | MetaEuk is a modular toolkit designed for large-scale gene discovery and annotation in eukaryotic metagenomic contigs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
METAL | The METAL software is designed to facilitate meta-analysis of large datasets (such as several whole genome scans) in a convenient, rapid and memory efficient manner. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaMDBG | A lightweight assembler for long and accurate metagenomics reads. |
Genobioinfo Cluster: How to use |
MetaPhlAn | MetaPhlAn is a computational tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaPhlAn2 | MetaPhlAn2 is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea, Eukaryotes and Viruses) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MetaPhlAn3 | MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea and Eukaryotes) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With the newly added StrainPhlAn module, it is now possible to perform accurate strain-level microbial profiling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaPhlAn4 | MetaPhlAn is a computational tool for profiling the composition of microbial communities (Bacteria, Archaea and Eukaryotes) from metagenomic shotgun sequencing data (i.e. not 16S) with species-level. With StrainPhlAn, it is possible to perform accurate strain-level microbial profiling. |
Genobioinfo Cluster: How to use |
MetaWRAP | A flexible pipeline for genome-resolved metagenomic data analysis. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Metaxa2 | Improved Identification and Taxonomic Classification of Small and Large Subunit rRNA in Metagenomic Data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
meteor | Meteor (Metagenomic Explorator), a software for profiling metagenomic data at gene level. |
Genobioinfo Cluster: How to use |
Migrate | Migrate estimates effective population sizes,past migration rates between n population assuming a migration matrix model with asymmetric migration rates and different subpopulation sizes, and population divergences or admixture. | Genobioinfo Cluster: How to use |
ModelTest-NG | ModelTest-NG is a tool for selecting the best-fit model of evolution for DNA and protein alignments. ModelTest-NG supersedes jModelTest and ProtTest in one single tool, with graphical and command console interfaces. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mothur | The one-stop source for your computational microbial ecology needs. mothur offers the ability to go from raw sequences to the generation of visualization tools to describe alpha and beta diversity. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mPTP | A tool for single-locus species delimitation. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MrBayes | MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MVP | MVP stands for Multi-choice Viromics Pipeline. It is a simplified pipeline that utilizes a suite of state-of-art tools to easily get from a set of contigs to a vOTU heatmap (and more). |
Genobioinfo Cluster: How to use |
MyCC | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoCLUST | NanoCLUST is an analysis pipeline for UMAP-based classification of amplicon-based full-length 16S rRNA nanopore reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Newick_Utilities | The Newick Utilities are a suite of Unix shell tools for processing phylogenetic trees. We distribute the package under the BSD License. Functions include re-rooting, extracting subtrees, trimming, pruning, condensing, drawing (ASCII graphics or SVG). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NINJA | Nearly Infinite Neighbor Joining Application | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
orthAgogue | a tool for high speed estimation of homology relations within and between species in massive data sets. orthAgogue is easy to use and offers flexibility through a range of optional parameters. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OrthoFinder | OrthoFinder is a fast, accurate and comprehensive analysis tool for comparative genomics. It finds orthologues and orthogroups infers rooted gene trees for all orthogroups and infers a rooted species tree for the species being analysed. OrthoFinder also provides comprehensive statistics for comparative genomic analyses. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PALEOMIX | The PALEOMIX pipeline is a set of free and open-source pipelines and tools designed to enable the rapid processing of Next Generation Sequencing (NGS) data, starting from de-multiplexed reads from one or more samples, through sequence processing and alignment, and ending with genotyping, phylogenetic inference on the samples, as well as metagenomic analysis of the samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PAML | PAML is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanTools | PanTools is a toolkit for comparative analysis of large number of genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ParGenes | A massively parallel tool for model selection and tree inference on thousands of genes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PartitionFinder | PartitionFinder is free open source software to select best-fit partitioning schemes and models of molecular evolution for phylogenetic analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PathoFact | PathoFact is an easy-to-use modular pipeline for the metagenomic analyses of toxins, virulence factors and antimicrobial resistance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pathPhynder | A workflow for integrating ancient lineages into present-day phylogenies. |
Genobioinfo Cluster: How to use |
PAUP | Tools for inferring and interpreting phylogenetic trees |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pelican | Pelican is a reimplementation of the model of Tamuri et al. (2009) to identify sites undergoing different kinds of directional selection in different parts of a phylogenetic tree. |
Genobioinfo Cluster: How to use |
PHASE | PHASE is a package that performs molecular phylogenetic inference. The software seeks to accurately compare molecular sequences to determine the likely evolutionary relationships between a group of species. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PHAST | Phylogenetic Analysis with Space/Time models (PHAST) is a freely available software package consisting of a collection of command-line programs and supporting libraries for comparative and evolutionary genomics. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhiPack | The Phi Test is a simple, rapid, and statistically efficient test for recombination. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PhyKIT | PhyKIT is a UNIX shell toolkit for processing and analyzing phylogenomic data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PHYLIP | PHYLIP (PHYLogeny Inference Package), is a package composed by 34 programs dedicated to phylogeny inference. Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites and fragments, distance matrices, and discrete characters.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhyloBayes | PhyloBayes is a Bayesian Monte Carlo Markov Chain (MCMC) sampler for phylogenetic reconstruction and molecular dating using protein and nucleic acid alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Phylobayes_MPI | PhyloBayes (Lartillot et al, 2009) is a Bayesian Monte Carlo Markov Chain (MCMC) sampler for phylogenetic reconstruction. With MPI. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
phyloFlash | phyloFlash is a pipeline to rapidly reconstruct the SSU rRNAs and explore phylogenetic composition of an illumina (meta)genomic dataset. |
Genobioinfo Cluster: How to use |
PhyloNet | Genologin Cluster: How to use Genobioinfo Cluster: How to use | |
PhyloPhlAn | PhyloPhlAn is a computational pipeline for reconstructing highly accurate and resolved phylogenetic trees based on whole-genome sequence information. The pipeline is scalable to thousands of genomes and uses the most conserved 400 proteins for extracting the phylogenetic signal. PhyloPhlAn also implements taxonomic curation, estimation, and insertion operations. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
phyluce | phyluce (phy-loo-chee) is a software package that was initially developed for analyzing data collected from ultraconserved elements in organismal genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PhyML | PhyML is a phylogeny software based on the maximum-likelihood principle. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
phyx | phyx performs phylogenetics analyses on trees and sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PICRUSt | PICRUSt (pronounced モpie crustヤ) is a bioinformatics software package designed to predict metagenome functional content from marker gene (e.g., 16S rRNA) surveys and full genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Plasmer | An accurate and sensitive bacterial plasmid identification tool based on deep machine-learning of shared k-mers and genomic features. |
Genobioinfo Cluster: How to use |
Plass | Plass (Protein-Level ASSembler) is a software to assemble short read sequencing data on a protein level. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ProphET | ProphET, Prophage Estimation Tool: a standalone prophage sequence prediction tool with self-updating reference database. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Proteinortho | Proteinortho is a tool to detect orthologous genes within different species. |
Genobioinfo Cluster: How to use |
pyMLST | Python Mlst Local Search Tool. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
QIIME | QIIME (pronounced "chime") stands for Quantitative Insights Into ttMicrobial Ecology. QIIME is an open source software package for ttcomparison and analysis of microbial communities, primarily based on tthigh-throughput amplicon sequencing data (such as SSU rRNA) generated tton a variety of platforms, but also supporting analysis of other types ttof data (such as shotgun metagenomic data). QIIME takes users from tttheir raw sequencing output through initial analyses such as OTU ttpicking, taxonomic assignment, and construction of phylogenetic trees ttfrom representative sequences of OTUs, and through downstream ttstatistical analysis, visualization, and production of ttpublication-quality graphics. QIIME has been applied to single studies ttbased on billions of sequences from thousands of samples. ttttt |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
r8s | This package implements several methods to infer divergence times on a molecular phylogeny, using penalized likelihood, maximum likelihood and nonparametric rate smoothing methods. It also implements miscellaneous tree and character evolution models and tests. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RAiSD | RAiSD (Raised Accuracy in Sweep Detection) is a stand-alone software implementation of the μ statistic for selective sweep detection. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAxML | RAxML (Randomized Axelerated Maximum Likelihood) is a program for sequential and parallel Maximum Likelihood based inference of large phylogenetic trees. It can also be used for postanalyses of sets of phylogenetic trees, analyses of alignments and, evolutionary placement of short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RAxML-NG | RAxML-NG is a phylogenetic tree inference tool which uses maximum-likelihood (ML) optimality criterion. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ray | Assemble genomes in parallel using the message-passing interface | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RDP Classifier | The RDP Classifier is a naive Bayesian classifier that can rapidly and accurately provides taxonomic assignments from domain to genus, with confidence estimates for each assignment.
|
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RDPTools | Collection of commonly used RDP Tools for easy building | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
read2tree | read2tree is a software tool that allows to obtain alignment matrices for tree inference. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
REFMAKER | REFMAKER is a command-line and user-friendly pipeline providing different tools to create nuclear references from genomic assemblies of shotgun libraries. |
Genobioinfo Cluster: How to use |
REINDEER | Efficient indexing of k-mer presence and abundance in sequencing datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RevBayes | RevBayes provides an interactive environment for statistical computation in phylogenetics. It is primarily intended for modeling, simulation, and Bayesian inference in evolutionary biology, particularly phylogenetics. However, the environment is quite general and can be useful for many complex modeling tasks. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RFMIX | A discriminative method for local ancestry inference |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RogueNaRok | A versatile and scalable algorithm for rogue taxon identification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RootDigger | RootDigger is a program that will, when given a MSA and an unrooted tree with branch lengths place a root on the given tree. For the foreseeable future, RootDigger will only support DNA data, as the method RootDigger uses is ineffective when using AA data. |
Genobioinfo Cluster: How to use |
Scoary | Scoary is designed to take the gene_presence_absence.csv file from Roary as well as a traits file created by the user and calculate the assocations between all genes in the accessory genome and the traits. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Seq-Gen | Seq-Gen is a program that will simulate the evolution of nucleotide or amino acid sequences along a phylogeny, using common models of the substitution process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SGSGeneLoss | Gene presence/absence variation discovery. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Simka | Simka is a de novo comparative metagenomics tool. Simka represents each dataset as a k-mer spectrum and compute several classical ecological distances between them. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
singlem | SingleM is a tool to find the abundances of discrete operational taxonomic units (OTUs) directly from shotgun metagenome data, without heavy reliance on reference sequence databases. It is able to differentiate closely related species even if those species are from lineages new to science. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLR | SLR is a program to detect sites in coding DNA that are unusually conserved and/or unusually variable (that is, evolving under purify or positive selection) by analysing the pattern of changes for an alignment of sequences on an evolutionary tree. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SNPhylo | a pipeline to generate a phylogenetic tree from huge SNP data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SortaDate | Scripts that you can use at different stages to attempt to find more clock-like genes. Generally, you would use these for dating analyses with another package | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SPECTRE | A collection of Phylogenetics tools for creating and manipulating networks and trees. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SqueezeMeta | A fully automated metagenomics pipeline, from reads to bins. |
Genobioinfo Cluster: How to use |
SSU-ALIGN | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install | |
Sumatra | Sumatra was developed by the LECA and aims to compute a great deal of sequence similarities in a fast and exact way, based on the length of the Longest Common Subsequence (LCS) between two sequences. Sequence clustering based on similarities is also available through Sumaclust. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SUPER-FOCUS | A tool for agile functional analysis of metagenomic data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SuperCRUNCH | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install | |
swarm | A robust and fast clustering method for amplicon-based studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SweeD | A parallel and checkpointable tool that implements a composite likelihood ratio test for detecting selective sweeps. SweeD is based on the SweepFinder algorithm (Nielsen et al. 2005). SweeD can calculate the theoretical SFS of a given demographic model (stepwise changes or with an exponential growth phase + stepwise changes) by using the method by Živković and Stephan (2011). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TACT | Adds tips to a backbone phylogeny using taxonomy simulated with birth-death models | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Taxoniq | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install | |
TaxonKit | A Practical and Efficient NCBI Taxonomy Toolkit |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TKGWV2 | TKGWV2 is a pipeline to estimate biological relatedness (1st, 2nd, and unrelated degrees) between individuals specifically aimed at ultra-low coverage ancient DNA data obtained from whole genome sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TOGA | TOGA is a new method that integrates gene annotation, inferring orthologs and classifying genes as intact or lost. |
Genobioinfo Cluster: How to use |
TOPALI-v2 | A rich graphical interface for evolutionary analyses of multiple alignments on HPC clusters and multi-core desktops. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TreeBeST | TreeBeST, which stands for (gene) Tree Building guided by Species Tree, is a versatile program that builds, manipulates and displays phylogenetic trees. It is particularly designed for building gene trees with a known species tree and is highly efficient and accurate. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
treePL | treePL is a phylogenetic penalized likelihood program. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TreeShrink | TreeShrink is an algorithm for detecting abnormally long branches in one or more phylogenetic trees. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TreeTime | Maximum likelihood inference of time stamped phylogenies and ancestral reconstruction. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
trimAl | trimAl: a tool for automated alignment trimmin | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Twisst | Topology weighting by iterative sampling of sub-trees. |
Genobioinfo Cluster: How to use |
vAMPirus | Automated virus amplicon sequencing analysis program integrated with Nextflow pipeline manager. |
Genobioinfo Cluster: How to use |
VIRify | VIRify is a pipeline for the detection, annotation, and taxonomic classification of viral contigs in metagenomic and metatranscriptomic assemblies. |
Genobioinfo Cluster: How to use |
WGDI | WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidizations and cross-species genome alignments. | Genobioinfo Cluster: How to use |
Whokaryote | Classification of metagenomic contigs as eukaryotic/prokaryotic using biology-based features. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wLogDate | Molecular Dating using logarithmic penalty function. wLogDate is a method for dating phylogenetic trees. Given a phylogeny and either sampling times for leaves or calibration points for internal nodes, wLogDate outputs a "dated" tree that conforms to the sampling times or calibration points. It can also work with no sampling time or calibration points where it would simply turn the tree into ultrametric, fixing its height to a given value. Its optimization criterion is to minimize the variance of the mutation rates in log scale (hence the term logDate). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
ABCtoolbox | BCtoolbox is a general-purpose program to perform Approximate Bayesian Computation. ABCtoolbox can be used for ABC inference on almost any type of model, including models arising in physics, biology or engineering. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ADMIXTOOLS | ADMIXTOOLS (Patterson et al. 2012) is a software package that supports formal tests of whether admixture occurred, and makes it possible to infer admixture proportions and dates. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Admixture | ADMIXTURE is a software tool for maximum likelihood estimation of individual ancestries from multilocus SNP genotype datasets. It uses the same statistical model as STRUCTURE but calculates estimates much more rapidly using a fast numerical optimization algorithm. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ALDER | The ALDER software computes the weighted linkage disequilibrium (LD) statistic for making inference about population admixture |
Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
AlphaImpute | AlphaImpute is a software package for imputing and phasing genotype data in diploid populations with pedigree information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Ancestry HMM | A hidden Markov model approach for simultaneously estimating local ancestry and admixture time using next generation sequence. |
Genobioinfo Cluster: How to use |
ancIBD | Identify IBD segments between pairs of individuals in ancient human DNA data. The software package Software documentation here. |
Genobioinfo Cluster: How to use |
ANGSD | ANGSD is a software for analyzing next generation sequencing data. The software can handle a number of different input types from mapped reads to imputed genotype probabilities. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ASMC | Ascertained Sequentially Markovian Coalescent (contains ASMC and an extension, FastSMC, together with python bindings for both) |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BAMM | A program for multimodel inference on speciation and trait evolution. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BAMscorer | BAMscorer can be used to conduct genomic assignment tests from BAM files. Assignments can be done on genomic regions, inversions, and whole-genome datasets. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayeScan | Detecting natural selection from population-bases genetic data using differences in alleles frequencies between populations. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BayeScEnv | BayeScEnv is a Fst-based, genome-scan method that uses environmental variables to detect local adaptation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BayPass | The package BayPass is a population genomics software which is primarily aimed at identifying genetic markers subjected to selection and/or associated to population-specific covariates (e.g., environmental variables, quantitative or categorical phenotypic characteristics). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Beagle | BEAGLE is a state of the art software package for analysis of large-scale genetic data sets with hundreds of thousands of markers genotyped on thousands of samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Circuitscape | Circuitscape borrows algorithms from electronic circuit theory to predict patterns of movement, gene flow, and genetic differentiation among plant and animal populations in heterogeneous landscapes. |
Genobioinfo Cluster: How to use |
CLUMPAK | Clustering Markov Packager Across K - was developed in order to aid users analyse the results of STRUCTURE-like programs. The software offers a few alternative modes of action, please go to the Help section for detailed about these modes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Comp-D | A program for comprehensive computation of D-statistics and population summaries (serial version). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
currentNE | Estimation of current effective population using artificial neural networks. |
Genobioinfo Cluster: How to use |
dadi | dadi implements a method for demographic inference from genetic data, based on a diffusion approximation to the allele frequency spectrum. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DFE-alpha | DFE-alpha was initially written to estimate the distribution of fitness effects (DFE) of new deleterious mutations using within-species nucleotide polymorphism data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DIYABC | A user-friendly approach to Approximate Bayesian Computation for inference on population history using molecular markers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Dsuite | Fast calculation of Paterson's D (ABBA-BABA) and the f4-ratio statistics across many populations/species |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
easySFS | easySFS is a tool for the effective selection of population size projection for construction of the site frequency spectrum. It may be used to convert VCF to dadi/fastsimcoal/momi2 style SFS for demographic analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EEMS | EEMS method for analyzing and visualizing spatial population structure from geo-referenced genetic samples. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Eigensoft | The EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification correction method (Price et al. 2006). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ELAI | The software performs local ancestry inference for admixed individuals. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EMMAX | EMMAX is a statistical test for large scale human or model organism association mapping accounting for the sample structure. In addition to the computational efficiency obtained by EMMA algorithm, EMMAX takes advantage of the fact that each loci explains only a small fraction of complex traits, which allows us to avoid repetitive variance component estimation procedure, resulting in a significant amount of increase in computational time of association mapping using mixed model. |
Genobioinfo Cluster: How to use |
est-sfs | est-sfs implements a maximum likelihood method to infer the unfolded site frequency spectrum (the uSFS) and ancestral state probabilities for DNA sequence data. |
Genologin Cluster: How to use |
FaMoz | FaMoz, a software written in the C language and in TclTk, uses likelihood calculation and simulation to perform parentage studies with codominant, dominant, cytoplasmic markers or combinations of the different types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastNGSadmix | Program for infering admixture proportions and doing PCA with a single NGS sample. Inferences based on reference panel. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FastSimBac | FastSimBac is a simulator of the coalescent process with bacterial recombination that simulates genealogies spatially across chromosomes as a Markov process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastsimcoal2 | Fast sequential Markov coalescent simulation of genomic data under complex evolutionary models | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastStructure | fastStructure is an algorithm for inferring population structure from large SNP genotype data. It is based on a variational Bayesian framework for posterior inference and is written in Python2.x. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FBAT | FBAT is an acronym for Family-Based Association Tests in genetic analyses. Family-based association designs, as opposed to case-control study designs, are particularly attractive, since they test for linkage as well as association, avoid spurious associations caused by admixture of populations, and are convenient for investigators interested in refining linkage findings in family samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fineSTRUCTURE | fineSTRUCTURE is a fast and powerful algorithm for identifying population structure using dense sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
flare | The flare program uses a set of reference haplotypes to infer the ancestry of each allele in a set of admixed study samples. The flare program is fast, accurate, and memory-efficient. | Genobioinfo Cluster: How to use |
G-PhoCS | G-PhoCS is a software package for inferring ancestral population sizes, population divergence times, and migration rates from individual genome sequences. | Genobioinfo Cluster: How to use |
Gamma-SMC | This is an alternative and an upgrade of the widely used PSMC method, which infers population size trajectories from VCF files. |
Genobioinfo Cluster: How to use |
Genepop | Population genetics software that computes estimates of F-statistics. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
genomegaMap | Within-species genome-wide dN/dS estimation from very many genomes. | Genobioinfo Cluster: How to use |
GenomeSTRiP | Genome STRiP (Genome STRucture In Populations) is a suite of tools for discovering and genotyping structural variations using sequencing data. The methods are designed to detect shared variation using data from multiple individuals. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GEVA | Genealogical Estimation of Variant Age. We have developed a method for estimating the age of genetic variants; that is, the time of origin of an allele through mutation at a single locus. Our approach, which we refer to as the Genealogical Estimation of Variant Age (GEVA), is similar to existing methods that involve coalescent modeling to infer the time to the most recent common ancestor (TMRCA) between individual genomes <13, 23, 24>. However, these methods typically operate on a discretized timescale <13>, utilize only a fraction of the information available in larger sample data <25>, or employ approximations to overcome computational complexity <14, 15, 26>. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gIMble | A genome-wide IM blockwise likelihood estimation toolkit | Genologin Cluster: Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Gtools | GTOOL is a program for transforming sets of genotype data for use with the programs SNPTEST and IMPUTE. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HapCUT2 | Software tools for haplotype assembly from sequence data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
hapflk | hapflk is a software implementing the hapFLK <1> and FLK <2> tests for the detection of selection signatures based on multiple population genotyping data. | Genologin Cluster: In Python-2.7.15 (module load system/Python-2.7.15) Genobioinfo Cluster: How to use |
Haplogrep | Haplogrep is a command-line tool for mtDNA haplogroup classification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Haplostrips | Haplostrips produce plots that depict variants in a genomic window among different samples. Visualize similarities between haplotypes with respect to a reference haplotype through haplotype clustering and sorting, useful for revealing hidden population structure. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iSMC | This software extend the sequentially Markovian coalescent model to jointly infer the spatial variation in recombination rate (rho) from a single pair of unphased genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LDhat | LDhat is a package written in the C and C++ languages for the analysis of recombination rates from population genetic data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LDhelmet | LDhelmet performs statistical inference for fine-scale variable recombination rate estimation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MALDER | This is a version of ALDER (http://groups.csail.mit.edu/cb/alder/) that has been modified to allow multiple admixture events. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MapThin | Reduce the number of SNPs in a gene marker dense map computed by PLINK. First, by eliminating linked SNPs. Then, by applying different criteria. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mauve | Mauve is a system for efficiently constructing multiple genome alignments in the presence of large-scale evolutionary events such as rearrangement and inversion. Multiple genome alignment provides a basis for research into comparative genomics and the study of evolutionary dynamics. Aligning whole genomes is a fundamentally different problem than aligning short sequences. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Migrate | Migrate estimates effective population sizes,past migration rates between n population assuming a migration matrix model with asymmetric migration rates and different subpopulation sizes, and population divergences or admixture. | Genobioinfo Cluster: How to use |
MLST | Multi-Locus sequence Typing. The method enables investigators to determine the ST based on WGS data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ms | A program for generating samples under neutral models. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msBayes | msBayes allows complex and flexible comparative phylogeographic inference. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
msmc | This software implements MSMC, a method to infer population size and gene flow from multiple genome sequences | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msmc2 | This program implements MSMC2, a method to infer population size history and population separation history from whole genome sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
msprime |
Please see the documentation for more details |
Genobioinfo Cluster: How to use |
msums | A program for the efficient computation of a number of population genetics summary statistics. msums can read ms-format data on (nearly) arbitrary numbers of populations. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Neural-ADMIXTURE | Neural ADMIXTURE is an unsupervised global ancestry inference technique based on ADMIXTURE. By using neural networks, Neural ADMIXTURE offers high quality ancestry assignments with a running time which is much faster than ADMIXTURE's. |
Genobioinfo Cluster: How to use |
ngsLD | ngsLD is a program to estimate pairwise linkage disequilibrium (LD) taking the uncertainty of genotype's assignation into account. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ngsRelate | NgsRelate can be used to infer relatedness coefficients for pairs of individuals from low coverage Next Generation Sequencing (NGS) data by using genotype likelihoods instead of called genotypes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ngsTools | ngsTools is a collection of programs for population genetics analyses from NGS data, taking into account its statistical uncertainty. The methods implemented in these programs do not rely on SNP or genotype calling, and are particularly suitable for low sequencing depth data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
omegaplus | A scalable tool for rapid detection of selective sweeps in whole-genome datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PCAdmix | PCAdmix is a method that estimates local ancestry via principal components analysis (PCA) using phased haplotypes. The method considers data chromosome by chromosome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PGDSpider | PGDSpider is a powerful automated data conversion tool for population genetic and genomics programs. It facilitates the data exchange possibilities between programs for a vast range of data types (e.g. DNA, RNA, NGS, microsatellite, SNP, RFLP, AFLP, multi-allelic data, allele frequency or genetic distances) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Phy-Mer | A novel alignment-free and reference-independent mitochondrial haplogroup classifier. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pixy | pixy is a command-line tool for painlessly and correctly estimating average nucleotide diversity within (π) and between (dxy) populations from a VCF. | Genobioinfo Cluster: How to use |
PMERGE | PMERGE, is a software, which implements a new method that identifies candidate PSVs by building networks of loci that share high levels of nucleotide similarity. The PMERGE is embedded in the analysis pipeline of the widely used Stacks software, and it is straightforward to apply it as an additional filter in population-genomic studies using RAD-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pong | pong is a freely available software package, released by Behr et al. (2016, Bioinformatics), for post-processing output from clustering inference using population genetic data. |
Genologin Cluster: In Python-2.7.15 Genobioinfo Cluster: Ask for Install |
popins | Population-scale detection of novel-sequence insertions. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PoPoolation | PoPoolation is a pipeline for analysing pooled next generation sequencing data. Currently PoPoolation allows to calculate Tajima’s Pi, Watterson’s Theta and Tajima’s D with a sliding window approach for chromosomes or for set of genes. |
Genobioinfo Cluster: How to use |
PoPoolation2 | PoPoolation2 allows to compare allele frequencies for SNPs between two or more populations and to identify significant differences. PoPoolation2 requires next generation sequencing data of pooled genomic DNA (Pool-Seq). It may be used for measuring differentiation between populations, for genome wide association studies and for experimental evolution. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
popPhylABC | Scripts used for ABC analysis with homo- and heterogeneity in Migration rates or/and Effective population sizes | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
psmc | This software package infers population size history from a diploid sequence |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pyrho | Fast demography-aware inference of fine-scale recombination rates based on fused-LASSO. |
Genobioinfo Cluster: How to use |
qpWrapper | Tools allowing to launch qpAdmn analyzes (Admixtools) in series on a list of individuals. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
quickLD | High-performance Computation of Linkage Disequilibrium on CPUs and GPUs. |
Genobioinfo Cluster: How to use |
RFMIX | A discriminative method for local ancestry inference |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SCCmecFinder | SCCmecFinder identifies SCCmec elements in sequenced S. aureus isolates. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
scrm | A coalescent simulator for genome-scale sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
selscan | A program to calculate EHH-based scans for positive selection in genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
simuPOP | simuPOP is a general-purpose individual-based forward-time population genetics simulation environment. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SLiM | SLiM is an evolutionary simulation framework that combines a powerful engine for population genetic simulations with the capability of modeling arbitrarily complex evolutionary scenarios. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SMC++ | SMC++ is a program for estimating the size history of populations from whole genome sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
sNMF | A fast and efficient program for estimating individual admixture coefficients based on sparse non-negative matrix factorization and population genetics. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
spaTyper | Computational method for finding spa types. Staphylococcus aureus is a major human pathogen causing skin and tissue infections, pneumonia, septicemia, and device-associated infections. The emergence of strains resistant to methicillin (MRSA) and other antibacterial agents has become a major concern, especially in the hospital environment, because of the high mortality of the infections caused by these strains. Single locus DNA-sequencing of the repeat region of the Staphylococcus protein A gene (spa) can be used for reliable, accurate and discriminatory typing of MRSA. Repeats are assigned a numerical code and the spa-type is deduced from the order of specific repeats. However, spa-typing was hampered in the past by the lack of a consensus on assignments of new spa-repeats and -types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
stairway_plot | The stairway plot is a method for inferring detailed population demographic history using the site frequency spectrum (SFS) from DNA sequence data. It does not need a pre-defined population model and can be applied to hundreds of unphased sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Structure | The program structure is a free software package for using multi-locus genotype data to investigate population structure. Its uses include inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. It can be applied to most of the commonly-used genetic markers, including SNPS, microsatellites, RFLPs and AFLPs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TreeMix | TreeMix is a method for inferring the patterns of population splits and mixtures in the history of a set of populations. In the underlying model, the modern-day populations in a species are related to a common ancestor via a graph of ancestral populations. We use the allele frequencies in the modern populations to infer the structure of this graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Yleaf | Software for human Y-chromosomal haplogroup inference from next generation sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
CRISPOR | CRISPOR predicts off-targets in the genome, ranks guides, highlights problematic guides, designs primers and helps with cloning. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lima | Demultiplex Barcoded PacBio Samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MIPgen | Use MIPgen to design custom mip panels for target enrichment of moderate to high complexity DNA targets ranging from 120 to 250bp in size. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
AlphaFold | AlphaFold is an AI system developed by DeepMind that predicts a protein’s 3D structure from its amino acid sequence. |
Genobioinfo Cluster: How to use |
Alphafold2-Pytorch | An unofficial working Pytorch implementation of Alphafold2, a 3D protein predictor. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CAID | The CAID software produces all outputs necessary for Critical Assessment of Intrinsic Disorder (CAID) edition, including baselines, references, metrics and plots, starting from predictions and a reference (see Data Availability section to know how to obtain this data). |
Genobioinfo Cluster: How to use |
CITE-seq-Count | A tool that allows to get UMI counts from a single cell protein assay. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ColabFold | ColabFold is an easy-to-use Notebook based environment for fast and convenient protein structure predictions. Its structure prediction is powered by AlphaFold2 and RoseTTAFold combined with a fast multiple sequence alignment generation stage using MMseqs2. |
Genobioinfo Cluster: How to use |
DIAMOND | Accelerated BLAST compatible local sequence aligner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
E2P2 | Ensemble Enzyme Prediction Pipeline. |
Genobioinfo Cluster: How to use |
EMBER3D | Ultra-fast in-silico structure mutation. |
Genobioinfo Cluster: How to use |
EVE | EVE is a set of protein-specific models providing for any single amino acid mutation of interest a score reflecting the propensity of the resulting protein to be pathogenic. |
Genobioinfo Cluster: How to use |
genomegaMap | Within-species genome-wide dN/dS estimation from very many genomes. | Genobioinfo Cluster: How to use |
InterProScan | InterProScan is a tool that combines different protein signature recognition methods into one resource. No less than 14 pattern/profiles databanks can be interrogated.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LOCALIZER | LOCALIZER is a machine learning method for predicting the subcellular localization of both plant proteins and pathogen effectors in the plant cell. |
Genobioinfo Cluster: How to use |
Loctree3 | Protein Subcelullar Localization Sequenced-Based Predictor |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MAESTRO | A Multi AgEnt STability pRedictiOn tool for changes in unfolding free energy upon point mutation. MAESTRO is structure based and distinguishes from similar approaches in the following points: (i) MAESTRO implements a multi-agent machine learning system. (ii) It provides predicted ΔΔG values along with a corresponding prediction quality measure. (iii) MAESTRO is applicable to biological assemblies. (iv) It provides high throughput scanning for multi-point mutations where sites and types of mutation can be comprehensively controlled. (v) Finally, the software provides a specific mode for the prediction of stabilizing disulfide bonds. |
Genobioinfo Cluster: How to use |
Miniprot | Aligning proteins to genomes with splicing and frameshift. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MZmine | MZmine is an open-source software for mass-spectrometry data processing. |
Genobioinfo Cluster: How to use |
OMA | The OMA (Orthologous MAtrix) database is a well-established resource for identifying orthologs among publicly available complete genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OMArk | OMArk is a software for proteome (protein-coding gene repertoire) quality assessment. |
Genobioinfo Cluster: How to use |
OrthoLoger | Standalone pipeline for delineation of orthologs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
OrthoMCL | OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PanTools | PanTools is a toolkit for comparative analysis of large number of genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PfamScan | A program that searches a FASTA file against a library of Pfam HMMs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Phobius | A combined transmembrane topology and signal peptide predictor. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Plass | Plass (Protein-Level ASSembler) is a software to assemble short read sequencing data on a protein level. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Proteinortho | Proteinortho is a tool to detect orthologous genes within different species. |
Genobioinfo Cluster: How to use |
ProtHint | ProtHint is a pipeline for predicting and scoring hints (in the form of introns, start and stop codons) in the genome of interest by mapping and spliced aligning predicted genes to a database of reference protein sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PROVEAN | PROVEAN (Protein Variation Effect Analyzer) is a software tool which predicts whether an amino acid substitution or indel has an impact on the biological function of a protein. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SignalP | SignalP 4.0 server predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms: Gram-positive prokaryotes, Gram-negative prokaryotes, and eukaryotes.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TMHMM | Prediction of transmembrane helices in proteins. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VinaLC | A parallel molecular docking program based on AutoDock Vina. |
Genobioinfo Cluster: How to use |
WoLFPSort | WoLF PSORT is an extension of the PSORT II program for protein subcellular localization prediction, which is based on the PSORT principle. WoLF PSORT converts a protein's amino acid sequences into numerical localization features; based on sorting signals, amino acid composition and functional motifs. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
3rdChimeraMiner | Exploration of whole genome amplification generated chimeric sequences in long-read sequencing data. | Genobioinfo Cluster: How to use |
AdapterRemoval | This program was developed to remove residual adapter sequences from next generation sequencing reads. The program handles both single end and paired end data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ALFATClust | ALignment-Free Adaptive Threshold Clustering:Biological sequence clustering tool with dynamic threshold for individual clusters. Suitable for clustering multiple groups of homologous sequences. | Genobioinfo Cluster: How to use |
Ampliconnoise | AmpliconNoise is a collection of programs for the removal of noise from 454 sequenced PCR amplicons. It involves two steps the removal of noise from the sequencing itself and the removal of PCR point errors. This project also includes the Perseus algorithm for chimera removal. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
atac dnase pipelines | ATAC-seq and DNase-seq processing pipeline. This pipeline is designed for automated end-to-end quality control and processing of ATAC-seq or DNase-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ATLAS | ATLAS stands for Analysis Tools for Low-coverage and Ancient Samples. These tools cover all programs necessary to obtain variant calls, estimates of heterozygosity and more from a BAM file. There are sequence data processing tools, diagnostic tools, and variant discovery tools, similar to GATK by the Broad Institute. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Atropos | Atropos is tool for specific, sensitive, and speedy trimming of NGS reads. It is a fork of Cutadapt read trimmer. | Genologin Cluster: In Python-3.6.3 Genobioinfo Cluster: Ask for Install |
BCOOL | BCOOL is a read corrector for NGS sequencing data that align reads on a de Bruijn graph. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Btrim | A fast and accurate adapter, barcodes, and low-quality region trimming and binning program written in C for next-generating sequencing reads. The search algorithm is based on Eugene Myers' fast bit-vector algorithm. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
charcoal | Remove contaminated contigs from genomes using k-mers and taxonomies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CheckM2 | Assessing the quality of metagenome-derived genome bins using machine learning. |
Genobioinfo Cluster: How to use |
CheckV | CheckV is a fully automated command-line pipeline for assessing the quality of single-contig viral genomes, including identification of host contamination for integrated proviruses, estimating completeness for genome fragments, and identification of closed genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
chopper | This tool, intended for long read sequencing such as PacBio or ONT, filters and trims a fastq file. chopper is a tool that reunites the now outdated softwares NanoFilt and NanoLyse. It permits to filter QC files and has a faster execution time than NanoFilt and NanoLyse.
|
Genobioinfo Cluster: How to use |
ClipAndMerge | Clip&Merge is a tool to clip off adapters from sequencing reads and merge overlapping paired end reads together. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
compleasm | A genome completeness evaluation tool based on miniprot. |
Genobioinfo Cluster: How to use |
Consensify | Consensify is a method for generating a consensus pseudohaploid genome sequence with greatly reduced error rates compared to standard pseudohaploidisation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ContScout | ContScout is a pipeline developed for the identification and removal of contaminating sequences in draft genomes. |
Genobioinfo Cluster: How to use |
CroCo | A program to detect potential cross contaminations in HTS assembled transcriptomes using expression level quantification. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cutadapt | Cutadapt removes adapter sequences from DNA high-throughput sequencing data. This is usually necessary when the read length of the machine is longer than the molecule that is sequenced, such as in microRNA data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
DeChat | Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat. |
Genobioinfo Cluster: How to use |
decOM | decOM is a high-accuracy microbial source tracking method that is suitable for contamination quantification in paleogenomics, namely the analysis of collections of possibly contaminated ancient oral metagenomic data sets. |
Genobioinfo Cluster: How to use |
DeconSeq | Detect and remove contaminations from your sequence data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DeDup | A merged read deduplication tool capable to perform merged read deduplication on single end data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ecoPCR | ecoPCR is an electronic PCR software developed by LECAand Helix-Project . It helps you to estimate Barcode primers quality. In conjunction with OBItools, you can postprocess ecoPCR output to compute barcode coverage and barcode speci?city. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ecoPrimers | ecoPrimer is a barcoding software which is written in C language. It finds universal primers from a set of input DNA sequences by finding conserved regions without "a priori" on candidate sequences. It also evaluates the quality of the primers and barcode regions by measuring the "barcode specificity" and "barcode coverage" indices | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ELECTOR | EvaLuation of Error Correction Tools for lOng Reads. |
|
EMA | EMA uses a latent variable model to align barcoded short-reads (such as those produced by 10x Genomics' sequencing platform). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fastix | A simple command line tool to add prefixes to FASTA headers. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastk-medians | A set of utilities to calculate the median number of times the k-mers in a sequence of interest occur across the whole set. |
Genobioinfo Cluster: How to use |
fastp | A tool designed to provide fast all-in-one preprocessing for FastQ files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastplong | Ultra-fast preprocessing and quality control for long-read sequencing data. |
Genobioinfo Cluster: How to use |
FastQ Screen | FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastq_illumina_filter | This program can filter FASTQ files produced by CASAVA 1.8, and keep/discard reads based on this filter flag. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastq-tools | A collection of small and efficient programs for performing some common and uncommon tasks with FASTQ files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FastQC | A Quality Control application for FastQ files. FastQC is an application which takes a FastQ file and runs a series of tests on it to generate a comprehensive QC report. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fastQValidator | The fastQValidator validates the format of fastq files | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FCS-GX | FCS-GX detects contamination from foreign organisms in genome sequences. This tool is one module within the NCBI Foreign Contamination Screening (FCS) program suite. |
Genobioinfo Cluster: How to use |
Filtlong | Filtlong is a tool for filtering long reads by quality. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FLAS | FLAS is software that makes self-correction for PacBio long reads with fast speed and high throughput. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Flexbar | Flexbar preprocesses high-throughput sequencing data efficiently. It demultiplexes barcoded runs and removes adapter sequences. Moreover, trimming and filtering features are provided. Flexbar increases read mapping rates and improves genome and transcriptome assemblies. It supports next-generation sequencing data in fasta/q and csfasta/q format from Illumina, Roche 454, and the SOLiD platform. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FMLRC | FMLRC, or FM-index Long Read Corrector, is a tool for performing hybrid correction of long read sequencing using the BWT and FM-index of short-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FrameBot | RDP FrameBot is a tool for correcting frameshift errors caused by insertions and deletions in DNA sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
ggCaller | A de Bruijn graph-based gene-caller and pangenome analysis tool. | Genobioinfo Cluster: How to use |
HELEN | HELEN (Homopolymer Encoded Long-read Error-corrector for Nanopore) uses a Recurrent-Neural-Network (RNN) based Multi-Task Learning (MTL) model that can predict a base and a run-length for each genomic position using the weights generated by MarginPolish. This installation includes MarginPolish. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HG-CoLoR | HG-CoLoR (Hybrid method based on a variable-order de bruijn Graph for the error Correction of Long Reads) is a hybrid method for the error correction of long reads that both aligns the short reads to the long reads, and uses a variable-order de Bruijn graph, in a seed-and-extend approach. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HiFiAdapterFilt | Convert .bam to .fastq and remove reads with remnant PacBio adapter sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Inspector | A tool for evaluate long-read de novo assembly results. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Jabba | A hybrid error correction tool for sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KAD | KAD is designed for evaluating the accuracy of nucleotide base quality of genome assemblies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KAT | KAT (The K-mer Analysis Toolkit) is a suite of tools that generate, analyse and compare k-mer spectra produced from sequence files. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
KCOSS | A fast and space-saving multi-threaded k-mer frequency statistics algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
kmer-counter | A fast k-mer counter written in Rust. |
Genobioinfo Cluster: How to use |
komplexity | A command-line tool built in Rust to quickly calculate and/or mask low-complexity sequences from a FAST file. This uses the number of unique k-mers over a sequence divided by the length to assess complexity. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LongQC | LongQC is a tool for the data quality control of the PacBio and ONT long reads, and it has two functionalities: sample qc and platform qc. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LoRDEC | LoRDEC is a program to correct sequencing errors in long reads from 3rd generation sequencing with high error rate, and is especially intended for PacBio reads. It uses a hybrid strategy, meaning that it uses two sets of reads: the reference read set, whose error rate is assumed to be small, and the PacBio read set, which is then corrected using the reference set. Typically, the reference set contains Illumina reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
mapDamage | tracking and quantifying damage patterns in ancient DNA sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MECAT | MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Medaka | Medaka demonstrates a framework for error correcting sequencing data, particularly aimed at nanopore sequencing. Tools are provided for both training and inference. The code exploits the keras deep learning library. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MinIONQC | Fast and effective quality control for MinION and PromethION sequencing data | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MiniScrub | MiniScrub is a de novo long sequencing read preprocessing method that improves read quality by predicting and removing ("scrubbing") read segments that have a high concentration of errors. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MitoZ | MitoZ is a Python3-based toolkit which aims to automatically filter pair-end raw data (fastq files), assemble genome, search for mitogenome sequences from the genome assembly result, annotate mitogenome (genbank file as result), and mitogenome visualization. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Musket | Musket is a well-established leading next-generation sequencing read error correction algorithm targetting Illumina sequencing. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Nano-Q | Python script for conservatively cleaning ONT reads from bam files and estimate variant frequencies. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoComp | Compare multiple runs of long read sequencing data and alignments. |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: in Python-3.11.1 |
NanoFilt | Filtering and trimming of Oxford Nanopore sequencing data |
Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: Ask for Install |
NanoLyse | Remove reads mapping to the lambda phage genome from a fastq file | Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: How to use |
NanoStat | Create statistic summary of an Oxford Nanopore read dataset | Genologin Cluster: in Python-3.6.3 Genobioinfo Cluster: Ask for Install |
NextPolish | NextPolish is used to fix base errors (SNV/Indel) in the genome generated by noisy long reads, it can be used with short read data only or long read data only or a combination of both. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nQuire | A statistical framework for ploidy estimation using NGS short-read data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
OMArk | OMArk is a software for proteome (protein-coding gene repertoire) quality assessment. |
Genobioinfo Cluster: How to use |
Pacasus | Tool for detecting and cleaning PacBio / Nanopore long reads after whole genome amplification. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PECAT | PECAT is a phased error correction and assembly tool for long reads. It includes a haplotype-aware correction method and an efficient diploid assembly method. |
Genobioinfo Cluster: How to use |
Platanus_trim | Platanus_trim is a tool for trimming adaptor sequences and low quality regions. In contrast, Platanus_internal_trim is a tool for trimming internal adaptor sequence, adaptor sequences, and low quality regions. Platanus_trim is designed for paired-end library and Platanus_internal_trim is for mate-pair library. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Porechop | Porechop is a tool for finding and removing adapters from Oxford Nanopore reads. Adapters on the ends of reads are trimmed off, and when a read has an adapter in its middle, it is treated as chimeric and chopped into separate reads. Porechop performs thorough alignments to effectively find adapters, even at low sequence identity. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Porechop_ABI | Porechop_abi (ab initio) is an extension of Porechop that is able to infer the adapter sequence from the Oxford Nanopore reads. It discovers the adapter sequence from the reads using approximate k-mers and assembly, and add the sequence found to the adapter list (adapters.py file). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PRINSEQ | PRINSEQ is a tool that generates summary statistics of sequence and quality data and that is used to filter, reformat and trim next-generation sequence data. The standalone version is primarily designed for data preprocessing and does not generate summary statistics in graphical form. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pycoQC | pycoQC computes metrics and generates Interactive QC plots from the sequencing summary report generated by Oxford Nanopore technologies basecaller (Albacore/Guppy) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PyroCleaner | PyroCleaner is intended to clean reads coming from pyrosequencing in order to ease the assembly process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Qualimap | Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Quorum | QuorUM (Quality Optimized Reads from the University of Maryland) is an error corrector for Illumina reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
READv2 | Relationship Estimation from Ancient DNA version 2. |
Genobioinfo Cluster: How to use |
RSeQC | RSeQC package provides a number of useful modules that can comprehensively evaluate high throughput sequence data especially RNA-seq data |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sabre | A barcode demultiplexing and trimming tool for FastQ files. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
samblaster | samblaster is a fast and flexible program for marking duplicates in read-id grouped1 paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
schmutzi | Bayesian maximum a posteriori contamination estimate for ancient samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
seqclean | SeqClean is a tool for validation and trimming of DNA sequences from a flat file database (FASTA format). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
sequence-stats | A fast and beginner-friendly program to generate statistics from FASTQ and FASTA files (written AWK and Bash), e.g. genome assembly sizes and GC content (%). |
Genobioinfo Cluster: How to use |
sickle | Sickle is a tool that uses sliding windows along with quality and length thresholds to determine when quality is sufficiently low to trim the 3'-end of reads and also determines when the quality is sufficiently high enough to trim the 5'-end of reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SMRTLink | SMRT Link is the web-based end-to-end workflow manager for the Sequel™ System. (installed in mode command line on our cluster) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SnoReport | Computational identification of snoRNAs with unknown targets. Detecting novel or orphan snoRNAs in RNA sequence data using sequence and structure information only without relying on target information |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
sonic | Some Organism's Nucleotide Information Container. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sprai | Sprai (single-pass read accuracy improver) is a tool to correct sequencing errors in single-pass reads for de novo assembly. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
spruceup | Tools to discover, visualize, and remove outlier sequences in large multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Transrate | Transrate is software for de-novo transcriptome assembly quality analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trim Galore | A wrapper tool around Cutadapt and FastQC to consistently apply quality and adapter trimming to FastQ files, with some extra functionality for MspI-digested RRBS-type (Reduced Representation Bisufite-Seq) libraries. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Trimmomatic | Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and single ended data.The selection of trimming steps and their associated parameters are supplied on the command line. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
UMI-tools | Tools for handling Unique Molecular Identifiers in NGS data sets |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
unique-kmer-counts | This program calculates the number of distinct k-mers for each sequence record in a fasta file and divides it by the total number of k-mers in that record. |
Genobioinfo Cluster: How to use |
VeChat | Correcting errors in noisy long reads using variation graphs. |
Genobioinfo Cluster: How to use |
yacrd | Yet Another Chimeric Read Detector for long reads |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Yak | Yak is initially developed for two specific use cases: 1) to robustly estimate the base accuracy of CCS reads and assembly contigs, and 2) to investigate the systematic error rate of CCS reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
BayesAss3-SNPs | Modification of BayesAss 3.0.4 to allow handling of large SNP datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fineRADstructure | A complete, easy to use, and fast population inference package for RAD-seq data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
fragmatic | Simple program for in silico restriction digest of genomic sequences, to simulate RAD-family NGS library prep methods. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ipyrad | An interactive toolkit for assembly and analysis of restriction-site associated genomic data sets (e.g., RAD, ddRAD, GBS) for population genetic and phylogenetic studies. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RADIS | Analysis of RAD-seq data for InterSpecific phylogeny |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Stacks | Stacks is a software suite for analysing RAD Sequencing data by Julian Catchen at the University of Oregon. It will process raw Illumina RAD data or RAD data aligned to a reference genome, and produce genotypes that can be viewed and filtered via a web interface. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
CENSOR | CENSOR compares and masks protein or nucleotide sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cnD | cnD is a program to detect copy number variants from short-read sequence data. |
Genobioinfo Cluster: How to use |
EarlGrey | A fully automated TE curation and annotation pipeline. |
Genobioinfo Cluster: How to use |
GraffiTE | GraffiTE is a pipeline that finds polymorphic transposable elements in genome assemblies and/or long reads, and genotypes the discovered polymorphisms in read sets using genome-graphs. |
Genobioinfo Cluster: How to use |
ICEscreen | ICEscreen is a bioinformatic pipeline for the detection and annotation of ICEs (Integrative and Conjugative Elements) and IMEs (Integrative and Mobilizable Elements) in Bacillota genomes. |
Genobioinfo Cluster: How to use |
Look4TRs | A de-novo tool for detecting simple tandem repeats using self-supervised hidden Markov models. |
Genobioinfo Cluster: How to use |
LTR_FINDER_parallel | A parallel wrapper for LTR_FINDER (LTR_Finder is an efficient program for finding full-length LTR retrotranspsons in genome sequences.) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LtrDetector | A tool-suite for detecting long terminal repeat retrotransposons de-novo on the genomic scale. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MinCED | MinCED is a program to find Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) in full genomes or environmental datasets such as metagenomes, in which sequence size can be anywhere from 100 to 800 bp. MinCED runs from the command-line and was derived from CRT |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mobster | Mobster is used to detect novel (non-reference) Mobile Element Insertion (MEI) events in BAM files and uses both a discordant read pair method and a split-read method. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
mreps | Software for tandem repeat identification in DNA. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MSIsensor-pro | MSIsensor-pro is an updated version of msisensor. MSIsensor-pro evaluates Microsatellite Instability (MSI) for cancer patients with next generation sequencing data. |
Genobioinfo Cluster: How to use |
NSEG | NSEG is used to mask nucleic acid sequences, needed by RepeatScout. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ntHits | ntHits is a method for identifying reapeats in high-throughput DNA sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Onecodetofindthemall | One code to find them all is a set of perl scripts to extract useful information from RepeatMasker about transposable elements, retrieve their sequences and get some quantitative information. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
parseRM | Few scripts facilitating the extraction of info from Repeat Masker .out files | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PILER | Genomic repeat analysis software. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PILERCR | PILERCR is public domain software for finding CRISPR repeats. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RDXplorer | The RDXplorer (Read Depth eXplorer) is a computational tool for copy number variants (CNV) detection in whole human genome sequence data using read depth (RD) coverage. |
Genobioinfo Cluster: How to use |
RECON | A package for automated de novo identification of repeat families from genomic sequence. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepAHR | RepAHR is used to identify repeats(repetitive sequences) in genome using Next-Generation Sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REPdenovo | REPdenovo is designed for constructing repeats directly from sequence (paired-end) reads. It based on the idea of frequent k-mer assembly. REPdenovo provides many functionalities, and can generate much longer repeats than existing tools. Internally, REPdenovo uses Jellyfish for k-mer counting, Velvet for assembly, and bwa to map reads on the Transposable Elements. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RepeatExplorer | RepeatExplorer is a computational pipeline designed to identify and characterize repetitive DNA elements in next-generation sequencing data from plant and animal genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RepeatMasker | RepeatMasker is a program that screens DNA sequences for interspersed repeats (thanks to RepBase repeats databanks specially formatted) and low complexity DNA sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepeatModeler | RepeatModeler is a de-novo repeat family identification and modeling package. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepeatScout | RepeatScout is a tool to discover repetitive substrings in DNA. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepEnrich2 | RepEnrich2 is an updated method to estimate repetitive element enrichment using high-throughput sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
REPET | The REPET package (t Flutre et al, 2011 ) integrates bioinformatics programs in order to tackle biological issues at the genomic scale. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RetroSeq | RetroSeq is a bioinformatics tool that searches for mobile element insertions from aligned reads in a BAM file and a library of reference transposable elements. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RMBlast | RMBlast is a RepeatMasker compatible version of the standard NCBI BLAST suite. The primary difference between this distribution and the NCBI distribution is the addition of a new program "rmblastn" for use with RepeatMasker and RepeatModeler. RMBlast supports RepeatMasker searches by adding a few necessary features to the stock NCBI blastn program. These include: |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SEDEF | SEDEF is a quick tool to find all segmental duplications in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
spaTyper | Computational method for finding spa types. Staphylococcus aureus is a major human pathogen causing skin and tissue infections, pneumonia, septicemia, and device-associated infections. The emergence of strains resistant to methicillin (MRSA) and other antibacterial agents has become a major concern, especially in the hospital environment, because of the high mortality of the infections caused by these strains. Single locus DNA-sequencing of the repeat region of the Staphylococcus protein A gene (spa) can be used for reliable, accurate and discriminatory typing of MRSA. Repeats are assigned a numerical code and the spa-type is deduced from the order of specific repeats. However, spa-typing was hampered in the past by the lack of a consensus on assignments of new spa-repeats and -types. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SQuIRE | SQuIRE reveals locus-specific regulation of interspersed repeat expression, Nucleic Acids Research | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
T-lex | T-lex is a computational pipeline that detects presence and/or absence of annotated individual transposable elements (TEs) using next-generation sequencing (NGS) data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tandem Repeats Finder | Tandem Repeats Finder is a program to locate and display tandem repeats in DNA sequences. A tandem repeat in DNA is two or more adjacent, approximate copies of a pattern of nucleotides. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TEsorter | It is coded for LTR_retriever to classify long terminal repeat retrotransposons (LTR-RTs) at first. It can also be used to classify any other TE sequences, including Class I and Class II elements which are covered by the REXdb database. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
tidk | tidk is a toolkit to identify and visualise telomeric repeats for the Darwin Tree of Life genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Tn3+TA_finder | Tn3 Transposon/Toxin Finder (Tn3+TA_finder) is a program for the automatic prediction of transposable elements of the Tn3 family associated with type II toxin and antitoxin pairs in bacteria and archaea. |
Genobioinfo Cluster: How to use |
TnComp_finder | Composite Transposon Finder (TnComp_finder) is a program for the prediction of putative composite transposons in bacterial and archaeal genomes based on insertion sequence replicas in a relatively short span. |
Genobioinfo Cluster: How to use |
transposon_annotation_tools | A set of bioconda packages for transposon annotation and transposon feature annotation in nucleotide sequences. transposon_annotation_tools is part of TransposonUltimate. The package includes a series of transposable element discovery tools, such as: MUSTv2, HelitronScanner, SineFinder, MiteTracker, MiteFinderII, SineScan, TirVish, LtrHarvest, RepeatModeler, TransposonPSI, and TransposonProteinNCBICDD1000. You can then use these tools independently. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
transposon_classifier_rfsb | Transposon classification tool for nucleotide sequence classification, providing classification, model training and prediction evaluation. RFSB is part of TransposonUltimate. |
Genobioinfo Cluster: How to use |
TRASH | Tandem Repeat Annotation and Structural Hierarchy: a package to identify and extract tandem repeats in genome sequences and investigate their higher order structures. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
RiboTaper | RiboTaper is a new analysis pipeline for Ribosome Profiling (Ribo-seq) experiments, which exploits the triplet periodicity of ribosomal footprints to call translated regions. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Application | Description | Availability/Use |
---|---|---|
CellRanger ARC | Cell Ranger ARC's pipelines analyze sequencing data produced from Chromium Single Cell Multiome ATAC + Gene Expression. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SEACells | Single-cEll Aggregation for High Resolution Cell States. SEACells algorithm for Inference of transcriptional and epigenomic cellular states from single-cell genomics data. |
Genobioinfo Cluster: How to use |
souporcell | souporcell is a method for clustering mixed-genotype scRNAseq experiments by individual. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
3rdChimeraMiner | Exploration of whole genome amplification generated chimeric sequences in long-read sequencing data. | Genobioinfo Cluster: How to use |
AC-DIAMOND | AC-DIAMOND attempts to speed up DIAMOND via better SIMD parallelization and compressed indexing. Experimental results show that AC-DIAMOND was about 6~7 times faster than DIAMOND on aligning DNA reads or contigs while retaining the essentially the similar sensitivity. AC-DIAMOND was developped based on DIAMOND v0.7.9. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Accel-align | Accel-align is a fast alignment tool implemented in C++ programming language. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ALFATClust | ALignment-Free Adaptive Threshold Clustering:Biological sequence clustering tool with dynamic threshold for individual clusters. Suitable for clustering multiple groups of homologous sequences. | Genobioinfo Cluster: How to use |
AMBER | Analyzing Mapping Biases and Evaluating Read Reliability. |
Genobioinfo Cluster: How to use |
AnchorWave | AnchorWave (Anchored Wavefront Alignment) identifies collinear regions via conserved anchors (full-length CDS and full-length exon have been implemented currently) and breaks collinear regions into shorter fragments, i.e., anchor and inter-anchor intervals. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
AvP | AvP performs automatic detection of HGT candidates within a phylogenetic framework. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Back_to_sequences | Given a set of kmers (fasta / fastq <.gz> format) and a set of sequences (fasta / fastq <.gz> format), this tool will extract the sequences containing some of those kmers. |
Genobioinfo Cluster: How to use |
bam2plot | Make coverage plots from bam files. |
Genobioinfo Cluster: How to use |
Bamstats (notsame as BAMstats) | Bamstats is a command line tool written in Go for computing mapping statistics from a BAM file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BBMap | a short read aligner, as well as various other bioinformatic tools. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BCOOL | BCOOL is a read corrector for NGS sequencing data that align reads on a de Bruijn graph. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BELLA | A computationally-efficient and highly-accurate long-read to long-read aligner and overlapper. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
biohazard-tools | This is a collection of command line utilities that do useful stuff involving BAM files for Next Generation Sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Blasr | Reference-based alignment | Genologin Cluster: How to use ( See SMRTLink) Genobioinfo Cluster: Ask for Install |
blat | The BLAST-Like Alignment Tool: similarity search in databanks. BLAT on DNA is designed to quickly find sequences of 95% and greater similarity of length 25 bases or more. BLAT on proteins finds sequences of 80% and greater similarity of length 20 amino acids or more. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BOLDigger | Python program to query .fasta files against the different databases of www.boldsystems.org |
SLURM Cluster: How to use Genobioinfo Cluster: How to use |
Bowtie | Bowtie is an ultrafast, memory-efficient short read aligner. It aligns short DNA sequences (reads) to the human genome at a rate of over 25 million 35-bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Bowtie2 | Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes. Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BPP | Bayesian analysis of genomic sequence data under the multispecies coalescent model. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BS-Seeker2- | BS-Seeker2 is a seamless and versatile pipeline for accurately and fast mapping the bisulfite-treated reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BSMAP | BSMAP is a short reads mapping software for bisulfite sequencing reads. Bisulfite treatment converts unmethylated Cytosines into Uracils (sequenced as Thymine) and leave methylated Cytosines unchanged, hence provides a way to study DNA cytosine methylation at single nucleotide resolution. BSMAP aligns the Ts in the reads to both Cs and Ts in the reference | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
bwa | Burrows-Wheeler Aligner (BWA) is an efficient program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome. It implements two algorithms, bwa-short and BWA-SW. The former works for query sequences shorter than 200bp and the latter for longer sequences up to around 100kbp. Both algorithms do gapped alignment. They are usually more accurate and faster on queries with low error rates. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bwa-mem2 | Bwa-mem2 is the next version of the bwa-mem algorithm in bwa. It produces alignment identical to bwa and is ~80% faster. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
bwa-meth | Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cactus | Cactus is a reference-free whole-genome alignment program, as well as a pagenome graph construction toolkit. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
chromeister | A dotplot generator for large chromosomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Clustal Omega | Clustal Omega is the latest addition to the Clustal family. It offers a significant increase in scalability over previous versions, allowing hundreds of thousands of sequences to be aligned in only a few hours. It will also make use of multiple processors, where present. In addition, the quality of alignments is superior to previous versions, as measured by a range of popular benchmarks | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ClustalW | Multiple sequence alignment program for DNA or proteins. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Conterminator | Conterminator is an efficient method for detecting incorrectly labeled sequences across kingdoms by an exhaustive all-against-all sequence comparison. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DALIGNER | The commands below permit one to find all significant local alignments between reads encoded in Dazzler database. The assumption is that the reads are from a PACBIO RS IIlong read sequencer. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
DIAMOND | Accelerated BLAST compatible local sequence aligner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ecoPrimers | ecoPrimer is a barcoding software which is written in C language. It finds universal primers from a set of input DNA sequences by finding conserved regions without "a priori" on candidate sequences. It also evaluates the quality of the primers and barcode regions by measuring the "barcode specificity" and "barcode coverage" indices | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
eggNog-mapper | eggnog-mapper is a tool for fast functional annotation of novel sequences. It uses precomputed orthologous groups and phylogenies from the eggNOG database to transfer functional information from fine-grained orthologs only. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EMA | EMA uses a latent variable model to align barcoded short-reads (such as those produced by 10x Genomics' sequencing platform). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
EMBOSS | EMBOSS is "The European Molecular Biology Open Software Suite". EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Exonerate | A generic tool for sequence alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FAMSA | Algorithm for large-scale multiple sequence alignments (400k proteins in 2 hours and 8BG of RAM) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FASTA | FASTA is a sequence similarity search tool which uses heuristics for fast local alignment searching. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
FastANI | FastANI is developed for fast alignment-free computation of whole-genome Average Nucleotide Identity (ANI) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fixchr | This package selects homologous chromosomes between two genomes by comparing whole-genome alignments between them. Additionally, it generates dotplots for quick checking of the output. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
fpa | Filter Pairwise Alignment | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GEM-library | A set of very optimized tools for indexing/querying huge genomes/files. |
Genobioinfo Cluster: How to use |
GeneSeqer | Sensitive spliced-alignment of cDNAs or proteins. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Genewise | Wise2 is a package focused on comparisons of biopolymers, commonly DNA sequence and protein sequence. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomOrder | GenomOrder is a Nextflow pipeline reordering and renaming scaffolds from up to 5 assemblies using a reference. It is also able to produce D-Genies back-up files allowing rapid visual comparison of chromosomes of the assemblies versus the reference. These files can be uploaded and visualized with the online tool D-Genies : http://dgenies.toulouse.inra.fr/ The assembly mapping versus the reference is performed with minimap2. These assemblies can be scaffolded or not. If they are not, an option enables to scaffold them according to the reference. The pipeline produces D-Genies back-up file for a user defined list of reference chromosomes. The chromosome file contains one reference chromosome name per line. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GMAP-GSNAP | GMAP: A Genomic Mapping and Alignment Program for mRNA and EST SequencesGSNAP: Genomic Short-read Nucleotide Alignment Program |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Goalign | Goalign is a set of command line tools to manipulate multiple alignments. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphAligner | Seed-and-extend program for aligning long error-prone reads to genome graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GraphMap | A highly sensitive and accurate mapper for long, error-prone reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
GSAlign | An ultra-fast sequence alignment algorithm for intra-species genome comparison. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HarvestTools | HarvestTools is a utility for creating and interfacing with Gingr files, which are efficient archives that the Harvest Suite uses to store reference-compressed multi-alignments, phylogenetic trees, filtered variants and annotations. |
Genobioinfo Cluster: How to use |
HISAT2 | HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes (as well as to a single reference genome). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
i-ADHoRe | i-ADHoRe is a highly sensitive software tool to detect degenerated homology relations within and between different genomes. |
Genologin Cluster: How to use <Genobioinfo Cluster: How to use |
JustOrthologs | A Fast, Accurate, and User-Friendly Ortholog-Finding Algorithm |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KMA | KMA is a mapping method designed to map raw reads directly against redundant databases, in an ultra-fast manner using seed and extend. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
LAST | LAST finds similar regions between sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
lastp_aai | A simple Python script for calculating pairwise amino acid identity (AAI) between protein files (extension .faa) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LASTZ | A tool for aligning two DNA sequences, and inferring appropriate scoring parameters automatically. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
leeHom | A program for the Bayesian reconstruction of ancient DNA. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MACSE | Multiple Alignment of Coding SEquences Accounting for Frameshifts and Stop Codons: a wide range of molecular analyses relies on multiple sequence alignments (MSA). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MAFFT | MAFFT is a multiple sequence alignment program for unix-like operating systems. It offers a range of multiple alignment methods, L-INS-i (accurate; for alignment of <?200 sequences), FFT-NS-2 (fast; for alignment of <?10,000 sequences), etc. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Magic-BLAST | Magic-BLAST is a tool for mapping large next-generation RNA or DNA sequencing runs against a whole genome or transcriptome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MALT | MALT (MEGAN alignment tool) is an extension of MEGAN (metagenome analyzer). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MapSplice | Accurate mapping of RNA-seq reads for splice junction discovery. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MashMap | MashMap implements a fast and approximate algorithm for computing local alignment boundaries between long DNA sequences. It can be useful for mapping genome assembly or long reads (PacBio/ONT) to reference genome(s). Given a minimum alignment length and an identity threshold for the desired local alignments, Mashmap computes alignment boundaries and identity estimates using k-mers. It does not compute the alignments explicitly, but rather estimates a k-mer based Jaccard similarity using a combination of Minimizers and MinHash. This is then converted to an estimate of sequence identity using the Mash distance. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MCScanX | MCScan is an algorithm to scan multiple genomes or subgenomes to identify putative homologous chromosomal regions, then align these regions using genes as anchors. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MECAT | MECAT is an ultra-fast Mapping, Error Correction and de novo Assembly Tools for single molecula sequencing (SMRT) reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MetaMaps | MetaMaps is tool specifically developed for the analysis of long-read (PacBio/ONT) metagenomic datasets. It simultaenously carries out read assignment and sample composition estimation. It is faster than classical exact alignment-based approaches, and its output is more information-rich than that of kmer-spectra-based methods. For example, each MetaMaps alignment comes with an approximate alignment location, an estimated alignment identity and a mapping quality. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
meteor | Meteor (Metagenomic Explorator), a software for profiling metagenomic data at gene level. |
Genobioinfo Cluster: How to use |
Minigraph | Minigraph is a sequence-to-graph mapper and graph constructor. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Minimap | Experimental tool to find approximate mapping positions between long sequences |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Miniprot | Aligning proteins to genomes with splicing and frameshift. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MixMHCpred | A pan-Allele predictor designed for predicting peptide bindings to MHC alleles and can also perform sequence alignment and binding motif plotting. |
Genobioinfo Cluster: How to use |
MMseqs2 | MMseqs2 (Many-against-Many sequence searching) is a software suite to search and cluster huge proteins/nucleotide sequence sets. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MOSAIK | MOSAIK is a reference-guided assembler comprising of two main modular programs |
Genologin Cluster How to use Genobioinfo Cluster: Ask for Install |
Mugsy | Mugsy is a multiple whole genome aligner. Mugsy uses Nucmer for pairwise alignment, a custom graph based segmentation procedure for identifying collinear regions, and the segment-based progressive multiple alignment strategy from Seqan::TCoffee. Mugsy accepts draft genomes in the form of multi-FASTA files and does not require a reference genome. Angiuoli SV and Salzberg SL. Mugsy: Fast multiple alignment of closely related whole genomes. Bioinformatics 2011 27(3):334-4 | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MultAlin | Multiple sequence alignment with hierarchical clustering. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
MUMmer | MUMmer is a package for rapidly aligning entire genomes, whether in complete or draft form. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MUSCLE | Multiple sequence alignment (nucleic or proteic). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_Blast | Similarity search against databanks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NCBI_Blast+ | Similarity search against databanks. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
NGMLR | NGMLR is a long-read mapper designed to align PacBio or Oxford Nanopore (standard and ultra-long) to a reference genome with a focus on reads that span structural variations SV detection from paired end reads mapping |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PALEOMIX | The PALEOMIX pipeline is a set of free and open-source pipelines and tools designed to enable the rapid processing of Next Generation Sequencing (NGS) data, starting from de-multiplexed reads from one or more samples, through sequence processing and alignment, and ending with genotyping, phylogenetic inference on the samples, as well as metagenomic analysis of the samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Paragraph | Graph realignment tools for structural variants. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Parsnp | Parsnp is a command-line-tool for efficient microbial core genome alignment and SNP detection. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PASTA | PASTA estimates alignments and ML trees from unaligned sequences using an iterative approach. In each iteration, it first estimates a multiple sequence alignment using the current tree as a guide and then estimates an ML tree on (a masked version of) the alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pblat | Parallelized blat with multi-threads support. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
picard-tools | Picard comprises Java-based command-line utilities that manipulate SAM files, and a Java API (SAM-JDK) for creating new programs that read and write SAM files. Both SAM text format and SAM binary (BAM) format are supported. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PLAST | PLAST is a fast, accurate and NGS scalable bank-to-bank sequence similarity search tool providing significant accelerations of seeds-based heuristic comparison methods, such as the Blast suite of algorithms. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PRANK | PRANK is a probabilistic multiple alignment program for DNA, codon and amino-acid sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PROBCONSRNA | PROBCONS is a tool for generating multiple alignments of protein sequences. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ProgressiveCactus | Progressive Cactus is a whole-genome alignment package. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ProtHint | ProtHint is a pipeline for predicting and scoring hints (in the form of introns, start and stop codons) in the genome of interest by mapping and spliced aligning predicted genes to a database of reference protein sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Puffaligner | Puffaligner is a fast, sensitive and accurate aligner built on top of the Pufferfish index. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
pysam | Pysam is a python module for reading and manipulating Samfiles. It's a lightweight wrapper of the samtools C-API. | Genologin Cluster: In Python module Genobioinfo Cluster: Ask for Install |
qgrs-cpp | C++ implementation of QGRS mapping algorithm (QGRS Mapper is a software program that generates information on composition and distribution of putative Quadruplex forming G-Rich Sequences (QGRS) in nucleotide sequences.) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Qualimap | Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
quant3p | A set of scripts for 3' RNA-seq quantification. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
read2tree | read2tree is a software tool that allows to obtain alignment matrices for tree inference. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ROSE | To create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Salmon | Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using lightweight alignments |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
samblaster | samblaster is a fast and flexible program for marking duplicates in read-id grouped1 paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
samclip | Filter SAM file for soft and hard clipped alignments | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
samtools | SAM (Sequence Alignment/Map). SAM Tools provide various utilities for manipulating alignments in the SAM format, including sorting, merging, indexing and generating alignments in a per-position format. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Satsuma | Highly sensitive whole-genome synteny alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SECAPR | Used to process targeted sequencing (or Gene capture) data by applying assembly and subsequent mapping algorithms (reducing paralogs unlike Hybpiper). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
segemehl | segemehl is a software to map short sequencer reads to reference genomes. segemehl implements a matching strategy based on enhanced suffix arrays (ESA). |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
seqwish | seqwish implements a lossless conversion from pairwise alignments between sequences to a variation graph encoding the sequences and their alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
shiver | shiver is a tool for mapping paired-end short reads to a custom reference sequence constructed using do novo assembled contigs, in order to minimise the biased loss of information that occurs from mapping to a reference that differs from the sample. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SHRiMP | SHRiMP is a software package for aligning genomic reads against a target genome. It was primarily developed with the multitudinous short reads of next generation sequencing machines in mind, as well as Applied Biosystem's colourspace genomic representation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SibeliaZ | SibeliaZ is a whole-genome alignment and locally-coliinear blocks construction pipeline. The blocks coordinates are output in GFF format and the alignment is in MAF. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Silix | The software package SiLiX implements an ultra-efficient algorithm for the clustering of homologous sequences, based on single transitive links (single linkage) with alignment coverage constraints. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SLR | SLR is a program to detect sites in coding DNA that are unusually conserved and/or unusually variable (that is, evolving under purify or positive selection) by analysing the pattern of changes for an alignment of sequences on an evolutionary tree. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SMALT | SMALT aligns DNA sequencing reads with a reference genome. Reads from a wide range of sequencing platforms can be processed, for example Illumina, Roche-454, Ion Torrent, PacBio or ABI-Sanger. Paired reads are supported. There is no support for SOLiD reads. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Spaln | Spaln (space-efficient spliced alignment) is a stand-alone program that maps and aligns a set of cDNA or protein sequences onto a whole genomic sequence in a single job. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SpeedSeq | A flexible framework for rapid genome analysis and interpretation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Spoa | A multiple sequence alignment tool/library that implements the POA (partial order alignement) algorithm using SIMD. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
spruceup | Tools to discover, visualize, and remove outlier sequences in large multiple sequence alignments. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SqueezeMeta | A fully automated metagenomics pipeline, from reads to bins. |
Genobioinfo Cluster: How to use |
srnaMapper | This tool maps reads produced by sRNA-Seq to a genome. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
STAR | RNA-seq aligner |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
StrobeAlign | Aligns short reads using dynamic seed size with strobemers. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Subread | A tool kit for processing next-gen sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Sumaclust | Fast and exact clustering of sequences. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
T-Coffee | T-Coffee is a multiple sequence alignment package. You can use T-Coffee to align sequences or to combine the output of your favorite alignment methods (Clustal, Mafft, Probcons, Muscle...) into one unique alignmen. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Tophat | TopHat is a fast splice junction mapper for RNA-Seq reads. It aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Tracy | Tracy is an efficient and versatile command-line application to basecall, align, assemble and deconvolute Sanger Chromatogram trace files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
trimAl | trimAl: a tool for automated alignment trimmin | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
uLTRA | uLTRA is a tool for splice alignment of long transcriptomic reads to a genome, guided by a database of exon annotations. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
USEARCH | USEARCH is a unique sequence analysis tool with thousands of users world-wide. USEARCH offers search and clustering algorithms that are often orders of magnitude faster than BLAST. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
vcf2maf | Convert a VCF into a MAF, where each variant is annotated to only one of all possible gene isoforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vg | Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VSEARCH | Versatile open-source tool for metagenomics |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
WFA | The wavefront alignment (WFA) algorithm is an exact gap-affine algorithm that takes advantage of homologous regions between the sequences to accelerate the alignment process. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wfmash | wfmash is an aligner for pangenomes based on sparse homology mapping and wavefront inception. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
WGDI | WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidizations and cross-species genome alignments. | Genobioinfo Cluster: How to use |
Winnowmap | Winnowmap is a long-read mapping algorithm, and a result of our exploration into superior minimizer sampling techniques. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
wu-blast | Similarity search against databanks, Washington University Blast.(OBSOLETE) |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
CellBender | CellBender is a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SAMap | Mapping single-cell RNA sequencing datasets from evolutionarily distant organisms. |
Genobioinfo Cluster: How to use |
Scanpy | Scanpy is a scalable toolkit for analyzing single-cell gene expression data built jointly with anndata. It includes preprocessing, visualization, clustering, trajectory inference and differential expression testing. The Python-based implementation efficiently deals with datasets of more than one million cells. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
ACACIA | Allele CAlling proCedure for Illumina Amplicon sequencing data: This workflow aims at extracting allele information out of paired-end Illumina FASTQC files. |
Genobioinfo Cluster: How to use |
adegenet | R package dedicated to the exploratory analysis of genetic data. It implements a set of tools ranging from multivariate methods to spatial genetics and genome-wise SNP data analysis | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ALFATClust | ALignment-Free Adaptive Threshold Clustering:Biological sequence clustering tool with dynamic threshold for individual clusters. Suitable for clustering multiple groups of homologous sequences. | Genobioinfo Cluster: How to use |
AlleleSeq | pipeline which constructs a diploid personal genome from genomic sequence variants of a family trio, including SNPs, indels and structural variants and maps functional genomic data onto this personal genome. | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
ANGSD | ANGSD is a software for analyzing next generation sequencing data. The software can handle a number of different input types from mapped reads to imputed genotype probabilities. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
AnnotSV | An integrated tool for Structural Variations annotation and ranking. |
Genobioinfo Cluster: How to use |
ANNOVAR | ANNOVAR is an efficient software tool to utilize update-to-date information to functionally annotate genetic variants detected from diverse genomes (including human genome hg18, hg19, hg38, as well as mouse, worm, fly, yeast and many others) | Genologin Cluster: Ask for Install Genobioinfo Cluster: Ask for Install |
Aquila | Diploid personal genome assembly and comprehensive variant detection based on linked-reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Assemblytics | Assemblytics is a bioinformatics tool to detect and analyze structural variants from a genome assembly by comparing it to a reference genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ATLAS | ATLAS stands for Analysis Tools for Low-coverage and Ancient Samples. These tools cover all programs necessary to obtain variant calls, estimates of heterozygosity and more from a BAM file. There are sequence data processing tools, diagnostic tools, and variant discovery tools, similar to GATK by the Broad Institute. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BayesAss3-SNPs | Modification of BayesAss 3.0.4 to allow handling of large SNP datasets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BCFtools | utilities for variant calling and manipulating VCFs and BCFs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Beagle | BEAGLE is a state of the art software package for analysis of large-scale genetic data sets with hundreds of thousands of markers genotyped on thousands of samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
BISER | Genobioinfo Cluster: How to use | |
BisSNP | Accurate combined SNP/Methylation calling. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
BreakDancer | SV detection from paired end reads mapping. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
breseq | breseq is a computational pipeline for finding mutations relative to a reference sequence in short-read DNA re-sequencing data for haploid microbial-sized genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
cgMLSTFinder | Core genome Multi-Locus Sequence Typing cgMLSTFinder runs KMA <1> against a chosen core genome MLST (cgMLST) database and outputs the detected alleles in a matrix file. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Clair3 | Clair3 is a germline small variant caller for long-reads. Clair3 makes the best of two major method categories: pileup calling handles most variant candidates with speed, and full-alignment tackles complicated candidates to maximize precision and recall. Clair3 runs fast and has superior performance, especially at lower coverage. Clair3 is simple and modular for easy deployment and integration. |
Genobioinfo Cluster: How to use |
ClinSV | Genobioinfo Cluster: How to use | |
cnD | cnD is a program to detect copy number variants from short-read sequence data. |
Genobioinfo Cluster: How to use |
CNVkit | A command-line toolkit and Python library for detecting copy number variants and alterations genome-wide from high-throughput sequencing. CNVkit is a Python library and command-line software toolkit to infer and visualize copy number from high-throughput DNA sequencing data. It is designed for use with hybrid capture, including both whole-exome and custom target panels, and short-read sequencing platforms such as Illumina and Ion Torrent. Read the full documentation at: http://cnvkit.readthedocs.io |
Genobioinfo Cluster: How to use |
CNVnator | A tool for CNV discovery and genotyping from depth of read mapping. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Cnvpipelines | A pipeline to detect copy number variations (CNV) on several samples. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Consensify | Consensify is a method for generating a consensus pseudohaploid genome sequence with greatly reduced error rates compared to standard pseudohaploidisation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CRISP | CRISP is a software program to detect SNPs and short indels from pooled sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
cuteSV | Long read based human genomic structural variation detection with cuteSV. |
Genobioinfo Cluster: How to use |
DeepVariant | DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data. |
Genobioinfo Cluster: How to use |
Delly | DELLY is an integrated structural variant prediction method that can detect deletions, tandem duplications, inversions and translocations at single-nucleotide resolution in short-read massively parallel sequencing data. It uses paired-ends and split-reads to sensitively and accurately delineate genomic rearrangements throughout the genome. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
detettore | A program to detect transposable element polymorphisms | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Discovar | Assemble genomes and find variants with DISCOVAR & DISCOVAR de novo | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Dysgu | dysgu-SV is a collection of tools for calling structural variants using short or long reads. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
elPrep | elPrep is a high-performance tool for analyzing .sam/.bam files (up to and including variant calling) in sequencing pipelines. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Ensembl-VEP | VEP determines the effect of your variants (SNPs, insertions, deletions, CNVs or structural variants) on genes, transcripts, and protein sequence, as well as regulatory regions. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
EVE | EVE is a set of protein-specific models providing for any single amino acid mutation of interest a score reflecting the propensity of the resulting protein to be pathogenic. |
Genobioinfo Cluster: How to use |
FreeBayes | FreeBayes is a Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs (single-nucleotide polymorphisms), indels (insertions and deletions), MNPs (multi-nucleotide polymorphisms), and complex events (composite insertion and substitution events) smaller than the length of a short-read sequencing alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GATK | The GATK is a structured software library that makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
gemBS | gemBS is a high performance bioinformatic pipeline designed for highthroughput analysis of DNA methylation data from whole genome bisulfites sequencing data (WGBS). It combines GEM3, a high performance read aligner and bs_call, a high performance variant and methyation caller, into a streamlined and efficient pipeline for bisulfite sequence analysis. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GenomeSTRiP | Genome STRiP (Genome STRucture In Populations) is a suite of tools for discovering and genotyping structural variations using sequencing data. The methods are designed to detect shared variation using data from multiple individuals. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
gIMble | A genome-wide IM blockwise likelihood estimation toolkit | Genologin Cluster: Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
graphtyper | graphtyper is a graph-based variant caller capable of genotyping population-scale short read data sets. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
GRIDSS | GRIDSS is a module software suite containing tools useful for the detection of genomic rearrangements. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Gtools | GTOOL is a program for transforming sets of genotype data for use with the programs SNPTEST and IMPUTE. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
HapCUT2 | Software tools for haplotype assembly from sequence data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HarvestTools | HarvestTools is a utility for creating and interfacing with Gingr files, which are efficient archives that the Harvest Suite uses to store reference-compressed multi-alignments, phylogenetic trees, filtered variants and annotations. |
Genobioinfo Cluster: How to use |
HGTector | Genome-wide prediction of horizontal gene transfer based on distribution of sequence homology patterns. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Iris | A module which corrects the sequences of structural variant calls (currently only insertions). |
Genobioinfo Cluster: How to use |
IRMA | IRMA was designed for the robust assembly, variant calling, and phasing of highly variable RNA viruses. Currently IRMA is deployed with modules for influenza and ebolavirus. IRMA is free to use and parallelizes computations for both cluster computing and single computer multi-core setups. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
iVar | Var is a computational package that contains functions broadly useful for viral amplicon-based sequencing. Additional tools for metagenomic sequencing are actively being incorporated into iVar. While each of these functions can be accomplished using existing tools, iVar contains an intersection of functionality from multiple tools that are required to call iSNVs and consensus sequences from viral sequencing data across multiple replicates. We implemented the following functions in iVar: (1) trimming of primers and low-quality bases, (2) consensus calling, (3) variant calling - both iSNVs and insertions/deletions, and (4) identifying mismatches to primer sequences and excluding the corresponding reads from alignment files. |
Genobioinfo Cluster: How to use |
Jasmine | JASMINE: Jointly Accurate Sv Merging with Intersample Network Edges. This tool is used to merge structural variants (SVs) across samples. Each sample has a number of SV calls, consisting of position information (chromosome, start, end, length), type and strand information, and a number of other values. Jasmine represents the set of all SVs across samples as a network, and uses a modified minimum spanning forest algorithm to determine the best way of merging the variants such that each merged variants represents a set of analogous variants occurring in different samples. Manual : Jasmine User Manual · mkirsche/Jasmine Wiki · GitHub Jasmine also includes a module for automating the creation of IGV screenshots of variants of interest. |
Genobioinfo Cluster: How to use |
Jvarkit | Java utilities for Bioinformatics (only requested tools are compiling) | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
KING | KING is a toolset that makes use of high-throughput SNP data typically seen in a genome-wide association study (GWAS) or a sequencing project. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
kSNP4 | kSNP4 identifies the pan-genome SNPs in a set of genome sequences, and estimates phylogenetic trees based upon those SNPs. |
Genobioinfo Cluster: How to use |
LoFreq | A sequence-quality aware, ultra-sensitive variant caller for NGS data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LongRanger | Long Ranger is a set of analysis pipelines that processes Chromium sequencing output to align reads and call and phase SNPs, indels, and structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
longshot | Longshot is a variant calling tool for diploid genomes using long error prone reads such as Pacific Biosciences (PacBio) SMRT and Oxford Nanopore Technologies (ONT). | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
LUMPY | A general probabilistic framework for structural variant discovery | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Manta | Manta calls structural variants (SVs) and indels from mapped paired-end sequencing reads. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MapThin | Reduce the number of SNPs in a gene marker dense map computed by PLINK. First, by eliminating linked SNPs. Then, by applying different criteria. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Merfin | Evaluate variant calls and its combination with k-mer multiplicity. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Mobster | Mobster is used to detect novel (non-reference) Mobile Element Insertion (MEI) events in BAM files and uses both a discordant read pair method and a split-read method. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
NanoCaller | NanoCaller is a computational method that integrates long reads in deep convolutional neural network for the detection of SNPs/indels from long-read sequencing data. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PALEOMIX | The PALEOMIX pipeline is a set of free and open-source pipelines and tools designed to enable the rapid processing of Next Generation Sequencing (NGS) data, starting from de-multiplexed reads from one or more samples, through sequence processing and alignment, and ending with genotyping, phylogenetic inference on the samples, as well as metagenomic analysis of the samples. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PanGenie | A short-read genotyper for various types of genetic variants (such as SNPs, indels and structural variants) represented in a pangenome graph. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Paragraph | Graph realignment tools for structural variants. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Parsnp | Parsnp is a command-line-tool for efficient microbial core genome alignment and SNP detection. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PCAngsd | Framework for analyzing low depth next-generation sequencing (NGS) data in heterogeneous populations using principal component analysis (PCA). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
pggb | Pangenome graph builder. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pilon | Pilon is an automated genome assembly improvement and variant detection tool. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Pindel | Pindel can detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
ploidyNGS | A model-free, open source tool to visualize and explore ploidy levels in a newly sequenced genome, exploiting short read data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
popins | Population-scale detection of novel-sequence insertions. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
PopIns2 | Population-scale detection of non-reference sequence variants using colored de Bruijn Graphs. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
RAiSD | RAiSD (Raised Accuracy in Sweep Detection) is a stand-alone software implementation of the μ statistic for selective sweep detection. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RDXplorer | The RDXplorer (Read Depth eXplorer) is a computational tool for copy number variants (CNV) detection in whole human genome sequence data using read depth (RD) coverage. |
Genobioinfo Cluster: How to use |
REFMAKER | REFMAKER is a command-line and user-friendly pipeline providing different tools to create nuclear references from genomic assemblies of shotgun libraries. |
Genobioinfo Cluster: How to use |
RegTools | RegTools is a set of tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
ROHan | ROHan is a Bayesian framework to estimate local rates of heterozygosity, infer runs of homozygosity (ROH) and compute global rates of heterozygosity. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SEDEF | SEDEF is a quick tool to find all segmental duplications in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SequenceTools | Tools for population genetics on sequencing datas |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SHAPEIT | SHAPEIT is a fast and accurate method for estimation of haplotypes (aka phasing) from genotype or sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G | Sorting Intolerant From Tolerant For Genomes. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SIFT4G_Annotator | Annotating VCF files using the SIFT4G databases. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
snape-pooled | SNAPE-pooled computes the probability distribution for the frequency of the minor allele in a certain population, at a certain position in the genome. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Sniffles | A fast structural variant caller for long-read sequencing, Sniffles2 accurately detect SVs on germline, somatic and population-level for PacBio and Oxford Nanopore read data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Snippy | Rapid haploid variant calling and core genome alignment. Snippy finds SNPs between a haploid reference genome and your NGS sequence reads. It will find both substitutions (snps) and insertions/deletions (indels). It will use as many CPUs as you can give it on a single computer (tested to 64 cores). It is designed with speed in mind, and produces a consistent set of output files in a single folder. It can then take a set of Snippy results using the same reference and generate a core SNP alignment (and ultimately a phylogenomic tree). |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SNP-sites | Rapidly extracts SNPs from a multi-FASTA alignment. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
snpArcher | snpArcher is a reproducible workflow optimized for nonmodel organisms and comparisons across datasets, built on the Snakemake workflow management system, for dataset acquisition, variant calling, quality control, and downstream analysis. |
Genobioinfo Cluster: How to use |
SnpEff | SnpEff is a variant annotation and effect prediction tool. ttttIt annotates and predicts the effects of variants on genes (such as amino acid changes) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SNPGenie | SNPGenie is a collection of Perl scripts for estimating πN/πS, dN/dS, and gene diversity from next-generation sequencing (NGS) single-nucleotide polymorphism (SNP) variant data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SNPhylo | a pipeline to generate a phylogenetic tree from huge SNP data | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SNPsplit | SNPsplit is an allele-specific alignment sorter which is designed to read alignment files in SAM/BAM format and determine the allelic origin of reads that cover known SNP positions. |
Genobioinfo Cluster: How to use |
SpeedSeq | A flexible framework for rapid genome analysis and interpretation. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
STAR-Fusion | STAR-Fusion further processes the output generated by the STAR aligner to map junction reads and spanning reads to a reference annotation set (using a GTF file, ideally the same annotation file used during the STAR genome index building process during the intial STAR setup). |
Genobioinfo Cluster: How to use |
Strelka | Strelka2 is a fast and accurate small variant caller optimized for analysis of germline variation in small cohorts and somatic variation in tumor/normal sample pairs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Subread | A tool kit for processing next-gen sequencing data |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SURVIVOR | SURVIVOR is a tool set for simulating/evaluating SVs, merging and comparing SVs within and among samples, and includes various methods to reformat or summarize SVs. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SvABA | Structural variation and indel detection by local assembly |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SVanalyzer | Tools for the analysis of structural variation in genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVDetect | A tool to detect genomic structural variations from paired-end and mate-pair sequencing data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVIM | SVIM is a structural variant caller for long reads. It is able to detect, classify and genotype five different classes of structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
SVIM-asm | SVIM-asm (pronounced SWIM-assem) is a structural variant caller for haploid or diploid genome-genome alignments. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
svimmer | Merges similar SVs from multiple single sample VCF files. The tool was written for merging SVs discovered using Manta calls, but should support (almost) any SV VCFs. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi | SVJedi is a structural variation (SV) genotyper for long read data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SVJedi-graph | SVJedi-graph is a structural variation (SV) genotyper for long read data. |
Genobioinfo Cluster: How to use |
SVMerge | A pipeline to detect structural variants (SVs) by integrating calls from several existing SV callers, which are then validated and the breakpoints refined using local de novo assembly. |
Genobioinfo Cluster: How to use |
svtyper | Bayesian genotyper for structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
SyRI | SyRI is a comprehensive tool for predicting genomic differences between related genomes using whole-genome assemblies (WGA). | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
TARDIS | Toolkit for automated and rapid discovery of structural variants. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
TEnest | TEnest is a tool for finding and annotating transposable element (TE) insertions. |
Genobioinfo Cluster: How to use |
Tracy | Tracy is an efficient and versatile command-line application to basecall, align, assemble and deconvolute Sanger Chromatogram trace files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Truvari | Structural variant comparison tool for VCFs |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VarDict | VarDict is an ultra sensitive variant caller for both single and paired sample variant calling from BAM files. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Variabel | A novel approach and method for intrahost variant detection, which outperforms existing ONT variant callers. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VarScan | VarScan is a platform-independent software tool developed at the Genome Institute at Washington University to detect variants in NGS data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
VarTrix | VarTrix is a software tool for extracting single cell variant information from 10x Genomics single cell data. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vawk | An awk-like VCF parser |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
VCF-kit | Assorted utilities for the variant call format. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcf2maf | Convert a VCF into a MAF, where each variant is annotated to only one of all possible gene isoforms. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
vcflib | C++ library and cmdline tools for parsing and manipulating VCF files. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
vg | Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Vt | A tool set for short variant discovery in genetic sequence data. |
Genobioinfo Cluster: How to use |
WhatsHap | WhatsHap is a software for phasing genomic variants using DNA sequencing reads, also called read-based phasing or haplotype assembly. It is especially suitable for long reads, but works also well with short reads. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
goatools | Python library to handle Gene Ontology (GO) terms. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
PromoTech | Promotech is a machine-learning-based classifier trained to generate a model that generalizes and detects promoters in a wide range of bacterial species. | Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
bam2plot | Make coverage plots from bam files. |
Genobioinfo Cluster: How to use |
CVIT | Chromosome Viewing Tool. A collection of Perl scripts that enable quick visualizations of features on linkage groups, psuedochromosomes or cytogenetic maps. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
g3d | Genomics 3D visualizer tool sets.
|
Genobioinfo Cluster: How to use |
GCI | Genome Continuity Inspector (GCI) is an assembly assessment tool for high-quality genomes (e.g. T2T genomes), in base resolution. | Genobioinfo Cluster: How to use |
GfaViz | Graphical interactive tool for the visualization of sequence graphs in GFA format. |
Genobioinfo Cluster: How to use |
GrapeTree | GrapeTree is a fully interactive, tree visualization program within EnteroBase, which supports facile manipulations of both tree layout and metadata. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Haplostrips | Haplostrips produce plots that depict variants in a genomic window among different samples. Visualize similarities between haplotypes with respect to a reference haplotype through haplotype clustering and sorting, useful for revealing hidden population structure. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
HarvestTools | HarvestTools is a utility for creating and interfacing with Gingr files, which are efficient archives that the Harvest Suite uses to store reference-compressed multi-alignments, phylogenetic trees, filtered variants and annotations. |
Genobioinfo Cluster: How to use |
HyPhy | HyPhy is an open-source software package for the analysis of genetic sequences using techniques in phylogenetics, molecular evolution, and machine learning. It features a complete graphical user interface (GUI) and a rich scripting language for limitless customization of analyses. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
Infomap | Multi-level network clustering based on the Map Equation. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
Jasmine | JASMINE: Jointly Accurate Sv Merging with Intersample Network Edges. This tool is used to merge structural variants (SVs) across samples. Each sample has a number of SV calls, consisting of position information (chromosome, start, end, length), type and strand information, and a number of other values. Jasmine represents the set of all SVs across samples as a network, and uses a modified minimum spanning forest algorithm to determine the best way of merging the variants such that each merged variants represents a set of analogous variants occurring in different samples. Manual : Jasmine User Manual · mkirsche/Jasmine Wiki · GitHub Jasmine also includes a module for automating the creation of IGV screenshots of variants of interest. |
Genobioinfo Cluster: How to use |
MAGeCK | Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. Manual and video tutorials : liulab / mageck-vispr — Bitbucket |
Genobioinfo Cluster: How to use |
Met4j | Met4J is an open-source Java library dedicated to the structural analysis of metabolic networks. It also came with a toolbox gathering CLI for several analyses relevant to metabolism-related research. |
Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
plotsr | Tool to plot synteny and structural rearrangements between genomes. | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
PretextView | OpenGL Powered Pretext Contact Map Viewer. |
Genobioinfo Cluster: How to use |
proteinortho_curves | Draw pan- and core-genome curves from proteinortho output |
Genobioinfo Cluster: How to use |
TBtools-II | GUI/CommandLine Tool Box for biologistists to utilize NGS data. |
Genobioinfo Cluster: How to use |
viu | A small command-line application to view images from the terminal written in Rust. |
Genobioinfo Cluster: How to use |
Application | Description | Availability/Use |
---|---|---|
chimerascan | chimerascan is a software package that detects gene fusions in paired-end RNA sequencing (RNA-Seq) datasets. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
CulebrONT | An open-source, scalable, modular and traceable Snakemake pipeline, able to launch multiple assembly tools in parallel, giving you the possibility of circularise, polish, and correct assemblies, checking quality. CulebrONT can help to choose the best assembly between all possibilities. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
FROGS | FROGS is a CLI workflow designed to produce an OTU count matrix from high depth sequencing amplicon data. |
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
MAGeCK | Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. Manual and video tutorials : liulab / mageck-vispr — Bitbucket |
Genobioinfo Cluster: How to use |
MitoFinder | Mitofinder is a pipeline to assemble mitochondrial genomes and annotate mitochondrial genes from trimmed read sequencing data. MitoFinder is also designed to find and annotate mitochondrial sequences in existing genomic assemblies (generated from Hifi/PacBio/Nanopore/Illumina sequencing data...) | Genologin Cluster: How to use Genobioinfo Cluster: How to use |
nf-core workflows | This module provide access to workflows nf-core, there are automatically downloaded into your home. More info at nf-core/config page.
|
Genologin Cluster: How to use Genobioinfo Cluster: How to use |
RepeatExplorer | RepeatExplorer is a computational pipeline designed to identify and characterize repetitive DNA elements in next-generation sequencing data from plant and animal genomes. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |
snakePipes | Customizable workflows based on snakemake and python for the analysis of NGS data. | Genologin Cluster: How to use Genobioinfo Cluster: Ask for Install |