Herramientas software

Las herramientas software para el análisis de datos de IMPaCT-Data se pueden encontrar en el dominio IMPaCT-Data de acceso público en bio.tools. Bio.tools es un registro de componentes software y bases de datos dirigida a investigadores en el campo de las ciencias biológicas y biomédicas para facilitarles el trabajo de encontrar, entender, utilizar y citar recursos de uso diario.

FAIR4Health Data Curation Tool

This is a standalone, desktop application developed by the FAIR4Health project (https://www.fair4health.eu/). The tool is used to connect the health data sources which can be in various formats (Excel files, CSV files, SQL databases) and migrate data into a HL7 FHIR Repository. The tool shows the available FHIR profiles to the user so that he/she can perform mappings appropriately. The tool can also contact a Terminology Server (which is actually another HL7 FHIR Repository) so that data fields can be annotated if coding schemes such as ICD10 or SNOMED-CT are in use.


Beyondcell is a computational methodology for identifying tumour cell subpopulations with distinct drug responses in single-cell RNA-seq data and proposing cancer-specific treatments.


EvolClust predicts groups of genes that are conserved in terms of gene order across different species distinguishing it from the background gene order conservation found between species. We define a cluster as a group of homologous proteins that are found grouped together in at least two different genomes and which are more conserved than what is expected for the pair of genomes. The order of the genes inside the cluster is not necessarily conserved. Pairwise clusters are grouped into multi-species families.


Using mechanistic models for the clinical interpretation of complex genomic variation. The sustained generation of genomic data in the last decade has increased the knowledge on the causal mutations of a large number of diseases, especially for highly penetrant Mendelian diseases, typically caused by a unique or a few genes. However, the discovery of causal genes in complex diseases has been far less successful. Many complex diseases are actually a consequence of the failure of complex biological modules, composed by interrelated proteins, which can happen in many different ways, which conferring a multigenic nature to the condition that can hardly be attributed to one or a few genes.

RD-Connect Genome-Phenome Analysis Platform (GPAP)

An online tool for diagnosis and gene discovery in rare disease research. The platform features allow identifying disease-causing mutations in rare disease patients and linking them with detailed clinical information.


Annotates variants with biological data such as protein structural information, functionally important residues, conservation of functional domains and evidence of cross-species conservation.


PhylomeDB is a public database for complete catalogs of gene phylogenie. It allows users to interactively explore the evolutionary history of genes through the visualization of phylogenetic trees and multiple sequence alignments. Moreover, phylomeDB provides genome-wide orthology and paralogy predictions which are based on the analysis of the phylogenetic trees. The automated pipeline used to reconstruct trees aims at providing a high-quality phylogenetic analysis of different genomes, including Maximum Likelihood tree inference, alignment trimming and evolutionary model testing.


Hipathia is a method for the computation of signal transduction along signaling pathways from transcriptomic data. The method is based on an iterative algorithm which is able to compute the signal intensity passing through the nodes of a network by taking into account the level of expression of each gene and the intensity of the signal arriving to it. It also provides a new approach to functional analysis allowing to compute the signal arriving to the functions annotated to each pathway.


DisGeNET is a discovery platform containing one of the largest publicly available collections of genes and variants associated to human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models and the scientific literature. DisGeNET data are homogeneously annotated with controlled vocabularies and community-driven ontologies. Additionally, several original metrics are provided to assist the prioritization of genotype–phenotype relationships. The current version of DisGeNET (v7.0) contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 diseases, disorders, traits, and clinical or abnormal human phenotypes, and 369,554 variant-disease associations (VDAs), between 194,515 variants and 14,155 diseases, traits, and phenotypes.


DiSMed is a de-identification methodology for Spanish medical texts based on Named Entity Recognition (NER). It is based on spaCy and partially based on the networks designed by Gillaume Genthial implemented on Tensorflow 1. DiSMed includes both the Python code and the curated dataset, available under request under a research use agreement.


GeneCodis is a web-based tool for the ontological analysis of lists of genes, proteins, and regulatory elements like miRNAs, transcription factors, and CpGs.


Identification of diferentially methylated regions (DMRs) in predefined regions (promoters, CpG islands...) from the human genome using Illumina's 450K or EPIC microarray data. Provides methods to rank CpG probes based on linear models and includes plotting functions.


ImaGEO is a web tool for gene expression Meta-Analysis that implements a complete and comprehensive meta-analysis workflow starting from Gene Expression Omnibus (GEO) dataset identifiers. The application integrates GEO datasets, applies different meta-analysis techniques and provides functional analysis results in an easy-to-use environment. ImaGEO is a powerful and useful resource that allows researchers to integrate and perform meta-analysis of GEO datasets to lead robust findings for biomarker discovery studies.


MetaGenyo is a simple, ready-to-use software which has been designed to perform meta-analysis of genetic association studies.


MetaPhOrs is a public repository of phylogeny-based orthologs and paralogs that were computed using phylogenetic trees available in twelve public repositories. Currently, over 117,131,162 of unique homologs are deposited in MetaPhOrs database. These predictions were retrieved from 8,246,911 Maximum Likelihood trees for 4,094 species. For each prediction, MetaPhOrs provides a Consistency Score and Evidence Level describing its goodness, together with number of trees and links to their source databases.


DREIMT is a bioinformatics tool for hypothesis generation and prioritization of drugs capable of modulating immune cell activity from transcriptomics data.


Tool to prioritize therapeutic vulnerabilities in cancer.


PanDrugs is a method to prioritize anticancer drug treatments according to individual genomic data. PanDrugs current version integrates data from 24 primary sources and supports 56297 drug-target associations obtained from 4804 genes and 9092 unique compounds.


A curated inventory of catalytic and biologically relevant small ligand-binding sites.


TRIFID is an ML-based tool trained on the evidence of large-scale proteomics analysis and evolutionary, structural, annotation, splicing, and RNA-seq based features to classify the biologically important splice isoforms.

FAIR4Health Data Privacy Tool

This is a standalone, desktop application developed by the FAIR4Health project (https://www.fair4health.eu/). The tool aims to handle the privacy challenges exposed by the sensitive health data. It is designed to work on an HL7 FHIR API so that it can be used on top of any standard FHIR Repository as a data de-identification, anonymization, and related actions toolset. The tool accesses FHIR resources, presents metadata to the user, guide the user about the configuration to be applied and then output the processed FHIR resources.


APID Interactomes provides a comprehensive collection of protein interactomes for more than 500 organisms based on the integration of known experimentally validated protein-protein physical interactions (PPIs). Construction of the interactomes is done with a methodological approach to report quality levels and coverage over the proteomes for each organism included. APID unifies PPIs from primary databases of molecular interactions (BIND, BioGRID, DIP, HPRD, IntAct, MINT) and from experimentally resolved 3D structures (PDB) where more than two distinct proteins have been identified. APID also includes a data visualization web-tool that allows the construction of sub-interactomes using query lists of proteins of interest and the visual exploration of the corresponding networks, including an interactive selection of the properties of the interactions reliability of the "edges") and a mapping of the functional environment of the proteins (functional annotations of the "nodes").


A species-level analysis of 16S rRNA nanopore sequencing data based on de novo clustering and consensus building.


Publicly available integrated pipeline designed for the assembly and subsequent analysis of Ion Torrent bacterial sequence data. Both its components and their configuration are based on a research process aimed to discover the optimal combination of tools for obtaining good results from single-end reads generated by the Ion Torrent PGM sequencer.


NanoRTax is a taxonomic and diversity analysis pipeline built originally for Nanopore 16S rRNA data with real-time analysis support in mind. It combines state-of-the-art classifiers such as Kraken2, Centrifuge and BLAST with downstream analysis steps to provide a framework for the analysis of in-progress sequencing runs. NanoRTax retrieves the final output files in the same structure/format for every classifier which enables more comprehensive tool/database comparison and better benchmarking capabilities. Additionally, NanoRTax includes a web application (./viz_webapp/) for visualizing complete or partial pipeline outputs. The NanoRTax pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with conda environments and docker containers making installation trivial and results highly reproducible.


Small RNAseq pipeline for paired-end reads


PlasmidID is a mapping-based, assembly-assisted plasmid identification tool that analyzes and gives graphic solution for plasmid identification.


Open-source LIMS (laboratory Information Management System) for Next Generation Sequencing sample management, statistics and reports, and bioinformatics analysis service management.


cg/wgMLST allele calling software, schema evaluation and allele distance estimation for outbreak reserch.


Transcription Factor Target Enrichment Analysis


Priorr is a prioritization program of disease-linked genetic variants devoloped within the Genetics&Genomics Department of La Fundacion Jimenez Diaz University Hospital. Priorr is conceived to analyse the output of the FJD-pipeline of SNVs or CNVs. This software program offers a number of useful functionalities for variant analysis such as: filtering by a virtual panel of genes. manual control of different population frequencies or pathogenicity predictors or filtering out variants that have been already found by another protocol.


Pipeline for Single Nucleotide Variants (SNVs) and Copy Number Variation (CNVs) variant calling


PTMCode is a resource of known and predicted functional associations between protein post-translational modifications (PTMs) within and between interacting proteins. It currently contains 316,546 modified sites from 69 different PTM types which are also propagated through ortholgs between 19 different eukaryotic species. A total of 1.6 million sites and 17 million functional associations more than 100,000 proteins can currently be explored.


This pipeline was developed to detect and quantify isoforms from the expression of minigenes, whose cDNA was sequenced using Oxford Nanopore Technologies (ONT).


Prioritization of gene diseases candidates by disease-aware evaluation of heterogeneous evidence networks

Automatic segmentation tool

Tool that automatically detects the histological type of a tumor region of interest from Whole Slide Imaging technique

Slides Viewer

Tool to visualize WSI and navigate through their zones at different zoom levels.


LinkEHR is a set of tools that enables the semantic interoperability of your data by: Creating clinical information models (archetypes) Transforming clinical data into standards such as openEHR, HL7 CDA, or ISO 13606


Liferay, Inc., is an open-source company that provides free documentation and paid professional service to users of its software. Mainly focused on enterprise portal technology, the company has its headquarters in Diamond Bar, California, United States.


Metabolizer is a web tool for analysis of modular architecture of metabolic pathways using transcriptomic data. Metabolizer calculates impact of modules on production of metabolites. These modules are conserved part of metabolism which starts with substrate(s) and ends with a product.


CyPathia is a cytoscape app, that provides a user friendly and straightforward interface. The CyPathia app is based on Hipathia Bioconductor package, allowing the Cytoscape community the possibility of using mechanistic models.


A web tool implements a mechanistic model of human signaling for the interpretation of the consequences of the combined changes of gene expression levels and/or genomic mutations in the context of signalling pathways known to be involved in the infection by SARS-CoV-2, which are updated with the curated versions released by the COVID-19 Disease Map curation project .


impuSARS allows the imputation of viral whole genome sequences from partially sequenced samples. Additionally, impuSARS provides the lineage associated to the imputed sequence. impuSARS have been validated with a reference of SARS-CoV-2 sequences.


SPACNACS is a crowdsourcing initiative to provide information about Copy Number Variations of the Spanish population to the scientific/medical community.


nfcore/viralrecon is a bioinformatics analysis pipeline used to perform assembly and intra-host/low-frequency variant calling for viral samples. The pipeline supports short-read Illumina sequencing data from both shotgun (e.g. sequencing directly from clinical samples) and enrichment-based library preparation methods (e.g. amplicon-based: ARTIC SARS-CoV-2 enrichment protocol; or probe-capture-based).


OpenEBench (https://openebench.bsc.es) is the ELIXIR benchmarking and technical monitoring platform for bioinformatics tools, web servers and workflows. OpenEBench is part of the ELIXIR Tools platform and its development is led by the Barcelona Supercomputing Center (BSC) in collaboration with partners within ELIXIR and beyond. OpenEBench, holds a specific infrastructure to monitor software quality. In an initial analysis phase BSC has put together a series of quality metrics taken from a number of sources. The source of such metrics includes documents by the Software Sustainability Institute, recommendations for open source software development, or for software quality metrics. For each metric, a specific source of information have been chosen and the necessary interface implemented.


MIGNON (Mechanistic InteGrative aNalysis Of rNa-seq data) is a versatile workflow to integrate RNA-seq genomic and transcriptomic data into mechanistic models of signaling pathways.


ExpHunterSuite is an R package for the comprehensive analysis of transcriptomic data.


MetaFun is a web tool for the integration and functional characterization by unveiling sex differences in multiple omics studies through comprehensive functional meta-analysis.


DomFun is a system to assign functions to unknown proteins using a systemic approach without considering their sequence but their domains associated with functional systems. It uses associations calculated between protein domains and functional annotations as training dataset and performs predictions over proteins (using UniProt identifiers) by finding their domains and if they have been associated with functional annotations (in GO molecular functions, biological processes, KEGG and Reactome pathway terms).


SMAca: SMA Carrier Analysis tool. SMN1 copy-number and sequence variant analysis from next generation sequencing data. SMAca is a python tool to detect putative SMA carriers and estimate the absolute SMN1 copy-number in a population. Moreover, SMAca takes advantage of the knowledge of certain variants specific to SMN1 duplication to also identify the so-called “silent carriers” (i.e. individuals with two copies of SMN1 on one chromosome, but none on the other).


Collaborative Spanish Variant Server (CSVS) is a crowdsourcing initiative to provide information about the genomic variability of the Spanish population to the scientific/medical community. It is useful for filtering polymorphisms and local variations in the process of prioritizing candidate disease genes. Submissions from WES or WGS are accepted.


Dockerized Jupyter notebook for interactive Oxford Nanopore MinION sequence manipulation and genome assembly.


Tool to assess the efficiency of targeted enrichment sequencing.

Este sitio web utiliza cookies y cookies de terceros para un mejor funcionamiento. El uso de sus datos personales es limitado.
Al usar el sitio, usted acepta esto como se describe en nuestra Nota legal.