Data Resources
![Data Data](https://www.validate-network.org/sites/default/files/styles/mt_image_large/public/validate/images/page/replace_1.jpg?itok=qdro1maL)
Welcome to our Data Resources page. Below you will find a list of relevant DNA databases as well as some links to free online training.
You can find a list of available data sets from VALIDATE projects on our Data Sharing page.
Databases
![Addgene Logo Addgene Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/addgene_logo.jpg?itok=qaRksivw)
A plasmid repository that archives and distributes plasmids for scientists, while also providing free molecular biology resources.
![Blast logo blast logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/blast_logo.jpg?itok=DEYvT_Yw)
Basic Local Alignment Search Tool
BLAST finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of the matches.
![Biocyc logo biocyc logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/biocyc_logo.jpg?itok=-ruQl2Cg)
Microbial genome Web portal that combines thousands of genomes. It provides an extensive range of query tools, visualization services and analysis software.
![Cattle gene atlas logos cattle gene atlas logos](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/cattle_gene_atlas_logos.jpg?itok=XXxSQsN9)
Website that shows the expression of genes of interest, looked up by Ensembl gene ID or gene symbol, and plots it by tissue type.
![Centre for genomic epidemiology centre for genomic epidemiology](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/centre_for_genomic_epidemiology.jpg?itok=VrhIz1eZ)
Center for Genomic Epidemiology
Website for the analysis of bacterial genomes.
![Chip atlas logo chip atlas logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/chip-atlas_logo.jpg?itok=uoU9lEjB)
ChIP-Atlas is an integrative and comprehensive database for visualizing and making use of public ChIP-seq data. ChIP-Atlas covers almost all public ChIP-seq data submitted to the SRA (Sequence Read Archive) in NCBI, DDBJ, or ENA, and is based on over 144,000 experiments.
![Dna vax db logo dna vax db logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/dna_vax_db_logo.jpg?itok=cAtqd2pQ)
A web-based DNA vaccine database and analysis system that curates, stores, and analyzes DNA vaccines and DNA vaccine plasmid vectors. DNAVaxDB includes only those DNA vaccines that have been verified to induce protection in at least a laboratory animal model.
![Enrichr enrichr](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/enrichr.jpg?itok=--wvrZrk)
Interactive and Collaborative Gene List Enrichment Analysis Tool.
![E ensembl logo e ensembl logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/e_ensembl_logo.jpg?itok=sFC9vJ0l)
Ensembl is a genome browser for vertebrate genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation. Ensembl annotates genes, computes multiple alignments, predicts regulatory function and collects disease data. Ensembl tools include BLAST, BLAT, BioMart and the Variant Effect Predictor (VEP) for all supported species.
![Veupath db logo veupath db logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/veupath_db_logo.jpg?itok=WrFeAO-P)
Web portal for accessing genomic-scale datasets associated with diverse eukaryotic microbes.
![Gprofiler logo gprofiler logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/gprofiler_logo.jpg?itok=HzeiOCyz)
A gene-centric data integrator with web UI and API services.
![Gene Atlas Gene atlas](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/gener_atlas.jpg?itok=_0usosv4)
Gene ATLAS is a large database of associations between hundreds of traits and millions of variants using the UK Biobank cohort.
![GOrilla GOrilla](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/gorilla.jpg?itok=G98LuFm6)
Gene Ontology Enrichment analysis and visualization tool.
Huvax: Licensed Human Vaccines
![Huvax Logo Huvax Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/huvax_logo_0.jpg?itok=kdZdV9HV)
A web-based human licensed vaccine database. Huvax collects, annotates and analyses licensed human vaccines around the world. Currently it contains all licensed human vaccines in the US and Canada, and many licensed human vaccines from other countries. Huvax provides a user-friendly web interface for you to search, compare, and analyze different vaccines.
![Immport immport](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/immport.jpg?itok=beHOQE1-)
Website in support of the NIH mission to share data with the public.
![Iedar logo iedar logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/iedar_logo_0.jpg?itok=3OJ7G542)
Immune Epitope Database and Analysis Resources
IEDB catalogs experimental data on antibody and T cell epitopes studied in different species in the context of infectious disease, allergy, autoimmunity and transplantation. IEDB also provides tools to help with the prediction and analysis of epitopes.
![Inate DB Innate DB](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/nnate_db_logo.jpg?itok=FANSXujV)
Publicly available database of the genes, proteins, experimentally-verified interactions and signaling pathways involved in the innate immune response of humans, mice and bovines to microbial infection. The database captures an improved coverage of the innate immunity interactome by integrating known interactions and pathways from major public databases together with manually-curated data into a centralised resource.
![Intergrated dna technologies intergrated dna technologies](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/intergrated_dna_technologies_0.jpg?itok=JxieIPxU)
Tool to design qPCR primers.
![Metascape logo metascape logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/metascape_logo.jpg?itok=T4EO_dmY)
Metascape is a free gene annotation and analysis resource that helps biologists make sense of one or multiple gene lists.
![Premier biosfot logo premier biosfot logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/premier_biosfot_logo.jpg?itok=qKuW1qYI)
Primer analysis software. It analyzes secondary structure and melting temperature, and reports the best primer pairs for given experimental conditions.
![DTU Logo DTU Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/dtu_logo_long.jpg?itok=D3WNHDaK)
Server predicts CTL epitopes in protein sequences.
The NetMHCIIpan-4.0 server predicts peptide binding to any MHC II molecule of known sequence using Artificial Neural Networks.
![Plasmodb logo plasmodb logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/plasmodb_logo.jpg?itok=3dCj1AFx)
Genome database for the genus Plasmodium.
![Polyphen 2 logo polyphen 2 logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/polyphen_2_logo_0.jpg?itok=fIhdaGcP)
PolyPhen-2 (Polymorphism Phenotyping v2) is a tool which predicts possible impact of an amino acid substitution on the structure and function of a human protein using straightforward physical and comparative considerations.
![Tb database logo tb database logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/tb_database_logo_0.jpg?itok=s1UqCctg)
TBDB is an integrated database of genome sequence, expression data and literature for tuberculosis. It contains genome sequence data for Mycobacterium tuberculosis strains and other sequenced Mycobacteria. It offers a collection of tools for visualization, analysis and data download.
![The human protien atlas logo the human protien atlas logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/the_human_protien_atlas_logo.jpg?itok=-OgsKr1Z)
This Atlas contains information regarding the expression profiles of human genes at both the mRNA and protein level. The protein expression data from 44 normal human tissue types is derived from antibody-based protein profiling using immunohistochemistry. The protein data covers 15313 genes (78%) for which there are available antibodies. The mRNA expression data is derived from deep sequencing of RNA (RNA-seq) from 37 different normal tissue types. It also contains information about the expression and spatio-temporal distribution of proteins within human cells.
![Tm calculator tm calculator](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/tm_calculator.jpg?itok=xFV8Tpds)
This tool calculates the Tm of primers and estimates an appropriate annealing temperature when using different DNA polymerases.
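To make the idea of a primer Tm concrete, here is a minimal Python sketch. It assumes the Wallace rule (2 °C per A/T, 4 °C per G/C), a rough approximation for short oligos, and it is not the method used by the calculator linked above. The function names and the 5 °C annealing offset are illustrative assumptions, and the sketch ignores polymerase-specific adjustments.

```python
def wallace_tm(primer):
    """Approximate Tm in degrees C using the Wallace rule (assumed here;
    only a rough guide, intended for short oligos)."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must be a non-empty string of A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def annealing_estimate(forward, reverse, offset=5):
    """Hypothetical helper: a rule-of-thumb annealing temperature set
    `offset` degrees below the lower of the two primer Tm values.
    The offset is an assumption, not a polymerase-specific value."""
    return min(wallace_tm(forward), wallace_tm(reverse)) - offset


if __name__ == "__main__":
    fwd = "ATGCATGCATGCATGC"  # 8 A/T and 8 G/C: 2*8 + 4*8 = 48
    rev = "GGGGCCCC"          # 8 G/C: 4*8 = 32
    print(wallace_tm(fwd), wallace_tm(rev), annealing_estimate(fwd, rev))
```

For real experiments, a dedicated calculator that accounts for salt concentration, primer concentration and the chosen polymerase will give more reliable values than this approximation.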
![Vaxign vaccine design logo vaxign vaccine design logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/vaxign_vaccine_design_logo.jpg?itok=zc4SF_r2)
Vaxign (Vaccine Design) is a vaccine target prediction and analysis system based on the principle of reverse vaccinology.
![Vaxijen vaxijen](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/vaxijen.jpg?itok=BG_uIrpm)
Server for alignment-independent prediction of protective antigens and subunit vaccines.
![VaxJo Logo VaxJo Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/vaxjo_logo.jpg?itok=EExgYqAi)
A program to analyze vaccine adjuvants used in the vaccines collected in the VIOLIN vaccine database.
Vevax: Licensed Veterinary Vaccines
![Vevax Logo Vevax Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/vexvac.jpg?itok=BWSLmCU4)
A web-based licensed veterinary vaccine database. Vevax collects, annotates and analyses licensed veterinary vaccines around the world. Vevax currently focuses on USA-licensed veterinary vaccines and contains all licensed veterinary vaccines in the US. Vevax provides a user-friendly web interface for you to search, compare, and analyze different vaccines.
![Vaxvec vaxvec](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/vaxvec_0.jpg?itok=8P9Elok7)
A database that collects and analyzes vaccine vectors used in vaccine development and research for diseases important to public health.
![Virumugendb virumugendb](https://www.validate-network.org/sites/default/files/styles/mt_image_small/public/validate/images/media/virumugendb_0.jpg?itok=K8rm25YM)
A Database of Virulent Genes used for Development of Live Attenuated Vaccines.
Public Access Training
A basic task in the analysis of count data from RNA-seq is the detection of differentially expressed genes. The count data are presented as a table which reports, for each sample, the number of sequence fragments that have been assigned to each gene. Analogous data also arise for other assay types, including comparative ChIP-Seq, HiC, shRNA screening, and mass spectrometry. An important analysis question is the quantification and statistical inference of systematic changes between conditions, as compared to within-condition variability. The package DESeq2 provides methods to test for differential expression by use of negative binomial generalized linear models; the estimates of dispersion and logarithmic fold changes incorporate data-driven prior distributions. This vignette explains the use of the package and demonstrates typical workflows. An RNA-seq workflow on the Bioconductor website covers similar material to this vignette but at a slower pace, including the generation of count matrices from FASTQ files. DESeq2 package version: 1.30.0
Michael I. Love, Simon Anders, and Wolfgang Huber
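As a rough illustration of the count table and the negative binomial idea described in the vignette summary above, here is a minimal Python sketch. It is not DESeq2 (an R/Bioconductor package) and it does no normalization, dispersion estimation, shrinkage or testing. The gene names, sample names, counts, pseudocount and helper functions are all hypothetical. The variance formula is the standard negative binomial mean-variance relationship, variance = mu + dispersion * mu^2.

```python
import math

# Hypothetical count table: for each gene, the number of sequence
# fragments assigned to it in each sample.
counts = {
    "geneA": {"ctrl_1": 100, "ctrl_2": 120, "trt_1": 400, "trt_2": 440},
    "geneB": {"ctrl_1": 50, "ctrl_2": 50, "trt_1": 50, "trt_2": 50},
}

# Hypothetical experimental design: which samples belong to which condition.
conditions = {"ctrl": ["ctrl_1", "ctrl_2"], "trt": ["trt_1", "trt_2"]}


def mean_count(gene, group):
    """Mean raw count of a gene across the samples of one condition."""
    samples = conditions[group]
    return sum(counts[gene][s] for s in samples) / len(samples)


def naive_log2_fold_change(gene, pseudocount=1.0):
    """Unshrunken log2 ratio of treated to control means.
    The pseudocount (an assumption) avoids division by zero."""
    treated = mean_count(gene, "trt") + pseudocount
    control = mean_count(gene, "ctrl") + pseudocount
    return math.log2(treated / control)


def nb_variance(mu, dispersion):
    """Negative binomial variance: mu + dispersion * mu**2.
    With dispersion = 0 this reduces to the Poisson variance mu."""
    return mu + dispersion * mu ** 2


if __name__ == "__main__":
    for gene in counts:
        print(gene, round(naive_log2_fold_change(gene), 3))
    print(nb_variance(100, 0.1))
```

The dispersion term is what lets the model express within-condition variability beyond Poisson noise. Judging whether a fold change is a systematic change between conditions requires estimating that dispersion, which this sketch deliberately leaves out.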
In this course you will discuss some of the questions that can be addressed using scRNA-seq, as well as the computational and statistical methods available. The course is taught through the University of Cambridge Bioinformatics training unit, but the material found on these pages is meant to be used by anyone interested in learning about computational analysis of scRNA-seq data. The course is taught twice per year and the material here is updated prior to each event.
![R Logo R Logo](https://www.validate-network.org/sites/default/files/styles/mt_image_medium/public/validate/images/media/r_programming_language.jpg?itok=uVQxGVpj)
R is a licence-free programming language used for statistical computing and data science, and has been used to visualise everything from market trends to vaccine efficacy. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.
Mirvat has selected training that will help you enhance your bioinformatics skills:
PDFs:
Online Courses: