MiModD User Guide

last updated: 20-Jan-2015


What is MiModD ?

MiModD is a comprehensive software package for the identification and annotation of mutations in the genomes of model organisms from whole-genome sequencing (WGS) data.

Whom is it for ?

MiModD was designed specifically to enable biologists/geneticists with limited bioinformatical knowledge to analyze genome-wide sequencing data of their preferred organism on off-the-shelf desktop computers.

MiModD could be interesting for you if:

  • you are a geneticist, who wants to identify mutations in a mutant strain of a model organism.

    You probably have a good understanding of the principles of mutation mapping, and you know that whole-genome sequencing can speed up this process enormously, but you do not know how you would analyze WGS data and you do not know a bioinformatician who could do it for you.

    Alternatively, you may have a bioinformatics facility in your institute, but they are overwhelmed with user requests and have a long queue and limited time per experiment so you would prefer to analyze your data independently.

  • you are analyzing WGS data already yourself, but you think your own computer cannot do such computation-intensive tasks. So you are using web-based tools, but it is really annoying to transfer all those huge files back and forth over the internet.

  • you are working in a bioinformatics facility and you are overwhelmed with user requests and would like to leave relatively standard tasks to your clients to have more time to tackle challenging problems.

MiModD runs quite efficiently on standard notebooks and desktop computers (see the section on hardware requirements for details). It is really straightforward to install and easy to use (in particular, when integrated into a local Galaxy instance). When combined with Galaxy it can turn a regular PC into a small server for WGS data analysis for your research group or maybe even for your department.

MiModD is beginner-friendly, but that does not mean that it produces simplified results. Internally, it uses established tools like the spaced-seeds aligner SNAP and samtools/bcftools for variant calling, which ensures quality output.

_images/analysis_workflow.png

A typical analysis workflow in MiModD.

What exactly can MiModD do for me ?

  • short-reads alignment from different input formats (fastq and gzipped fastq, SAM, BAM)
  • conversion between input formats and annotation with run metadata
  • variant calling (SNPs and indels)
  • variant post-processing, including filtering and annotation
  • deletion calling (currently only for paired-end data)
  • support for mapping-by-sequencing approaches through CloudMap-ready output

What makes MiModD user-friendly ?

Here is a short list:

  • completely automated management of temporary files
  • analysis of multiple samples through a single linear workflow
  • built-in multithreading in the alignment and variant calling steps for increased speed
  • choice between a consistent command-line interface and a complete and straightforward integration of the package into a Galaxy server

Contents of this User Guide

  • Installation Instructions

    This section describes the installation of MiModD and all required and optional software including a guide to installing Galaxy and integrating MiModD into it.

  • Tutorial

    We hope you will find MiModD and, especially, its Galaxy interface beginner-friendly and easy to use. The package is, however, versatile enough to be compatible with a number of different use-cases, which is best explained to new users by taking them through some example analyses that introduce the functionality of MiModD step-by-step.

  • Tool Documentation

    This section holds the detailed description of the MiModD command line and Galaxy user interface. It presents and explains all tools of the package and the complete set of parameters that can be used with each of them.

  • MiModD Recipes

    We have started to compile a list of recipes for performing some typical tasks that you are likely to encounter as a user of MiModD. Not all of these are MiModD-specific, but there are also some Galaxy recipes for working efficiently with WGS datasets.

  • MiModD and Galaxy

    This section explains some of the peculiarities and possibilities (like Galaxy Workflows) that will concern you only if you are using MiModD through its Galaxy interface.

  • Mapping-by-Sequencing using MiModD and CloudMap

    This section explains how to generate output with MiModD for further analysis with the CloudMap series of mapping tools. Read if you are intending to analyze mapping-by-sequencing experiments.

  • Overview of the Python API

    MiModD is aimed at end-users. It is NOT another Python package for manipulating next-generation sequencing data programmatically. That said, as a Python programmer you may still find certain parts of the package useful for your work, or you may wish to contribute code or bug-fixes. In these cases, this section may serve as your starting point for exploring the package code.