[5.14.0] - 2020-04-08¶
- Redesigned HTML reports, with improved interface and performance.
- For big data, HTML reports can now be stored as ZIP, where files are not anymore embedded but rather are stored in an auxilliary folder, such that they don’t have to be in memory during report rendering.
- Added subcategories to report (see docs).
- Fixed a bug linter, leading to only one rule or snakefile to be linted.
- Breaking change in CLI: added flags –conda-cleanup-envs and –conda-cleanup-pkgs, removed flag –cleanup-conda.
- Fixed scheduling of pipe jobs, they are now always scheduled, fixing a hangup.
- Corrected quoting of shell command for cluster submission.
[5.13.0] - 2020-03-27¶
- Allow to flag directories for inclusion in the report.
- Fixed hash computation for –cache in case of positional params arguments.
- Automatically restrict thread usage of linear algebra libraries to whatever is specified in the rule/job.
[5.12.0] - 2020-03-24¶
- Fixed the ability to supply functions for the thread directive.
- Improved error messages for caching.
- A new “cache: true” directive that allows to annotate between workflow caching eligibility for rules in the workflow.
[5.11.2] - 2020-03-19¶
- Fixed a spurious error message complaining about missing singularity image if –use-singularity is not activated.
[5.11.1] - 2020-03-16¶
- Fixed a KeyError bug when executing a workflow that defines containers without –use-singularity.
[5.11.0] - 2020-03-16¶
- Fixes for environment modules and tibanna-based AWS execution.
- Fixes for –default-resources defaults.
- –cores is now a mandatory argument!
- Automatic checksum validation for google storage.
- Azure storage authentication via SAS
- A generic cintainer directive that will in the future allow for other backends than just singularity.
- envvars directive for asserting environment variable existence. See docs.
- support for AWS spot instances via –tibanna-config spot=true.
- Automatic code quality linting via –lint.
[5.10.0] - 2020-01-20¶
- Jupyter notebook integration, see docs. This enables interactive development of certain data analysis parts (e.g. for plotting).
- Ability to overwrite thread definitions at the command line (
--threads rulename=3), thereby improving scalability.
- Requester pays configuration for google storage remote files.
- Add keyword
allow_missingto expand function, thereby allowing partical expansion by skipping wildcards for which no keywords are defined.
- Various bug fixes, e.g. for between workflow caching and script execution.
[5.9.0] - 2019-12-20¶
- Support for per-rule environment module definitions to enable HPC specific software deployment (see docs).
- Allow custom log handler defitions via –log-handler-script (e.g. post errors and progress to a slack channel or send emails).
- Allow setting threads as a function of the given cores (see docs).
- Various minor fixes.
[5.8.2] - 2019-12-16¶
- Implemented a
multiexthelper, allowing to define a set of output files that just differ by extension.
- Fixed a failure when caching jobs with conda environments.
- Fixed various minor bugs.
- Caching now allows to cache the output of rules using
[5.8.0] - 2019-11-15¶
- Blockchain based caching between workflows (in collaboration with Sven Nahnsen from QBiC), see the docs.
- New flag –skip-cleanup-scripts, that leads to temporary scripts (coming from script or wrapper directive) are not deleted (by Vanessa Sochat).
- Various bug fixes.
[5.7.4] - 2019-10-23¶
- Various fixes and adaptations in the docker container image and the test suite.
[5.7.1] - 2019-10-16¶
- Ability to print log files of failed jobs with –show-failed-logs.
- Fixed bugs in tibanna executor.
- Fixed handling of symbolic links.
- Fixed typos in help texts.
- Fixed handling of default resources.
- Fixed bugs in azure storage backend.
[5.7.0] - 2019-10-07¶
- Fixed various corner case bugs. Many thanks to the community for pull requests and reporting!
- Container execution adapted to latest singularity.
- First class support for Amazon cloud execution via a new Tibanna backend <https://snakemake.readthedocs.io/en/v5.7.0/executable.html#executing-a-snakemake-workflow-via-tibanna-on-amazon-web-services>. Thanks to Soo Lee from Harvard Biomedical Informatics!
- Allow multiple config files to be passed via the command line.
- A new, more detailed way to visualize the DAG (–filegraph). Thanks to Henning Timm!
- Pathlib compatibility added. Input and output files can now also be Path objects. Thanks to Frederik Boulund!
- New azure storage remote provider. Transparently access input and output files on Microsoft Azure. Thanks to Sebastian Kurscheid!
[5.6.0] - 2019-09-06¶
- Fix compatibility with latest singularity versions.
- Various bug fixes (e.g. in cluster error handling, remote providers, kubernetes backend).
- Add –default-resources flag, that allows to define default resources for jobs (e.g. mem_mb, disk_mb), see docs.
--dry-runas a synonym of
--dryrun. Other Snakemake options are similarly hyphenated, so other documentation now refers to
--dry-runbut both (and also
-n) will always be accepted equivalently.
[5.5.4] - 2019-07-21¶
- Reports now automatically include workflow code and configuration for improved transparency.
[5.5.2] - 2019-06-25¶
- Various minor bug fixes in reports.
- Speed improvements when using checkpoints.
[5.5.1] - 2019-06-18¶
- Improved report interface. In particular for large files.
- Small TSV tables are automatically rendered as HTML with datatables.
- Be more permissive with Snakefile choices: allow “Snakefile”, “snakefile”, “workflow/Snakefile”, “workflow/snakefile”.
[5.5.0] - 2019-05-31¶
- Script directives now also support Julia.
- Various small bug fixes.
[5.4.4] - 2019-03-22¶
- Vastly improved performance of HTML reports generated with –report, via a more efficient encoding of dara-uri based download links.
- Tighter layout, plus thumbnails and a lightbox for graphical results in HTML reports.
- Bug fix for pipe groups.
- Updated docs.
- Better error handling in DRMAA executor.
[5.4.3] - 2019-03-11¶
- More robust handling of conda environment activation that should work with all setups where the conda is available when starting snakemake.
- Fixed bugs on windows.
[5.4.1] - 2019-02-14¶
- Warning when R script is used in combination with conda and R_LIBS environment variable is set. This can cause unexpected results and should be avoided.
- Improved quoting of paths in conda commands.
- Fixed various issues with checkpoints.
- Improved error messages when combining groups with cluster config.
- Fixed bugs in group implementation.
- Fixed singularity in combination with shadow.
[5.4.0] - 2018-12-18¶
[5.3.1] - 2018-12-06¶
- Various fixed bugs and papercuts, e.g., in group handling, kubernetes execution, singularity support, wrapper and script usage, benchmarking, schema validation.
[5.3.0] - 2018-09-18¶
- Fixed bug in script and wrapper execution when using
- Add host argument to S3RemoteProvider.
- Various minor bug fixes.
[5.2.4] - 2018-09-10¶
- New command line flag –shadow-prefix
- Fixed permission issue when using the script directive. This is a breaking change for scripts referring to files relative to the script directory (see the docs).
- Fixed various minor bugs and papercuts.
- Allow URL to local git repo with wrapper directive
[5.2.2] - 2018-08-01¶
- Always print timestamps, removed the –timestamps CLI option.
- more robust detection of conda command
- Fixed bug in RMarkdown script execution.
- Fixed a bug in detection of group jobs.
[5.2.0] - 2018-06-28¶
- Directory outputs have to marked with
directory. This ensures proper handling of timestamps and cleanup. This is a breaking change. Implemented by Rasmus Ågren.
- Fixed kubernetes tests, fixed kubernetes volume handling. Implemented by Andrew Schriefer.
- jinja2 and networkx are not optional dependencies when installing via pip.
- When conda or singularity directives are used and the corresponding CLI flags are not specified, the user is notified at the beginning of the log output.
- Fixed numerous small bugs and papercuts and extended documentation.
[5.1.5] - 2018-06-24¶
- fixed missing version info in docker image.
- several minor fixes to EGA support.
[5.1.4] - 2018-05-28¶
categoryto be set.
- Various cosmetic changes to reports.
- Fixed encoding issues in reports.
[5.1.3] - 2018-05-22¶
- Fixed various bugs in job groups, shadow directive, singularity directive, and more.
[5.1.0] - 2018-05-17¶
- A new framework for self-contained HTML reports, including results, statistics and topology information. In future releases this will be further extended.
- A new utility snakemake.utils.validate() which allows to validate config and pandas data frames using JSON schemas.
- Two new flags –cleanup-shadow and –cleanup-conda to clean up old unused conda and shadow data.
- Benchmark repeats are now specified inside the workflow via a new flag repeat().
- Command line interface help has been refactored into groups for better readability.
[5.0.0] - 2018-05-11¶
- Group jobs for reduced queuing and network overhead, in particular with short running jobs.
- Output files can be marked as pipes, such that producing and consuming job are executed simultaneously and interfomation is transferred directly without using disk.
- Command line flags to clean output files.
- Command line flag to list files in working directory that are not tracked by Snakemake.
- Fix of –default-remote-prefix in case of input functions returning lists or dicts.
- Scheduler no longer prefers jobs with many downstream jobs.
[4.8.1] - 2018-04-25¶
- Allow URLs for the conda directive. # Changed
- Various minor updates in the docs.
- Several bug fixes with remote file handling.
- Fix ImportError occuring with script directive.
- Use latest singularity.
- Improved caching for file existence checks. We first check existence of parent directories and cache these results. By this, large parts of the generated FS tree can be pruned if files are not yet present. If files are present, the overhead is minimal, since the checks for the parents are cached.
- Various minor bug fixes.
[4.8.0] - 2018-03-13¶
- Integration with CWL: the
cwldirective allows to use CWL tool definitions in addition to shell commands or Snakemake wrappers.
- A global
singularitydirective allows to define a global singularity container to be used for all rules that don’t specify their own.
- Singularity and Conda can now be combined. This can be used to specify the operating system (via singularity), and the software stack (via conda), without the overhead of creating specialized container images for workflows or tasks.
[4.7.0] - 2018-02-19¶
- Speedups when calculating dry-runs.
- Speedups for workflows with many rules when calculating the DAG.
- Accept SIGTERM to gracefully finish all running jobs and exit.
- Various minor bug fixes.
[4.6.0] - 2018-02-06¶
- Log files can now be used as input files for other rules.
- Adapted to changes in Kubernetes client API.
- Fixed minor issues in –archive option.
- Search path order in scripts was changed to fix a bug with leaked packages from root env when using script directive together with conda.
[4.5.1] - 2018-02-01¶
- Input and output files can now tag pathlib objects. # ## Changed
- Various minor bug fixes.
[4.5.0] - 2018-01-18¶
- iRODS remote provider # ## Changed
- Bug fix in shell usage of scripts and wrappers.
- Bug fixes for cluster execution, –immediate-submit and subworkflows.
[4.4.0] - 2017-12-21¶
- A new shadow mode (minimal) that only symlinks input files has been added.
- The default shell is now bash on linux and macOS. If bash is not installed, we fall back to sh. Previously, Snakemake used the default shell of the user, which defeats the purpose of portability. If the developer decides so, the shell can be always overwritten using shell.executable().
- Snakemake now requires Singularity 2.4.1 at least (only when running with –use-singularity).
- HTTP remote provider no longer automatically unpacks gzipped files.
- Fixed various smaller bugs.
[4.3.1] - 2017-11-16¶
- List all conda environments with their location on disk via –list-conda-envs.
- Do not clean up shadow on dry-run.
- Allow R wrappers.
[4.3.0] - 2017-10-27¶
- GridFTP remote provider. This is a specialization of the GFAL remote provider that uses globus-url-copy to download or upload files. # ## Changed
- Scheduling and execution mechanisms have undergone a major revision that removes several potential (but rare) deadlocks.
- Several bugs and corner cases of the singularity support have been fixed.
- Snakemake now requires singularity 2.4 at least.
[4.2.0] - 2017-10-10¶
- Support for executing jobs in per-rule singularity images. This is meant as an alternative to the conda directive (see docs), providing even more guarantees for reproducibility.
- In cluster mode, jobs that are still running after Snakemake has been killed are automatically resumed.
- Various fixes to GFAL remote provider.
- Fixed –summary and –list-code-changes.
- Many other small bug fixes.
[4.1.0] - 2017-09-26¶
- Support for configuration profiles. Profiles allow to specify default options, e.g., a cluster submission command. They can be used via ‘snakemake –profile myprofile’. See the docs for details.
- GFAL remote provider. This allows to use GridFTP, SRM and any other protocol supported by GFAL for remote input and output files.
- Added –cluster-status flag that allows to specify a command that returns jobs status. # ## Changed
- The scheduler now tries to get rid of the largest temp files first.
- The Docker image used for kubernetes support can now be configured at the command line.
- Rate-limiting for cluster interaction has been unified.
- S3 remote provider uses boto3.
- Resource functions can now use an additional
attemptparameter, that contains the number of times this job has already been tried.
- Various minor fixes.
[4.0.0] - 2017-07-24¶
- Cloud computing support via Kubernetes. Snakemake workflows can be executed transparently in the cloud, while storing input and output files within the cloud storage (e.g. S3 or Google Storage). I.e., this feature does not need a shared filesystem between the cloud notes, and thereby makes the setup really simple.
- WebDAV remote file support: Snakemake can now read and write from WebDAV. Hence, it can now, e.g., interact with Nextcloud or Owncloud.
- Support for default remote providers: define a remote provider to implicitly use for all input and output files.
- Added an option to only create conda environments instead of executing the workflow. # ## Changed
- The number of files used for the metadata tracking of Snakemake (e.g., code, params, input changes) in the .snakemake directory has been reduced by a factor of 10, which should help with NFS and IO bottlenecks. This is a breaking change in the sense that Snakemake 4.x won’t see the metadata of workflows executed with Snakemake 3.x. However, old metadata won’t be overwritten, so that you can always go back and check things by installing an older version of Snakemake again.
- The google storage (GS) remote provider has been changed to use the google SDK. This is a breaking change, since the remote provider invocation has been simplified (see docs).
- Due to WebDAV support (which uses asyncio), Snakemake now requires Python 3.5 at least.
- Various minor bug fixes (e.g. for dynamic output files).
[3.13.3] - 2017-06-23¶
- Fix a followup bug in Namedlist where a single item was not returned as string.
[3.13.2] - 2017-06-20¶
- The –wrapper-prefix flag now also affects where the corresponding environment definition is fetched from.
- Fix bug where empty output file list was recognized as containing duplicates (issue #574).
[3.13.1] - 2017-06-20¶
- Fix –conda-prefix to be passed to all jobs.
- Fix cleanup issue with scripts that fail to download.
[3.13.0] - 2017-06-12¶
- An NCBI remote provider. By this, you can seamlessly integrate any NCBI resouce (reference genome, gene/protein sequences, …) as input file. # ## Changed
- Snakemake now detects if automatically generated conda environments have to be recreated because the workflow has been moved to a new path.
- Remote functionality has been made more robust, in particular to avoid race conditions.
--configparameter evaluation has been fixed for non-string types.
- The Snakemake docker container is now based on the official debian image.
[3.12.0] - 2017-05-09¶
- Support for RMarkdown (.Rmd) in script directives.
- New option –debug-dag that prints all decisions while building the DAG of jobs. This helps to debug problems like cycles or unexpected MissingInputExceptions.
- New option –conda-prefix to specify the place where conda environments are stored.
- Benchmark files now also include the maximal RSS and VMS size of the Snakemake process and all sub processes.
- Speedup conda environment creation.
- Allow specification of DRMAA log dir.
- Pass cluster config to subworkflow.
[3.11.2] - 2017-03-15¶
- Fixed fix handling of local URIs with the wrapper directive.
[3.11.1] - 2017-03-14¶
- –touch ignores missing files
- Fixed handling of local URIs with the wrapper directive.
[3.11.0] - 2017-03-08¶
- Param functions can now also refer to threads. # ## Changed
- Improved tutorial and docs.
- Made conda integration more robust.
- None is converted to NULL in R scripts.
[3.10.2] - 2017-02-28¶
- Improved config file handling and merging.
- Output files can be referred in params functions (i.e. lambda wildcards, output: …)
- Improved conda-environment creation.
- Jobs are cached, leading to reduced memory footprint.
- Fixed subworkflow handling in input functions.
[3.10.0] - 2017-01-18¶
- Workflows can now be archived to a tarball with
snakemake --archive my-workflow.tar.gz. The archive contains all input files, source code versioned with git and all software packages that are defined via conda environments. Hence, the archive allows to fully reproduce a workflow on a different machine. Such an archive can be uploaded to Zenodo, such that your workflow is secured in a self-contained, executable way for the future. # ## Changed
- Improved logging.
- Reduced memory footprint.
- Added a flag to automatically unpack the output of input functions.
- Improved handling of HTTP redirects with remote files.
- Improved exception handling with DRMAA.
- Scripts referred by the script directive can now use locally defined external python modules.
[3.9.1] - 2016-12-23¶
- Jobs can be restarted upon failure (–restart-times). # ## Changed
- The docs have been restructured and improved. Now available under snakemake.readthedocs.org.
- Changes in scripts show up with –list-code-changes.
- Duplicate output files now cause an error.
- Various bug fixes.
[3.9.0] - 2016-11-15¶
- Ability to define isolated conda software environments (YAML) per rule. Environments will be deployed by Snakemake upon workflow execution.
- Command line argument –wrapper-prefix in order to overwrite the default URL for looking up wrapper scripts. # ## Changed
- –summary now displays the log files correspoding to each output file.
- Fixed hangups when using run directive and a large number of jobs
- Fixed pickling errors with anonymous rules and run directive.
- Various small bug fixes
[3.8.2] - 2016-09-23¶
- Add missing import in rules.py.
- Use threading only in cluster jobs.
[3.8.1] - 2016-09-14¶
- Snakemake now warns when using relative paths starting with “./”.
- The option -R now also accepts an empty list of arguments.
- Bug fix when handling benchmark directive.
- Jobscripts exit with code 1 in case of failure. This should improve the error messages of cluster system.
- Fixed a bug in SFTP remote provider.
[3.8.0] - 2016-08-26¶
- Wildcards can now be constrained by rule and globally via the new
wildcard_constraintsdirective (see the docs).
- Subworkflows now allow to overwrite their config file via the configfile directive in the calling Snakefile.
- A method
log_fmt_shellin the snakemake proxy object that is available in scripts and wrappers allows to obtain a formatted string to redirect logging output from STDOUT or STDERR.
- Functions given to resources can now optionally contain an additional
inputthat refers to the input files.
- Functions given to params can now optionally contain additional
input(see above) and
resources. The latter refers to the resources.
- It is now possible to let items in shell commands be automatically quoted (see the docs). This is usefull when dealing with filenames that contain whitespaces.
- Snakemake now deletes output files before job exection. Further, it touches output files after job execution. This solves various problems with slow NFS filesystems.
- A bug was fixed that caused dynamic output rules to be executed multiple times when forcing their execution with -R.
- A bug causing double uploads with remote files was fixed. Various additional bug fixes related to remote files.
- Various minor bug fixes.
[3.7.1] - 2016-05-16¶
- Fixed a missing import of the multiprocessing module.
[3.7.0] - 2016-05-05¶
- The entries in
threadsjob attribute can now be callables that must return
--cluster-configarguments can be given to the Snakemake command line. Later one override earlier ones.
- In the API, multiple
cluster_configpaths can be given as a list, alternatively to the previous behaviour of expecting one string for this parameter.
- When submitting cluster jobs (either through
--drmaa), you can now use
--max-jobs-per-secondto limit the number of jobs being submitted (also available through Snakemake API). Some cluster installations have problems with too many jobs per second.
- Wildcard values are now printed upon job execution in addition to input and output files. # ## Changed
- Fixed a bug with HTTP remote providers.
[3.6.1] - 2016-04-08¶
- Work around missing RecursionError in Python < 3.5
- Improved conversion of numpy and pandas data structures to R scripts.
- Fixed locking of working directory.
[3.6.0] - 2016-03-10¶
- onstart handler, that allows to add code that shall be only executed before the actual workflow execution (not on dryrun).
- Parameters defined in the cluster config file are now accessible in the job properties under the key “cluster”.
- The wrapper directive can be considered stable. # ## Changed
- Allow to use rule/job parameters with braces notation in cluster config.
- Show a proper error message in case of recursion errors.
- Remove non-empty temp dirs.
- Don’t set the process group of Snakemake in order to allow kill signals from parent processes to be propagated.
- Fixed various corner case bugs.
- The params directive no longer converts a list
[3.5.5] - 2016-01-23¶
- New experimental wrapper directive, which allows to refer to re-usable wrapper scripts. Wrappers are provided in the Snakemake Wrapper Repository.
- David Koppstein implemented two new command line options to constrain the execution of the DAG of job to sub-DAGs (–until and –omit-from). # ## Changed
- Fixed various bugs, e.g. with shadow jobs and –latency-wait.
[3.5.4] - 2015-12-04¶
- The params directive now fully supports non-string parameters. Several bugs in the remote support were fixed.
[3.5.3] - 2015-11-24¶
- The missing remote module was added to the package.
[3.5.2] - 2015-11-24¶
- Support for easy integration of external R and Python scripts via the new script directive.
- Chris Tomkins-Tinch has implemented support for remote files: Snakemake can now handle input and output files from Amazon S3, Google Storage, FTP, SFTP, HTTP and Dropbox.
- Simon Ye has implemented support for sandboxing jobs with shadow rules.
- Manuel Holtgrewe has fixed dynamic output files in combination with multiple wildcards.
- It is now possible to add suffixes to all shell commands with shell.suffix(“mysuffix”).
- Job execution has been refactored to spawn processes only when necessary, resolving several problems in combination with huge workflows consisting of thousands of jobs and reducing the memory footprint.
- In order to reflect the new collaborative development model, Snakemake has moved from my personal bitbucket account to http://snakemake.bitbucket.org.
[3.4.2] - 2015-09-12¶
- Willem Ligtenberg has reduced the memory usage of Snakemake.
- Per Unneberg has improved config file handling to provide a more intuitive overwrite behavior.
- Simon Ye has improved the test suite of Snakemake and helped with setting up continuous integration via Codeship.
- The cluster implementation has been rewritten to use only a single thread to wait for jobs. This avoids failures with large numbers of jobs.
- Benchmarks are now writing tab-delimited text files instead of JSON.
- Snakemake now always requires to set the number of jobs with -j when in cluster mode. Set this to a high value if your cluster does not have restrictions.
- The Snakemake Conda package has been moved to the bioconda channel.
- The handling of Symlinks was improved, which made a switch to Python 3.3 as the minimum required Python version necessary.
[3.4.1] - 2015-08-05¶
- This release fixes a bug that caused named input or output files to always be returned as lists instead of single files.
[3.4] - 2015-07-18¶
- This release adds support for executing jobs on clusters in synchronous mode (e.g. qsub -sync). Thanks to David Alexander for implementing this.
- There is now vim syntax highlighting support (thanks to Jay Hesselberth).
- Snakemake is now available as Conda package.