×

nf-core/metacache/build @ 0.0.0-0c7146d

Taxonomic profiling database building with MetaCache

Latest version: 0.0.0-0c7146d
Total downloads: 1
Source: nf-core/modules
Authors: @Gullumluvl
Maintainers: @Gullumluvl

Summary

Taxonomic profiling database building with MetaCache

Get started

Add the following snippet to your workflow script to include this module.

include { METACACHE_BUILD } from 'nf-core/metacache/build'

License

MIT License

Process
Name METACACHE_BUILD
Input 3 channels
#1 tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

genome_files file

(possibly gzipped) fasta or fastq files of full genomes, for example from an NCBI assembly

*.{fna,fa,fasta,fnq,fq,fastq}{,.gz}
taxonomy file

NCBI taxonomy formatted files nodes.dmp and names.dmp

{names,nodes,merged}.dmp
seq2taxid file

NCBI-style 'accession2taxid' tab-separated file with 3 or 4 columns: accession, accession_version, taxid, and gid (optional)

*
Output 2 channels
#1 db tuple
meta map

Groovy Map containing sample information e.g. [ id:'sample1' ]

*.meta file

sequence signature database binary file

*.meta
*.cache* file

sequence signature database binary files

*.cache+([0-9])
#2 versions_metacache tuple
${task.process} string

The name of the process

metacache string

The name of the tool

metacache info |& sed -n 's/^MetaCache version \+\([0-9.]\+\).*\$/\1/p' eval

The expression to obtain the version of the tool

Tool Description Homepage
metacache MetaCache is a classification system for mapping genomic sequences (short reads, long reads, contigs, ...) from metagenomic samples to their most likely taxon of origin. It aims to reduce the memory requirement usually associated with k-mer based methods while retaining their speed. MetaCache uses locality sensitive hashing to quickly identify candidate regions within one or multiple reference genomes. A read is then classified based on the similarity to those regions. For an independent comparison to other tools in terms of classification accuracy see the LEMMI benchmarking site. The latest version of MetaCache classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs. https://muellan.github.io/metacache
Version 0.0.0-0c7146d
Commit ID 135d883a43b1a2324b4112cd270cf22e20835533
Release Date 08 Apr 2026 18:55:07 (UTC)
Download URL https://registry-stage.nextflow.io/api/v1/modules/nf-core%2Fmetacache%2Fbuild/0.0.0-0c7146d/download
OCI Store URL https://public.cr.stage-seqera.io/v2/nextflow/plugin-stage/modules/nf-core/metacache/build/blobs/sha256:5b167d506e4c7f6b121101db6bfb723a3c9e473f776dd55112ea644a3c075190
Size 4.0 KB
Checksum sha256:5b167d506e4c7f6b121101db6bfb723a3c9e473f776dd55112ea644a3c075190
Downloads 1
Version Date Status Downloads Size Diff
0.0.0-0c7146d 08 Apr 2026 18:55:07 (UTC) 1 4.0 KB -