This benchmarking workflow can be used to assess performance of germline variant calling pipelines run on Genome in a bottle (GIAB) samples HGOO2, HG003, and HG004 (https://truwl.com/files/library/FC_ac361c.1). It is an implementation of the [GA4GH best practices](https://doi.org/10.1038/s41587-019-0054-x) that uses vcfeval as the comparison engine and 'genotype match' to calculate true positive, false positives, and false negatives. Additional statistics are generated for partial matches. The workflow allows for selection of 'competitors' and 'regions'. Competitor selection allows you to compare your VCF against the winning VCF submissions for the [PrecisionFDA Truth Challenge V2](https://www.biorxiv.org/content/10.1101/2020.11.13.380741v4). Region selection allows you to specify which genomic regions are used for the comparison. The workflow generates two html reports: a multiQC report generated using Bcftools and a notebook that contains upset plots, precision recall metrics, and indell size distribution plots generated using papermill. Metrics from each benchmarking run are also loaded into the Truwl Performance Metrics table which enables you to compare metrics across multiple benchmarking runs.