HaplotypeCallerSpark

Ambarish_Kumar · November 5, 2020, 7:28am

I am getting error while running CWL tool for HaplotypecallerSpark. Error is as follows.

‘’’
ERROR [job HaplotypeCallerSparkNew.cwl] Job error:
(“Error collecting output for parameter ‘rawVCF’:\nHaplotypeCallerSparkNew.cwl:37:5: Did not find output file with glob pattern: ‘[‘en.vcf’]’”, {})

‘’’

mrc · November 5, 2020, 7:52am

Hello @Ambarish_Kumar. Can you link or paste in the source for HaplotypeCallerSparkNew.cwl and how you are running it? It won’t be possible for anyone to assist you without that. Thanks!

Ambarish_Kumar · November 5, 2020, 9:02am

This is the web-link for the tool.

github.com

ambarishK/bio-cwl-tools/blob/530e671c113e91fe6b9114eda32103bb980450c7/HaplotypeCallerSparkNew.cwl

#!/usr/bin/env cwl-runner
# This tool description was generated automatically by wdl2cwl ver. 0.2

class: CommandLineTool
cwlVersion: v1.0

requirements:
- class: ShellCommandRequirement
- class: InlineJavascriptRequirement
- class: InitialWorkDirRequirement
  listing: 
    - $(inputs.Reference2bitGenome)
- class: DockerRequirement
  dockerPull: quay.io/biocontainers/gatk4:4.1.6.0--py38_0 

inputs:

- id: inputBAM
  type: File
- id: Reference2bitGenome

This file has been truncated. show original

I am running it over command-line.

‘’’
sudo cwltool HaplotypeCallerSparkNew.cwl --inputBAM ebolamutantmarkdup.bam --Reference2bitGenome ebola1.2bit --RefIndex ebola.fasta.fai --sampleName eb --RefDict ebola.dict --BAMindex ebola.bam.bai --output_filename en.vcf

‘’’
Input files are contained into following sub-directory.

‘’’

Ambarish_Kumar · November 5, 2020, 2:50pm

I am trying with different outputbinding parameters but error persists.

alexiswl · November 11, 2020, 4:45am

I don’t think this command is running as you expect it:

arguments:
- valueFrom: |-
    gatk HaplotypeCallerSpark -R $(inputs.Reference2bitGenome.path) -I $(inputs.inputBAM.path) -O $(inputs.output_filename)
  shellQuote: false

Will actually run as “gatk HaplotypeCallerSpark -R …” rather than “gatk” “HaplotypeCallerSpark” “-R” …

I would consider using the ‘inputBinding’ for each of your inputs and a ‘baseCommand’ set to ['gatk', 'HaplotypeCallerSpark']

For the reference, you have put it in as a listing (can you justify why this is necessary), but have specified the .path attribute. If you wish to reference the file that you have listing in the working directory, you should use the .basename attribute instead.

Alexis