Getting LINC to work on a slurm cluster
Hi, I'm having trouble getting the LINC pipeline to run on the OzSTAR cluster in Australia, which uses slurm. I cannot seem to track down exactly what the root of the problem is as the log file error messages are rather cryptic to me. I noticed that if I grep "error" in the *.err.log files in the work or temporary directory, all the files say that the job was cancelled due to slurm time limit. I'm surprised because the head job failed before reaching the time limit I allocated in the batch script. I wonder if this is related to the root cause of the issue? The job ran for about 3.5 hours before failing. I have attached the batch script as well as the log file. This is for the calibrator pipeline by the way (have yet to proceed to target).slurm-41973723.out [cal_linc.sh](/uploads/5282dc41ef90f3c3a8d3b6d0961a616b/cal_linc.sh
I would appreciate any guidance on how to proceed here - I'm eager to process some DDT data:)
Many thanks, Kelly