Could you post your entire runscript here. Additionally, I would read these two links:
- Is it possible (how?) to run containerized CMAQ across multiple nodes using Azure CycleCloud & Slurm?
- CMAQ/DOCS/Users_Guide/Appendix/CMAQ_UG_appendixD_parallel_implementation.md at main · USEPA/CMAQ · GitHub
Note that picking a domain decomposition should be done carefully. In your case, each processor would be responsible for (90 columns x ~ 2-3 rows). Is there a reason you did it this way, instead of balancing the rows/columns per processor?
Also when you say it “fails” are there any messages in the main and ancillary log files (usually found at the very bottom)?