Running Custom Inputs on SMOKE

@wong.david-c @cjcoats @cgnolte @lizadams @tlspero @bbaek @Akhila @hogrefe.christian

Hello!

I’m new to using the SMOKE (Sparse Matrix Operator Kernel Emissions) model and just got started recently. I was able to successfully run the example test cases by following the instructions on the SMOKE installation wiki.

The test case I used had inputs for biogenics, pt_oil, and np_oil. I got it from a GitHub repository that seems to have been taken down or made private now. I’m now looking to move beyond the test cases and want to understand the full pipeline better.

I’m currently confused about the following:

  1. What are the required input datasets to run SMOKE with custom real-world data, especially for biogenic?
  2. Where can I download the datasets required (e.g., land use, meteorology, inventory data)?
  3. How do I generate or obtain MCIP outputs to feed into SMOKE?
  4. What are the steps to go from raw data to SMOKE-ready inputs?

It would be really helpful if anyone could tell me what are the requisites and could provide me with the links to download them in order to run it.

Thank you in advance.

Hello,

I suggest familiarizing yourself with one of the recent EPA emissions platforms to get started. The 2022v1 EMP has detailed technical documentation, input data, and run scripts (A.1. EPA's 2022v1 Emissions Modeling Platform · CEMPD/SMOKE Wiki · GitHub). The EQUATES project is also well documented and provides emissions data over multiple years (2002–2017 anthropogenic emissions data for air quality modeling over the United States - ScienceDirect).

As you have noted from the test case, platforms are divided into source sectors. All sectors require different inputs containing information related to spatial allocation, temporal allocation, speciation, etc. The formats of these files and which of the files are required can vary by sector. These complexities make it difficult to answer all of your questions in a direct, concise way.

Do you have a specific dataset that you would like to use in the preparation of model-ready emissions? We can give more detailed technical assistance if you describe the dataset and your goals in a separate post.

1 Like

Our SMOKE-ExampleCase is still available for public and has never been taken down. Please give it another try and let me know if you still have issue with accessing it.