Skip to content

Track 1 (ML): Cerebras modelzoo ML models

Corresponds to models already present in version R1.6.0 of the Cerebras modelzoo ML models software.

Good Match Criteria: You would be a good match for this track if your research already uses, or could potentially use, any of the following models, supported via TensorFlow and PyTorch:

Model Layer Pipeline mode Weight Streaming mode
BERT TensorFlow code
PyTorch code
-
BERT (fine-tuning) Classifier TensorFlow code
PyTorch code
-
BERT (fine-tuning) Named Entity Recognition TensorFlow code
PyTorch code
-
BERT (fine-tuning) Summarization TensorFlow code
PyTorch code
-
BERT (fine-tuning) Question Answering TensorFlow code
PyTorch code
-
GPT-2 TensorFlow code
PyTorch code
TensorFlow code
GPT-3 - TensorFlow code
GPT-J - TensorFlow code
RoBERTa TensorFlow code
PyTorch code
-
T5 TensorFlow code
PyTorch code
-
Transformer TensorFlow code
PyTorch code
-
MNIST (fully connected) TensorFlow code
PyTorch code
-
2D UNet (experimental) TensorFlow code -

Based on the Cerebras modelzoo R1.6.0 GitHub page.

Track Specific Questions

If your project falls under this category, make sure to address the following questions in your application document:

  1. Please, indicate which model(s) from the modelzoo you intend to use. Do you anticipate being interested in adjusting the model architecture?
  2. Please, describe the dataset you are intending to use.
  3. How big is the dataset of interest (total dataset size, number of samples, and sample size in MB)?
  4. Please, elaborate on the readiness of the dataset of interest. Is it fully available at this time? If not, how soon would it be fully available?
  5. Please specify the shapes of the input and output tensors for your model/s.
  6. If possible, please specify the name of the dimensions for your input and output tensors from the previous question. E.g. (batch, input channels, height, width)
  7. Please specify the loss function that you would like to use.
  8. Please, list the libraries complementary to standard PyTorch and/or TensorFlow distributions that you would need to train your model(s).
  9. Please, list the key libraries that you would need for data preprocessing.