Track 1 (ML): Cerebras modelzoo ML models¶

Corresponds to models already present in version R1.6.0 of the Cerebras modelzoo ML models software.

Good Match Criteria: You would be a good match for this track if your research already uses, or could potentially use, any of the following models, supported via TensorFlow and PyTorch:

Model	Layer Pipeline mode	Weight Streaming mode
BERT	TensorFlow code PyTorch code	-
BERT (fine-tuning) Classifier	TensorFlow code PyTorch code	-
BERT (fine-tuning) Named Entity Recognition	TensorFlow code PyTorch code	-
BERT (fine-tuning) Summarization	TensorFlow code PyTorch code	-
BERT (fine-tuning) Question Answering	TensorFlow code PyTorch code	-
GPT-2	TensorFlow code PyTorch code	TensorFlow code
GPT-3	-	TensorFlow code
GPT-J	-	TensorFlow code
RoBERTa	TensorFlow code PyTorch code	-
T5	TensorFlow code PyTorch code	-
Transformer	TensorFlow code PyTorch code	-
MNIST (fully connected)	TensorFlow code PyTorch code	-
2D UNet (experimental)	TensorFlow code	-

Based on the Cerebras modelzoo R1.6.0 GitHub page.

Track Specific Questions¶

If your project falls under this category, make sure to address the following questions in your application document:

Please, indicate which model(s) from the modelzoo you intend to use. Do you anticipate being interested in adjusting the model architecture?
Please, describe the dataset you are intending to use.
How big is the dataset of interest (total dataset size, number of samples, and sample size in MB)?
Please, elaborate on the readiness of the dataset of interest. Is it fully available at this time? If not, how soon would it be fully available?
Please specify the shapes of the input and output tensors for your model/s.
If possible, please specify the name of the dimensions for your input and output tensors from the previous question. E.g. (batch, input channels, height, width)
Please specify the loss function that you would like to use.
Please, list the libraries complementary to standard PyTorch and/or TensorFlow distributions that you would need to train your model(s).
Please, list the key libraries that you would need for data preprocessing.