Skip to content
This repository has been archived by the owner on Feb 25, 2022. It is now read-only.

GPT-3 configuration for a v3-32 TPU #183

Open
stefan-it opened this issue Mar 30, 2021 · 0 comments
Open

GPT-3 configuration for a v3-32 TPU #183

stefan-it opened this issue Mar 30, 2021 · 0 comments
Labels
documentation Improvements or additions to documentation.

Comments

@stefan-it
Copy link

Hi,

many thanks for releasing this GPT training code 馃憤

I just wanted to train a new model from scratch (with own vocab), so I was using the following configuration file

https://github.com/EleutherAI/gpt-neo/blob/master/configs/gpt3_small_256.json

However, I'm not 100% sure what to use for mesh_shape and layout, because I'm not using a 256 TPU pod, I'm using a v3-32 only.

Could you please provide some more information about how to use the correct values?

Many thanks in advance and best,

Stefan

@stefan-it stefan-it added the bug Something isn't working. label Mar 30, 2021
@StellaAthena StellaAthena added documentation Improvements or additions to documentation. and removed bug Something isn't working. labels Mar 31, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
documentation Improvements or additions to documentation.
Projects
None yet
Development

No branches or pull requests

2 participants