Life of a Model on Google Cloud TPU
Join us for a demo showcasing the power of Google Cloud TPU by walking through the complete lifecycle of a model, from post-training to inference at massive scale.We’ll show you how to post-train a model using new RL capabilities in MaxText and Tunix, then take that same model and serve it on TPU seamlessly with the new JAX backend in vLLM.Learn about how to rightsize a workload for TPU for both training and inference, optimizing for performance-per-dollar from beginning to end.
