NVIDIA Megatron-LM

A research-oriented framework for large language model training

About NVIDIA Megatron-LM

NVIDIA Megatron-LM is a research-oriented framework that uses the Megatron-Core library for large language model (LLM) training. Megatron-Core is a library of GPU-optimized training techniques with versioned APIs and regular releases. Megatron-LM can be used alongside Megatron-Core or NVIDIA.

Key Features

  • Leverages the Megatron-Core library for LLM training.
  • Supports large-scale pretraining.
  • Optimized for GPU training.
  • Versioned APIs and regular releases through Megatron-Core.

Use Cases

  • Language model training.
  • Natural language processing.
  • Text generation.
  • Pretraining large-scale models.
Added April 15, 2024