Tinker by Thinking Machines

11 hours ago

Copy Link

Tinker is a training API designed for researchers, offering control over model training and fine-tuning while managing infrastructure.
Key functions include forward_backward (performs forward and backward passes), optim_step (updates weights), sample (generates tokens), and save_state (saves training progress).
Uses LoRA for efficient fine-tuning by training a small add-on instead of altering all original weights, matching full fine-tuning performance with less compute.
Handles scheduling, tuning, resource management, and infrastructure reliability, orchestrating distributed training on GPU clusters.
Supports a range of open-source models, from compact to large, with plans to expand the selection.
Allows downloading model weights during and after training.
Initially free, with usage-based pricing to be introduced soon.

Hasty Briefsbeta