smollora

278d87286a Remove broken in-place replacement code, use _tokenize_with_special_tokens only master Jinx 2026-04-10 17:35:09 +00:00
ca50973065 Fix tokenization: replace text token sequences with actual special token IDs Jinx 2026-04-10 17:28:51 +00:00
d3b5f04f88 Add embed_tokens to LoRA targets + token ID verification before training Jinx 2026-04-10 17:20:06 +00:00
f46995690c Update runbook for tool-call token training run Jinx 2026-04-10 17:14:57 +00:00
af497eb16c Add training plan: teach model to emit native tool-call tokens Jinx 2026-04-10 17:07:28 +00:00
d1e8c306e3 Add critical training objective: teach model to emit native tool-call tokens Jinx 2026-04-10 16:52:09 +00:00
eb0850bca6 Fix: correct source dataset tag names for tool_call/tool_response Jinx 2026-04-10 06:37:43 +00:00
99481ca127 Add HF_TOKEN env var Jinx 2026-04-10 06:33:09 +00:00
9e723d393e Fix: remove unsupported --model arg from prepare_data.py call Jinx 2026-04-10 06:32:55 +00:00
adbd85366b init commit Jinx 2026-04-10 06:24:05 +00:00
46a3ddbb25 Add deployment runbook Jinx 2026-04-10 05:28:30 +00:00
6af62c85d5 Add docker-compose with /srv mounts for persistent data Jinx 2026-04-10 05:21:04 +00:00
82348341b0 Initial LoRA training setup for SmolLM3-3B tool calling Jinx 2026-04-10 05:11:05 +00:00

Commit Graph Select branches Hide Pull Requests master Mono Color