Hasty Briefsbeta

Bilingual

Training Qwen 4B to Beat Large Models on Work Tasks

3 months ago
  • #AI
  • #Machine Learning
  • #Salesforce
  • Neurometric focuses on auto-generating Small Language Models (SLMs) for specific tasks.
  • CRMArena benchmark tests models on realistic Salesforce CRM tasks like lead qualification and activity prioritization.
  • Fine-tuned a 4B parameter Qwen model to outperform larger models on CRM tasks with 95% accuracy.
  • Initial attempts to teach SLMs to generate SQL queries were rough but improved with expanded training data.
  • Phase II involved direct answer generation using the BANT framework, achieving an evaluation score of 0.825.
  • Key takeaways: SLMs can outperform larger models with task-specific fine-tuning, synthetic data has quality challenges, and constrained answer spaces improve results.