StarCoder

9 comments
StarCoder and StarCoderBase are Large Language Models for Code (Code LLMs) trained on permissively l0
Visit website

What is StarCoder?

StarCoderBase and StarCoder are Large Language Models (Code LLMs), trained on permissively-licensed data from GitHub. This includes data from 80+ programming language, Git commits and issues, Jupyter Notebooks, and Git commits.

We trained a 15B-parameter model for 1 trillion tokens, similar to LLaMA.

We refined the StarCoderBase for 35B Python tokens. The result is a new model we call StarCoder.


StarCoderBase is a model that outperforms other open Code LLMs in popular programming benchmarks. It also matches or exceeds closed models like code-cushman001 from OpenAI, the original Codex model which powered early versions GitHub Copilot. StarCoder models are able to process more input with a context length over 8,000 tokens than any other open LLM. This allows for a variety of interesting applications. By prompting the StarCoder model with a series dialogues, we allowed them to act like a technical assistant.


More information on StarCoder

Launched
2023
Pricing Model
Free
Starting Price
Global Rank
Country
Month Visit
<5k
Tech used
Amazon AWS CloudFront,cdnjs,Google Fonts,KaTeX,Gzip,OpenGraph,RSS,Stripe
StarCoder was manually vetted by our editorial team and was first featured on September 4th 2024.
Aitoolnet Featured banner

StarCoder Alternatives

Load more Alternatives
  1. Making our our text to SQL model 30 percentage points more accurate over 5 months

  2. DeciCoder 1B is a 1 billion parameter decoder-only code completion model trained on the Python, Java, and Javascript subsets of Starcoder Training Dataset.

  3. This product is designed to assist programmers with their daily work while also providing a great le

  4. Discover Code Llama, a cutting-edge AI tool for code generation and understanding. Boost productivity, streamline workflows, and empower developers.

  5. Enhance language models, improve performance, and get accurate results. WizardLM is the ultimate tool for coding, math, and NLP tasks.