Models
StarCoder
- Paper: A technical report about StarCoder.
- GitHub: All you need to know about using or fine-tuning StarCoder.
- StarCoder: StarCoderBase further trained on Python.
- StarCoderBase: Trained on 80+ languages from The Stack.
- StarCoder+: StarCoderBase further trained on English web data.
- StarEncoder: Encoder model trained on TheStack.
- StarPii: StarEncoder based PII detector.
StarCoder Tools & Demos
- StarCoder Playground: Write with StarCoder Models!
- VSCode Extension: Code with StarCoder!
- StarChat: Chat with StarCoder!
- Tech Assistant Prompt: With this prompt you can turn StarCoder into tech assistant.
- StarCoder Editor: Edit with StarCoder!
StarCoder Data & Governance
- Governance Card: A card outlining the governance of the model.
- StarCoder License Agreement: The model is licensed under the BigCode OpenRAIL-M v1 license agreement.
- StarCoder Data: Pretraining dataset of StarCoder.
- StarCoder Search: Full-text search code in the pretraining dataset.
- StarCoder Membership Test: Blazing fast test if code was present in pretraining dataset.
SantaCoder
SantaCoder aka smol StarCoder: same architecture but only trained on Python, Java, JavaScript.
- SantaCoder: SantaCoder Model.
- SantaCoder Demo: Write with SantaCoder.
- SantaCoder Search: Search code in the pretraining dataset.
- SantaCoder License: The OpenRAIL license for SantaCoder.