[FEAT]: Add cerebras integration for superfast inference #5605

@emanu3lj

Description

What would you like to see?

Cerebras offers fast inference, up to 2k tokens per second.

https://www.cerebras.ai/inference
