Inferable

The managed LLM-engineering platform for production-ready AI applications.

What is Inferable?

Inferable is a fully managed platform that handles state, reliability, and orchestration for custom LLM-based applications. It is developer-first and API-driven, and provides production-ready LLM primitives for building them.

⚡️ Quick Start

Follow the quick start guide to get started with Inferable.

🔑 Key Features

Here are some of the key features of Inferable.

📦 Workflows that execute in your own infrastructure

Workflows execute in your own infrastructure, even behind firewalls or private VPCs. No deployment step is required. We use long polling to connect to your infrastructure, so there is no need to open any inbound ports.

import { z } from "zod";

// `inferable` is assumed to be an already-initialized Inferable client.
const workflow = inferable.workflows.create({
  name: "simple",
  inputSchema: z.object({
    executionId: z.string(),
    greeting: z.string(),
  }),
});
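To illustrate why long polling needs no inbound ports, here is a minimal sketch of the pattern. It is not the Inferable SDK: `FakeControlPlane`, `Job`, and `runWorker` are hypothetical names, and the control plane is simulated in memory. The point is only that the worker always initiates the call and waits for work to come back.

```typescript
// Hypothetical in-memory stand-in for a control plane; not the Inferable API.
type Job = { id: string; payload: string };

class FakeControlPlane {
  private queue: Job[] = [];
  private waiters: Array<(job: Job | null) => void> = [];

  // Platform side: hand the job to a waiting poll, or queue it.
  enqueue(job: Job): void {
    const waiter = this.waiters.shift();
    if (waiter) waiter(job);
    else this.queue.push(job);
  }

  // Worker side: resolves with a job, or with null once timeoutMs elapses.
  poll(timeoutMs: number): Promise<Job | null> {
    const queued = this.queue.shift();
    if (queued) return Promise.resolve(queued);
    return new Promise((resolve) => {
      let timer: ReturnType<typeof setTimeout> | undefined;
      const waiter = (job: Job | null) => {
        if (timer !== undefined) clearTimeout(timer);
        resolve(job);
      };
      timer = setTimeout(() => {
        // Timed out: stop waiting and let the worker poll again.
        this.waiters = this.waiters.filter((w) => w !== waiter);
        resolve(null);
      }, timeoutMs);
      this.waiters.push(waiter);
    });
  }
}

// The worker only makes outbound calls; nothing ever connects in to it.
async function runWorker(
  controlPlane: FakeControlPlane,
  handle: (job: Job) => void,
  maxPolls: number,
): Promise<void> {
  for (let i = 0; i < maxPolls; i++) {
    const job = await controlPlane.poll(50);
    if (job) handle(job); // null means the poll timed out
  }
}
```

In a real deployment the `poll` call would be an outbound HTTPS request held open by the server, which is why a firewall that blocks inbound traffic does not get in the way.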

🔄 Versioned Workflows for backward compatibility

When you need to change the input schema or the logic of a workflow, you can create a new version of the workflow. Inferable will maintain version affinity for currently executing workflows, so you can roll out new versions gradually. See Workflows.

workflow.version(1).define(async (ctx, input) => {
  // ...
});

workflow.version(2).define(async (ctx, input) => {
  // ...
});
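As a rough sketch of what version affinity means, the example below pins each execution to the newest version that existed when it started. This is an assumption about the general technique, not a description of Inferable's internals; `VersionedWorkflow`, `start`, and `step` are hypothetical names.

```typescript
type Handler = (input: string) => string;

class VersionedWorkflow {
  private handlers: Record<number, Handler> = {};
  private pinned: Record<string, number> = {}; // executionId -> version

  define(version: number, handler: Handler): void {
    this.handlers[version] = handler;
  }

  // A new execution is pinned to the highest version defined at start time.
  start(executionId: string): number {
    const versions = Object.keys(this.handlers).map(Number);
    if (versions.length === 0) throw new Error("no versions defined");
    const latest = Math.max(...versions);
    this.pinned[executionId] = latest;
    return latest;
  }

  // Later steps keep using the pinned version, even if newer ones now exist.
  step(executionId: string, input: string): string {
    const version = this.pinned[executionId];
    if (version === undefined) throw new Error(`unknown execution ${executionId}`);
    return this.handlers[version](input);
  }
}
```

An execution started before version 2 is defined keeps running version 1 logic, while executions started afterwards pick up version 2.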

🏗️ Structured Outputs with automatic parsing, validation, and retries

Inferable automatically parses and validates structured outputs, and retries failed executions. See Structured Outputs.

workflow.version(1).define(async (ctx, input) => {
  // Assumes ctx.llm.structured returns a promise; await it before destructuring.
  const { ticketType } = await ctx.llm.structured({
    input: `Ticket text: ${input.ticketText}`,
    schema: z.object({
      ticketType: z.enum(["data-deletion", "refund", "other"]),
    }),
  });

  // do something with the ticket type
  console.log(ticketType);
});
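The parse, validate, and retry cycle can be sketched without any dependencies. The code below is an illustration of the general idea, not how Inferable implements it: `parseTicket` and `structuredWithRetries` are hypothetical helpers, and `callModel` stands in for an LLM call.

```typescript
type Ticket = { ticketType: "data-deletion" | "refund" | "other" };

const TICKET_TYPES = ["data-deletion", "refund", "other"];

// Returns a typed value if raw is valid JSON with an allowed ticketType, else null.
function parseTicket(raw: string): Ticket | null {
  try {
    const value = JSON.parse(raw);
    if (
      value !== null &&
      typeof value === "object" &&
      TICKET_TYPES.indexOf(value.ticketType) !== -1
    ) {
      return { ticketType: value.ticketType };
    }
  } catch {
    // Not JSON at all; treated the same as a failed validation.
  }
  return null;
}

// Calls the model until its output validates, up to maxAttempts times.
function structuredWithRetries(
  callModel: (attempt: number) => string,
  maxAttempts: number,
): Ticket {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const parsed = parseTicket(callModel(attempt));
    if (parsed) return parsed;
  }
  throw new Error(`no valid output after ${maxAttempts} attempts`);
}
```

The caller only ever sees a validated `Ticket` or an error, which is the property that makes structured outputs safe to branch on.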

🧑‍💼 Human-in-the-Loop with approval workflows

Inferable allows you to integrate human approval and intervention with full context preservation. See Human-in-the-Loop.

deleteUserWorkflow.version(1).define(async (ctx, input) => {
  // ... existing workflow code ...

  if (!ctx.approved) {
    return Interrupt.approval({
      message: `I need your approval to delete the user ${input.userId}. Is this ok?`,
      destination: {
        type: "email",
        // The email address to notify
        email: "test@example.com",
      },
    });
  }

  await db.customers.delete({
    userId: input.userId,
  });
});
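One way to picture the approval flow is as a handler that is run twice with the same input: once to raise the interrupt, and again after a human approves. The sketch below assumes that model; `runWithApproval`, `askHuman`, and the result types are hypothetical and not part of the Inferable SDK.

```typescript
type ApprovalInterrupt = { kind: "approval"; message: string };
type Done = { kind: "done"; deletedUserId: string };
type Context = { approved: boolean };
type Input = { userId: string };

// Mirrors the workflow above: interrupt first, delete only once approved.
function deleteUser(ctx: Context, input: Input, deleted: string[]): ApprovalInterrupt | Done {
  if (!ctx.approved) {
    return { kind: "approval", message: `Approve deleting user ${input.userId}?` };
  }
  deleted.push(input.userId); // stands in for the database delete
  return { kind: "done", deletedUserId: input.userId };
}

// Hypothetical runner: pauses on the interrupt, then resumes with the same input.
function runWithApproval(
  input: Input,
  askHuman: (message: string) => boolean,
  deleted: string[],
): Done | null {
  const first = deleteUser({ approved: false }, input, deleted);
  if (first.kind === "done") return first;
  if (!askHuman(first.message)) return null; // rejected: nothing is deleted
  const second = deleteUser({ approved: true }, input, deleted);
  return second.kind === "done" ? second : null;
}
```

Because the input is carried across the pause, the resumed run has the same context as the run that asked for approval.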

🤖 Agents with Tool Use

Inferable agents can use tools to achieve pre-defined goals. See Agents.

const agentInstructions = `
  Evaluate the provided support ticket body and extract the user from the database.

  When searching for users, if you don't get specific results, try to search with a more general term with sub strings with unique nouns.
  For example, "John Smith": searchUser("John Smith"), searchUser("John"), searchUser("Smith"), etc.
`;

workflow.tools.register({
  name: "searchUser",
  schema: z.object({
    userId: z.string(),
  }),
  handler: async (ctx, input) => {
    // your own code to search for the user
  },
});

workflow.version(1).define(async (ctx, input) => {
  const { userId } = await ctx.llm.agents.react({
    name: "restaurantSearch",
    instructions: agentInstructions,
    // Assumes the workflow input carries the ticket body in a `ticket` field.
    input: JSON.stringify({ ticket: input.ticket }),
    tools: ["searchUser"],
    resultSchema: z.object({
      userId: z.string(),
    }),
  });

  // do something with the userId
  console.log(userId);
});
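For readers new to tool-using agents, the loop behind them can be sketched in a few lines. This is a generic illustration under stated assumptions, not Inferable's agent implementation: `runAgent`, `Step`, and the `decide` callback (a stand-in for the LLM) are hypothetical.

```typescript
type ToolCall = { type: "tool"; name: string; args: string };
type Final = { type: "final"; userId: string };
type Step = ToolCall | Final;
type Tool = (args: string) => string;

// Repeatedly asks the model what to do, runs the requested tool,
// and feeds the observation back until the model returns a final answer.
function runAgent(
  decide: (observations: string[]) => Step,
  tools: Record<string, Tool>,
  maxSteps: number,
): string {
  const observations: string[] = [];
  for (let i = 0; i < maxSteps; i++) {
    const step = decide(observations);
    if (step.type === "final") return step.userId;
    const tool = tools[step.name];
    observations.push(tool ? tool(step.args) : `unknown tool: ${step.name}`);
  }
  throw new Error("agent did not finish within maxSteps");
}
```

The `maxSteps` bound matters in practice: without it, a model that never produces a final answer would loop forever.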

And more:

Notifications: send notifications to users via Slack or email.
Memoized Results: cache the results of side effects and expensive operations in a distributed way.
Observability: inspect executions in a timeline view, or plug into your own observability tools.
Developer-friendly SDKs: Node.js and Go are supported, with more languages coming soon.
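As a rough picture of what memoized results buy you, the sketch below caches a side effect by key so that a retried step does not repeat it. It is an in-memory stand-in only: `MemoStore` and `memo` are hypothetical names, and a distributed version would keep the entries in a shared store rather than a local `Map`.

```typescript
class MemoStore {
  // Values are kept as JSON strings, as they would be in a shared store.
  private entries = new Map<string, string>();

  // Runs compute once per key; later calls return the stored result.
  // Results must be JSON-serializable.
  memo<T>(key: string, compute: () => T): T {
    const cached = this.entries.get(key);
    if (cached !== undefined) return JSON.parse(cached) as T;
    const value = compute();
    this.entries.set(key, JSON.stringify(value));
    return value;
  }
}
```

For example, keying a payment call by order ID means a workflow that is retried after a crash reads the stored receipt instead of charging twice.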

📚 Language Support

Language	Source	Package
Node.js / TypeScript	Quick start	NPM
Go	Quick start	Go

🚀 Open Source

This repository contains the Inferable control-plane, as well as SDKs for various languages.

Core services:

/control-plane - The core Inferable control plane service
/app - Playground front-end and management console
/cli - Command-line interface tool (alpha)

SDKs:

/sdk-node - Node.js/TypeScript SDK
/sdk-go - Go SDK
/sdk-dotnet - .NET SDK (experimental)

💾 Self Hosting

Inferable is completely open source and can be self-hosted on your own infrastructure for complete control over your data and compute. This gives you:

Full control over your data and models
No vendor lock-in
Enhanced security with your own infrastructure
Customization options to fit your specific needs

See our self hosting guide for more details.

🤝 Contributing

We welcome contributions to all projects in the Inferable repository. Please read our contributing guidelines before submitting any pull requests.

📝 License

All code in this repository is licensed under the MIT License.
