Vercel AI SDK

The Vercel AI SDK is a popular way to build AI features in Next.js, React, Svelte, and Vue apps. Venice works out of the box as an OpenAI-compatible provider.

Setup

npm install ai @ai-sdk/openai @ai-sdk/react zod

Provider configuration

Create a Venice provider using the OpenAI-compatible adapter:

// lib/venice.ts
import { createOpenAI } from '@ai-sdk/openai';

const openai = createOpenAI({
  apiKey: process.env.VENICE_API_KEY!,
  baseURL: 'https://api.venice.ai/api/v1',
});

// Use .chat() to ensure compatibility with Venice's chat completions endpoint
export const venice = (modelId: string) => openai.chat(modelId);

Using .chat() ensures that requests go to Venice's /chat/completions endpoint. The default openai('model') syntax may use newer OpenAI endpoints that Venice does not yet support.
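To make the routing concrete, here is a minimal sketch of the plain HTTP request that a .chat() call is assumed to correspond to, following the OpenAI chat completions convention. The helper buildChatCompletionRequest and the ChatMessage type are illustrative names and not part of the AI SDK; only the base URL and the /chat/completions path come from the configuration above.

```typescript
// Hypothetical helper: builds (but does not send) a chat completions request.
type ChatMessage = { role: 'system' | 'user' | 'assistant'; content: string };

const VENICE_BASE_URL = 'https://api.venice.ai/api/v1';

function buildChatCompletionRequest(
  apiKey: string,
  modelId: string,
  messages: ChatMessage[],
): { url: string; init: { method: string; headers: Record<string, string>; body: string } } {
  return {
    // The path that .chat() is expected to target
    url: `${VENICE_BASE_URL}/chat/completions`,
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/json',
      },
      // Minimal OpenAI-style body: a model ID plus the conversation so far
      body: JSON.stringify({ model: modelId, messages }),
    },
  };
}

const request = buildChatCompletionRequest('test-key', 'venice-uncensored', [
  { role: 'user', content: 'Hello' },
]);
console.log(request.url); // prints https://api.venice.ai/api/v1/chat/completions
```

Logging the request produced this way can help confirm which endpoint a given setup is hitting when debugging unexpected 404 responses.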

Streaming chat (Next.js App Router)

API route

// app/api/chat/route.ts
import { streamText } from 'ai';
import { venice } from '@/lib/venice';

export async function POST(req: Request) {
  const { messages } = await req.json();

  const result = streamText({
    model: venice('venice-uncensored'),
    system: 'You are a helpful, privacy-respecting AI assistant.',
    messages,
  });

  return result.toDataStreamResponse();
}

React component

// app/page.tsx
'use client';

import { useChat } from '@ai-sdk/react';

export default function Chat() {
  const { messages, input, handleInputChange, handleSubmit, isLoading } = useChat();

  return (
    <div className="max-w-2xl mx-auto p-4">
      <div className="space-y-4 mb-4">
        {messages.map((m) => (
          <div key={m.id} className={m.role === 'user' ? 'text-right' : 'text-left'}>
            <span className="font-bold">{m.role === 'user' ? 'You' : 'Venice'}:</span>
            <p className="whitespace-pre-wrap">{m.content}</p>
          </div>
        ))}
      </div>

      <form onSubmit={handleSubmit} className="flex gap-2">
        <input
          value={input}
          onChange={handleInputChange}
          placeholder="Ask anything..."
          className="flex-1 border rounded px-3 py-2"
          disabled={isLoading}
        />
        <button type="submit" disabled={isLoading} className="bg-red-600 text-white px-4 py-2 rounded">
          Send
        </button>
      </form>
    </div>
  );
}

Text generation (non-streaming)

import { generateText } from 'ai';
import { venice } from '@/lib/venice';

const { text } = await generateText({
  model: venice('zai-org-glm-5-1'),
  prompt: 'Explain zero-knowledge proofs in simple terms.',
});

console.log(text);

Structured output

import { generateObject } from 'ai';
import { venice } from '@/lib/venice';
import { z } from 'zod';

const { object } = await generateObject({
  model: venice('venice-uncensored'),
  schema: z.object({
    recipe: z.object({
      name: z.string(),
      ingredients: z.array(z.string()),
      steps: z.array(z.string()),
      prepTimeMinutes: z.number(),
    }),
  }),
  prompt: 'Generate a recipe for chocolate chip cookies.',
});

console.log(object.recipe.name);
console.log(`Prep time: ${object.recipe.prepTimeMinutes} minutes`);

Tool calling

import { streamText, tool } from 'ai';
import { venice } from '@/lib/venice';
import { z } from 'zod';

const result = streamText({
  model: venice('zai-org-glm-5-1'),
  messages: [{ role: 'user', content: 'What is the weather in Tokyo?' }],
  tools: {
    getWeather: tool({
      description: 'Get current weather for a location',
      parameters: z.object({
        location: z.string().describe('City name'),
      }),
      execute: async ({ location }) => {
        // Call your weather API here; this stub returns fixed data
        return { temperature: 22, condition: 'Sunny', location };
      },
    }),
  },
});

for await (const part of result.fullStream) {
  if (part.type === 'text-delta') {
    process.stdout.write(part.textDelta);
  } else if (part.type === 'tool-result') {
    console.log('Tool result:', part.result);
  }
}

Image generation

Venice's image generation endpoint can be called directly alongside the AI SDK:

// app/api/image/route.ts
export async function POST(req: Request) {
  const { prompt } = await req.json();

  const response = await fetch('https://api.venice.ai/api/v1/image/generate', {
    method: 'POST',
    headers: {
      'Authorization': `Bearer ${process.env.VENICE_API_KEY}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify({
      model: 'qwen-image',
      prompt,
      width: 1024,
      height: 1024,
    }),
  });

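  // Surface upstream failures instead of parsing an error body as an image
  if (!response.ok) {
    return Response.json({ error: 'Image generation failed' }, { status: response.status });
  }

  const data = await response.json();
  return Response.json({ image: data.images[0] });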
}

Multi-model chat (model selector)

Let users choose among Venice models:

// app/api/chat/route.ts
import { streamText } from 'ai';
import { venice } from '@/lib/venice';

const ALLOWED_MODELS = [
  'venice-uncensored',
  'zai-org-glm-5-1',
  'qwen3-vl-235b-a22b',
  'qwen3-5-9b',
];

export async function POST(req: Request) {
  const { messages, model: modelId } = await req.json();

  if (!ALLOWED_MODELS.includes(modelId)) {
    return new Response('Invalid model', { status: 400 });
  }

  const result = streamText({
    model: venice(modelId),
    messages,
  });

  return result.toDataStreamResponse();
}

// Client component with a model selector
'use client';

import { useChat } from '@ai-sdk/react';
import { useState } from 'react';

const MODELS = [
  { id: 'venice-uncensored', name: 'Venice Uncensored', desc: 'Fast & uncensored' },
  { id: 'zai-org-glm-5-1', name: 'GLM 5.1', desc: 'Most intelligent (private)' },
  { id: 'qwen3-vl-235b-a22b', name: 'Qwen Vision', desc: 'Advanced vision + text' },
  { id: 'qwen3-5-9b', name: 'Qwen 3.5 9B', desc: 'Fastest & cheapest' },
];

export default function Chat() {
  const [model, setModel] = useState('venice-uncensored');
  const { messages, input, handleInputChange, handleSubmit } = useChat({
    body: { model },
  });

  return (
    <div>
      <select value={model} onChange={(e) => setModel(e.target.value)}>
        {MODELS.map((m) => (
          <option key={m.id} value={m.id}>{m.name} — {m.desc}</option>
        ))}
      </select>
      {/* ... chat UI ... */}
    </div>
  );
}
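
The route and the client component above each keep their own copy of the model list, so the two can drift apart. One option, sketched here under the assumption that both files can import from a shared module, is to define the list once and derive the server-side check from it. The file name lib/models.ts and the names VENICE_MODELS and isAllowedModel are hypothetical.

```typescript
// lib/models.ts (hypothetical shared module, not part of the original example)
export const VENICE_MODELS = [
  { id: 'venice-uncensored', name: 'Venice Uncensored', desc: 'Fast & uncensored' },
  { id: 'zai-org-glm-5-1', name: 'GLM 5.1', desc: 'Most intelligent (private)' },
  { id: 'qwen3-vl-235b-a22b', name: 'Qwen Vision', desc: 'Advanced vision + text' },
  { id: 'qwen3-5-9b', name: 'Qwen 3.5 9B', desc: 'Fastest & cheapest' },
] as const;

// Union of the literal model IDs listed above
export type VeniceModelId = (typeof VENICE_MODELS)[number]['id'];

// Type guard: narrows untrusted request input to a known model ID
export function isAllowedModel(value: unknown): value is VeniceModelId {
  return typeof value === 'string' && VENICE_MODELS.some((m) => m.id === value);
}

console.log(isAllowedModel('qwen3-5-9b')); // true
console.log(isAllowedModel('gpt-4o')); // false
```

With this in place, the route could replace the ALLOWED_MODELS.includes(modelId) check with isAllowedModel(modelId), and the select element could map over VENICE_MODELS.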

Web search integration

Pass Venice-specific parameters to enable web search:

import { streamText } from 'ai';
import { venice } from '@/lib/venice';

const result = streamText({
  model: venice('venice-uncensored'),
  messages: [{ role: 'user', content: 'What happened in AI news today?' }],
  // Venice-specific parameters
  experimental_providerMetadata: {
    venice_parameters: {
      enable_web_search: 'auto',
    },
  },
});

If experimental_providerMetadata is not forwarded to the request, you can use a custom fetch wrapper or call the Venice API directly for web search features.
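
The custom fetch wrapper mentioned above could look roughly like the following sketch. It assumes Venice reads venice_parameters from the top level of the JSON request body, and the helper name withVeniceParameters is hypothetical. The wrapper takes the underlying fetch as an argument so it can be exercised without network access.

```typescript
// Hypothetical helper: wraps a fetch implementation so that every JSON request
// body gets a top-level `venice_parameters` object merged in.
function withVeniceParameters(
  veniceParameters: Record<string, unknown>,
  baseFetch: typeof fetch = fetch,
): typeof fetch {
  return async (input, init) => {
    const rawBody = init?.body;
    // Only string bodies are rewritten; anything else is forwarded untouched
    if (typeof rawBody !== 'string') {
      return baseFetch(input, init);
    }

    let payload: unknown;
    try {
      payload = JSON.parse(rawBody);
    } catch {
      // Not JSON, so leave the request as it is
      return baseFetch(input, init);
    }
    if (typeof payload !== 'object' || payload === null || Array.isArray(payload)) {
      return baseFetch(input, init);
    }

    const body = JSON.stringify({ ...payload, venice_parameters: veniceParameters });
    return baseFetch(input, { ...init, body });
  };
}

// Assumed wiring, using the custom `fetch` setting of createOpenAI:
// const openai = createOpenAI({
//   apiKey: process.env.VENICE_API_KEY!,
//   baseURL: 'https://api.venice.ai/api/v1',
//   fetch: withVeniceParameters({ enable_web_search: 'auto' }),
// });
```

Because the wrapper applies to every request made through that provider instance, a separate provider per parameter set is probably the simpler design when only some calls need web search.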

Embeddings

For embeddings, call textEmbeddingModel() directly on the provider:

import { embed, embedMany } from 'ai';
import { createOpenAI } from '@ai-sdk/openai';

const openai = createOpenAI({
  apiKey: process.env.VENICE_API_KEY!,
  baseURL: 'https://api.venice.ai/api/v1',
});

// Single embedding
const { embedding } = await embed({
  model: openai.textEmbeddingModel('text-embedding-bge-m3'),
  value: 'Privacy-first AI infrastructure',
});

// Batch embeddings
const { embeddings } = await embedMany({
  model: openai.textEmbeddingModel('text-embedding-bge-m3'),
  values: [
    'Venice AI provides private inference.',
    'Zero data retention guaranteed.',
    'OpenAI SDK compatible.',
  ],
});
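
Embedding vectors are typically compared with cosine similarity, for example to rank stored texts against a query. The sketch below is a dependency-free version of that comparison; the function names are illustrative, and the toy two-dimensional vectors stand in for the much longer vectors a real embedding model returns.

```typescript
// Cosine similarity between two vectors of equal length
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error('Vectors must have the same length');
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction, so treat it as dissimilar to everything
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical helper: returns candidate indices ordered from most to least similar
function rankBySimilarity(query: number[], candidates: number[][]): number[] {
  return candidates
    .map((vector, index) => ({ index, score: cosineSimilarity(query, vector) }))
    .sort((x, y) => y.score - x.score)
    .map((entry) => entry.index);
}

console.log(rankBySimilarity([1, 0], [[0, 1], [1, 0], [1, 1]])); // [ 1, 2, 0 ]
```

In practice the query would be the embedding result from embed() and the candidates would be the embeddings array from embedMany().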

Environment variables

# .env.local
VENICE_API_KEY=your-venice-api-key

Recommended models

Use case	Model	Why
Chat apps	`venice-uncensored`	Fast, inexpensive, unfiltered
Complex tasks	`zai-org-glm-5-1`	Top-tier private reasoning
Vision apps	`qwen3-vl-235b-a22b`	Advanced image understanding
High volume	`qwen3-5-9b`	Cheapest at $0.10/1M input tokens, $0.15/1M output tokens
Tool calling	`zai-org-glm-5-1`	Reliable function calling
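
The per-million prices quoted for qwen3-5-9b in the table translate into a simple cost estimate. The sketch below assumes those two figures are still current and that billing is linear in token count; the names TokenPricing, QWEN3_5_9B_PRICING, and estimateCostUSD are hypothetical.

```typescript
interface TokenPricing {
  inputPerMillionUSD: number;
  outputPerMillionUSD: number;
}

// Prices copied from the table above; they may change, so treat them as placeholders
const QWEN3_5_9B_PRICING: TokenPricing = {
  inputPerMillionUSD: 0.1,
  outputPerMillionUSD: 0.15,
};

// Linear estimate: tokens / 1M multiplied by the per-million price, summed over both directions
function estimateCostUSD(pricing: TokenPricing, inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * pricing.inputPerMillionUSD +
    (outputTokens / 1_000_000) * pricing.outputPerMillionUSD
  );
}

// 2M input tokens and 1M output tokens: 2 * $0.10 + 1 * $0.15
console.log(estimateCostUSD(QWEN3_5_9B_PRICING, 2_000_000, 1_000_000).toFixed(2)); // prints "0.35"
```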

Vercel AI SDK documentation

Official Vercel AI SDK documentation

Venice models

Browse all Venice models
