API Documentation

Introduction

The Fomoa AI 2.0 API provides access to our powerful GPU-accelerated vision-language model. Powered by NVIDIA T4 GPU, enjoy blazing fast ~3 second response times. Our API is fully compatible with the OpenAI Chat Completions API format, making it easy to integrate with existing applications and SDKs.

~3s

Response Time

T4 GPU

Acceleration

128K

Context Window

₹49

Per Million Tokens

Key Features

GPU-Accelerated - NVIDIA T4 GPU for ~3 second inference
Vision Capabilities - Analyze images, documents, and screenshots
Voice Input - Built-in speech recognition (Web Speech API)
Text-to-Speech - Listen to AI responses read aloud
PPT Generation - Create professional presentations via API
Hindi Support - Full Hindi language understanding
Streaming - Real-time streaming responses
OpenAI Compatible - Drop-in replacement for OpenAI SDK

Base URL

https://fomoa.cloud/api/v1

Quick Start

curl https://fomoa.cloud/api/v1/chat \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "fomoa-vision-2.0",
    "messages": [
      {"role": "user", "content": "Hello, who are you?"}
    ]
  }'

What's New in Fomoa AI 2.0

NVIDIA T4 GPU

Inference powered by 16GB VRAM GPU

Voice Commands

Speak to chat with voice recognition

Text-to-Speech

Listen to AI responses read aloud

PPT Generation

Create presentations via API

Hindi Language

Full Hindi language support

~3s Response

Lightning fast GPU inference