Speech-to-text on x402 Bazaar. Send a POST request as multipart/form-data with the audio in a `file` field (WAV, FLAC, M4A, MP3, OGG, or WEBM). Supported models: Whisper Large v3 (default), NVIDIA Parakeet, fal Wizper, and xAI STT. Optional fields: `model`, `response_format`, `timestamps`, `language`. Returns JSON `{text, duration, timestamps?}`. Hard cap of 300 s of audio per request; longer audio is rejected with HTTP 400, so check the duration before paying.
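Since the listing advises checking the duration before paying, a client-side pre-check might look like the sketch below. It uses only Python's standard `wave` module, so it covers WAV input only; the other accepted formats would need a different decoder. The helper names and the assumption that exactly 300 s is still accepted are illustrative, not part of the service's documented behavior.

```python
import wave

# Hard cap stated in the listing; treating exactly 300 s as allowed is an assumption.
MAX_AUDIO_SECONDS = 300


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds (frames / sample rate)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_within_cap(duration_seconds, cap=MAX_AUDIO_SECONDS):
    """True if the audio is short enough to submit without hitting the cap."""
    return duration_seconds <= cap


def safe_to_submit(path):
    """Hypothetical pre-payment gate: check a local WAV file against the cap."""
    return is_within_cap(wav_duration_seconds(path))
```

Calling `safe_to_submit("recording.wav")` before starting the payment flow avoids sending a request that the cap would reject; the file name here is a placeholder.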
| Network | Scheme | Amount | Pay To |
|---|---|---|---|
| Base | exact | $0.05 USDC | 0x2348...2bCb |
| Network | Address | USDC Balance | ETH Balance | Tx Count |
|---|---|---|---|---|
| Base | 0x2348...2bCb | $0.86 | 0.000000 ETH | 0 |