Ali Parandeh: Building Generative AI Services with FastAPI, Kartoniert / Broschiert

Ali Parandeh

Building Generative AI Services with FastAPI

Buch

A Practical Approach to Developing Context Rich Generative AI Applications

sofort lieferbar

Aktueller Preis: EUR 62,38

Verlängerter Rückgabezeitraum bis 31. Januar 2026

Alle zur Rückgabe berechtigten Produkte, die zwischen dem 1. bis 31. Dezember 2025 gekauft wurden, können bis zum 31. Januar 2026 zurückgegeben werden.

Versandkosten (United States of America): EUR 19,90

Verlag:: O'Reilly Media, 04/2025
Einband:: Kartoniert / Broschiert
Sprache:: Englisch
ISBN-13:: 9781098160302
Artikelnummer:: 12049265
Gewicht:: 882 g
Maße:: 230 x 175 mm
Stärke:: 28 mm
Erscheinungstermin:: 30.4.2025
Hinweis: Achtung: Artikel ist nicht in deutscher Sprache!

Klappentext

Ready to build applications using generative AI? This practical book outlines the process necessary to design and build production grade AI services with a FastAPI web server that communicate seamlessly with databases, payment systems, and external APIs. You'll learn how to develop autonomous generative AI agents that stream outputs in real-time and interact with other models. Web developers, data scientists, and DevOps engineers will learn to implement end-to-end production-ready services that leverage generative AI.

You'll learn design patterns to manage software complexity, implement FastAPI lifespan for AI model integration, handle long-running generative tasks, perform content filtering, cache outputs, implement retrieval augmented generation (RAG) with a vector database, implement usage / cost monitoring and tracking, protect services with your own authentication and authorization mechanisms, and effectively control stream outputs directly from GenAI models. You'll explore efficient testing methods for AI outputs, validation against databases, and deployment patterns using Docker for robust microservices in the cloud.

Build generative services that interact with databases, external APIs, and more
Learn how to load AI models into a FastAPI lifecycle memory
Monitor and log model requests and responses within services
Use authentication and authorization patterns hooked with generative models
Handle and cache long-running inference tasks
Stream model outputs via streaming events and WebSockets into browsers or files
Automate the retraining process of generative models by exposing event-driven endpoints

Ali Parandeh is a Chartered Engineer with the UK Engineering Council and a Microsoft and Google certified developer, data engineer, and data scientist.