BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Huntsville AI - ECPv6.8.3//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Huntsville AI
X-ORIGINAL-URL:https://hsv.ai
X-WR-CALDESC:Events for Huntsville AI
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/Chicago
BEGIN:DAYLIGHT
TZOFFSETFROM:-0600
TZOFFSETTO:-0500
TZNAME:CDT
DTSTART:20240310T080000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0500
TZOFFSETTO:-0600
TZNAME:CST
DTSTART:20241103T070000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/Chicago:20240724T180000
DTEND;TZID=America/Chicago:20240724T190000
DTSTAMP:20260418T105900
CREATED:20240505T034709Z
LAST-MODIFIED:20240722T035343Z
UID:1657-1721844000-1721847600@hsv.ai
SUMMARY:Private RAG Deployment & Cost
DESCRIPTION:We have been covering a series this year based on how a RAG works\, and how to build one. Now it’s time to look at what it will cost to deploy it! \nBLUF: If you have constraints that require you to host your own LLM instead of using services such as OpenAI\, Anthropic\, or Google – this is going to get expensive. \nThe baseline that we will be using to derive requirements and prices will be the a subset documents that we have been using along the way from NASA. To start\, I have pulled 500 of these documents which total 109\,516 paragraphs and 4\,254\,032 words. \nWe will start with a basic use case with using OpenAI GPT4o and a Weaviate hosted vector store. We will move from there to a self hosted solution for all of the components and add up the month cost associated.\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\nSeries: \nOver the next several sessions\, we will be diving deeper into separate components needed for RAG – hopefully resulting in a chat-based Q&A service for the NASA Technical Report Server.  We were introduced to this data during our submission for the 2022 NASA SpaceApps Challenge – where we placed 2nd. Our submission was a semantic search based on the abstracts for the NSTR dataset of 10\,000 papers. \nLinks: \n\nHuntsville AI 2022 SpaceApps Submission – https://github.com/HSV-AI/spaceapps2022\n\n\nDetails: \n\nDate – 07/24/2024\nTime – 6-7pm\nLocation – HudsonAlpha\nAddress –  601 Genome Way Northwest\, Huntsville\, AL 35806\nZoom –https://us02web.zoom.us/j/84452278503?pwd=rJwxSbD1EAUdIHuzGoMscHYxpfULhR.1
URL:https://hsv.ai/event/private-rag-deployment-cost-2/
LOCATION:HudsonAlpha\, 601 Genome Way Northwest\, Huntsville\, AL\, 35806
ATTACH;FMTTYPE=image/png:https://hsv.ai/wp-content/uploads/2024/05/Private-RAG.png
END:VEVENT
END:VCALENDAR