Deep Dive with Llama-Cpp-Python

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

This week we will take another look at llama-cpp, this time using the Python wrapper available in llama-cpp-python. This library can be installed and imported directly, or run as a separate service that exposes an OpenAI-compatible API. We covered the basics of llama-cpp last year and looked through the different quantization approaches available. It provides […]

OpenAI API

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

This week we will cover the use of the OpenAI API with both llama-cpp-python and OpenAI itself. After this session, you will have a solid understanding of how to use the chat completion API, along with a way to run the same code locally against llama-cpp-python. We will cover account setup, billing & budgets, as […]

Summer Social

Stovehouse 3414 Governors Dr SW, Huntsville, AL

We are going to take a break from the formal sessions this week to get together at Stovehouse and hang out for a bit. This is a great opportunity to relax and meet others involved in Huntsville AI. We have a wide variety of backgrounds, so I always learn something interesting. This officially starts at […]

Law and AI

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

This week we have Andrew Tuggle from Womble Bond Dickinson (US) LLP joining us to talk about Law and AI. He has been involved in our meetups for years and always has an interesting perspective from an attorney’s point of view. As AI becomes more advanced and ubiquitous, it is raising new legal issues across many domains. In this session, […]

Vector Storage with Weaviate

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

We’re going to add in a new topic to the NASA RAG series this week, so check below for some rearranging of the schedule. As always, if there is something that you would like to talk about (or have someone cover) – please let me know! After a few rounds of iteration using ChromaDB for […]

RAG Prompt Engineering

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

If you have ever tried to prompt an LLM to give you what you want, you have probably run into some frustrating challenges. This frustration can range from trying to write what you want in pseudo-code to arguing with a child about putting their toys away. This week we will focus on some guidelines for […]

Private RAG Deployment & Cost

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

We have been covering a series this year based on how a RAG works, and how to build one. Now it’s time to look at what it will cost to deploy it! BLUF: If you have constraints that require you to host your own LLM instead of using services such as OpenAI, Anthropic, or Google […]

2024 SBIR Topics Round 3

This week we will cover the last round of DoD SBIR Topics that are scheduled to be released on Wednesday. This is always a good checkpoint to see which areas of AI DoD organizations are interested in. The last time we covered SBIR topics, there were several on the call that were new to […]

Faster Whisper for Audio Transcription

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

This week we will cover the Faster Whisper library. It takes an approach similar to the one used by llama.cpp and provides a much smaller footprint and faster execution times than the OpenAI Whisper library. We will also walk through the approach for using this library within an AWS Lambda function, along with tips and tricks for […]

Pixtral Model by Mistral AI

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

We will cover the Pixtral multimodal model released by Mistral AI last week. Hopefully we can get it loaded up in Le Chat (French ChatGPT?) and kick the tires on it a bit. From what I am hearing from some of our group, it’s definitely something to look into. From the Mistral announcement: Natively multimodal, […]

vLLM, New SBIR Topics, Helene Response

HudsonAlpha 601 Genome Way Northwest, Huntsville, AL

We have a few topics that we will cover this week, led by Josh Phillips (vLLM) and Gary Turner (Helene Response). We will also cover the list of AI-related SBIR topics that were released last week. vLLM: In this session, we will demonstrate how easy it is to deploy vLLM on RunPod using an […]