0% found this document useful (0 votes)

15 views55 pages

LLM Programming Tools and APIs Guide

The document discusses programming tools for Large Language Models (LLMs), focusing on APIs such as OpenAI's API, which allows users to interact with and embed LLMs in applications. It explains the concept of Retrieval-Augmented Generation (RAG) for enhancing LLM capabilities by accessing external information and the process of fine-tuning LLMs with custom data. Additionally, it covers the differences between human and LLM agents, the importance of chunking in data processing, and various applications of LLMs.

Uploaded by

kartik17lksh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views55 pages

LLM Programming Tools and APIs Guide

Uploaded by

kartik17lksh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Module 2

Programming an LLM
Hardeep Johar
Senior Lecturer in the Discipline of
Industrial Engineering and Operations Research
LLM Programming Tools
LLMs come with support for programmers through platforms, APIs, and third party libraries

OpenAI API Meta LLaMa 3 Google Gemini

Vertex API
What is an API?
An API allows you to:

❏ Write complex threaded queries

❏ Extract relevant responses
❏ Build code
❏ Tailor an LLM to your specific domain
❏ Embed the LLM in your own applications
OpenAI API
The OpenAI API allows you to programmatically access various GPT models

OpenAI API Docs: [Link]

OpenAI API
API

Interact Build Embed

Interact with a GPT LLM in the Build a specialized Embed a GPT based chatbot
same manner as ChatGPT but version of the LLM for in your own web or mobile
from inside a program your own application or application - seamless
organization integration of the LLM into a
● Ask questions
broader application
● Chat with the model
● Analyze data (input data into
the LLM)
● Get programming
suggestions or fix code
OpenAI API: What You Need
● An OpenAI account
○ [Link]
● Create a project
○ [Link]
● Create a secret API key
○ [Link]
● Store the key in a safe place
○ The “Colab notebooks” folder on your Google Drive is probably the
easiest
○ Don’t share it with anyone!
Summary

LLM Programming Tools

What is an API?
OpenAI API
OpenAI API: Using the Key

OpenAI API Notebook

Flight number: 42 Flight number: 42

Arrival time: 4:00 pm Arrival time: 4:10pm

Examples of applications
❏ Customer service chatbots
❏ Discovering new proteins
❏ Symptom diagnosis
❏ Accessibility
❏ Learning history
❏ Document summarization
❏ Search engines
❏ Video editing
❏ Marketing campaigns
❏ Accounting
Building Custom LLMs
Human Agent vs LLM Agent - 1
A human agent An LLM agent

● Has some basic knowledge of the ● Has been trained on large amounts
world (e.g., a college degree) of text data
● Can converse in natural language ● Can converse in natural language
● Can answer general questions ● Can answer general questions
● Can make deductive inferences ● Has some ability to make
● But doesn’t know the internal deductions as long as they have
business knowledge to answer been trained on relevant
business specific questions information
● But doesn’t know the internal
business knowledge to answer
business specific questions
Human Agent vs LLM Agent - 2
● A human agent can add to knowledge

Human

By going back to school By being provided in-house training

● a graduate degree, ● an onboarding program at an

professional certifications, organization
etc. ● Trade subscriptions
● Usually requires a significant ● Usually a smaller cost
financial outlay
Human Agent vs LLM Agent - 3
● An LLM agent can add to knowledge

LLMs

Retrieval Augmented
By Being Retrained Fine Tuning
Generation (RAG)

Very expensive ● Adjusting model parameters using ● Provide the model with
(analogous to sending a supervised learning specialized information and
● Expensive and time consuming a mechanism to retrieve this
human agent back to
(analogous to sending a human
elementary school!) information
agent to graduate school or to a
professional certification program) ● The model will use this
specialized information and
fall back to generalized
knowledge if necessary
● Model parameters don’t
change
● Relatively inexpensive
Retrieval-Augmented Generation - 1
● The LLM accesses information from an external (to its model) document
repository
● It uses this document repository, as well as its trained model, to answer
queries
● Roughly:
○ The documents are converted into chunks (short sequences of words)
○ The chunks are converted into embedded vectors
○ The query is converted into embedded vectors
○ The most similar document embedded vectors are chosen using a
similarity algorithm
○ The LLM then uses its “language skills” to respond to the query
Embedding Chunked Vectors
● Since chunks are smaller groups of information
○ For example, 250 word chunks from a 10,000 word document
○ The likelihood of the “objects” in the chunks being related is high
○ Embedded vectors from these chunks will capture relationships better
Retrieval-Augmented Generation - 2

Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder

Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
Retrieval-Augmented Generation - 3

Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder
document text is chunked

Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
Chunking
● Text documents are large and contain a variety of information
○ A customer service agent may contain information on
■ How to return products
● With different policies for different products
■ How to contact a human agent
■ Locations of retail stores
● Chunking increases the probability that related information is grouped
Retrieval-Augmented Generation - 4

Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder
embedded vectors from
the chunked text

Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
Retrieval-Augmented Generation - 5

Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder
embedded vectors
are stored in a
vector database
Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
Vector Databases and Vector Search
● Indexed databases for storing vectors
● Given a vector as input, vector databases search for matching vectors using a
similarity algorithm
○ Cosine similarity:
■ Calculates the cosine of the angle between two vectors
■ The smaller the angle, the closer the cosine is to 1, and the more
similar the vectors
● The number of vectors is large and calculating the similarity between every
pair of vectors is computationally expensive
○ The input prompt is converted into a large number of vectors
○ The document repository is converted into a large number of vectors
○ The product of the two is large
Embedding Vectors and Cosine Similarity
Notebook
Vector Search: Retrieving Similar Vectors
Navigable Small Worlds (NSW) algorithm

● Store a pre-constructed similarity graph of document chunks

● Randomly pick a chunk and compute the similarity with the input vector
● Move to a neighbor of the chunk and recompute the similarity
● Stop when the similarity doesn’t get better
● Repeat and report the top-n similar chunks
● NSW algorithms rely on the “six degrees of separation” idea
○ The best similarity will be utmost some small n away from a random
chunk
Navigable Small Worlds - 1
● Construct the base
M=2 2
1 graph
0 ● The parameter M
specifies the number
3 5
of connections a
4 chunk makes to other
chunks
6
8 ● For a large number of
chunks, the graph will
7 be sparse
10
9 ● The edge attribute is
the inverse of cosine
similarity
Navigable Small Worlds - 2

M=2 2 ● A new chunk (from

1
0 N the LLM prompt)
arrives
● And is inserted into
3 5
4 the graph

6
8

7
9 10
Navigable Small Worlds - 3

M=2 2 ● Choose a random chunk

1
0 N (e.g., 10)
● And calculate the
distance
3 5
4 ○ add a new edge
between 10 and N
6
8

7
9 10
Navigable Small Worlds - 4

M=2 2 ● Check distance

1
0 N (similarity) from two
neighbors
● And calculate the
3 5
4 distance
○ add new edges
6
8 from 9 and 8 to N

7
9 10
Navigable Small Worlds - 5

M=2 2 ● 8 is closer
1
0 N ● Calculate distance
from 4 and 5 to N

3 5
4

6
8

7
9 10
Navigable Small Worlds - 6

M=2 2 ● 4 is closer
1
0 N ● Calculate distance
from 1 and 3 to N

3 5
4

6
8

7
9 10
Navigable Small Worlds - 7

M=2 2 ● 1 is closer
1
0 N ● Calculate distance
from 0 and 3 to N
● Since neither 0 nor 3
3 5
4 is closer than 1, stop.
1 is the closest chunk
6
8

7
9 10
Vector Search: Retrieving Vectors

● Hierarchical Navigable Small Worlds (HNSW) algorithm

○ Adaptation of NSW but the pre-constructed graph is hierarchical with a small
number of starting chunks with subsequent chunks arranged in a hierarchy
○ The algorithm picks a random chunk from the top level and then searches in
that hierarchy
○ Using the NSW algorithm at each level
○ Facebook AI Search Similarity (FAISS) is a commonly used HNSW
implementation and you will see it used later in this course
● Constructing the hierarchy and the base network is the hard part of HNSW
● We won’t look at it in detail here but, if you’re interested:
○ [Link]
Retrieval-Augmented Generation - 6

Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder
The LLM selects the most
appropriate embedded vector set
Embedded using a vector search algorithm
vectors Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
Retrieval-Augmented Generation - 7
Question

Instruction LLM Answer

Vector
Encoder Context
Database

Encoder
And responds to the prompt

Documents
Source: [Link]
By Gknor - Own work, CC BY-SA 4.0, [Link]
RAG Example Notebook (Xilin)
Knowledge Graphs
● A knowledge graph is a data model that uses a graph to organize and
represent domain knowledge in the form of entities and relationships
● Knowledge graphs have a formal semantics for
○ Storing knowledge (entities, relationships)
○ Retrieving knowledge (knowledge searching)
Knowledge Graph: Example
Fine Tuning an LLM
Fine Tuning an LLM
● When custom data is added to an LLM using RAG, the model itself is not updated
○ All the parameters stay the same
○ Vectors representing the new knowledge are computed and stored in a
database
○ The LLM retrieves these vectors (i.e., the specific data chunks) and combines
it with its model (the LLM network parameters) to figure out the output
○ The LLM itself is unchanged
● In fine tuning
○ The model is updated with new parameters
○ You get, in essence, a new LLM
● When fine tuning, you need to go through all the steps in the machine learning
process
Fine Tuning: Broad Steps
● Gather data: The quality of the data is the most important input into a model.
Data should be representative of the domain you are customizing on; should
be sufficient in quantity
● Preprocessing and feature engineering
○ Clean the data
○ Create appropriate features
● Split into training/validation/testing sets: Fine tuning changes model
parameters and you need to ensure that the model is learning “correctly”
● Fine tune the model
● Test the fine tuned model
Supervised Fine Tuning: Process
Supervised fine tuning

● The pre-trained LLM is given specific labeled examples

○ A prompt
■ How can I return my LCD TV?
○ A response
■ To return your LED TV, ensure it is in its original packaging and includes all
accessories. Returns are accepted within 30 days of purchase. A restocking
fee of 15% may apply.
● The prompt is used to generate a response
■ Example: A generic response on returns
● The generated response is compared with the labeled response
● Weights are adjusted to account for the error
● The process is repeated
Supervised Fine Tuning: Advantages and Disadvantages
● Advantages:
○ Lower processing and memory requirements than full retraining
○ Fall back to pre-trained model is more seamless (compared to RAG)
○ Adapts to the specific domain (like RAG)
● Disadvantages:
○ Model weights are changed and this may compromise the reliability of the LLM for
non-domain questions
○ Overfitting: This is a big danger since the model weights are changed. General
queries may still give a domain specific response even where they are not suitable
○ Data issues: Data has to be reconstituted in a prompt response format and this
may not be practical
■ For example, many different forms of the LCD TV prompt need to prepared
■ And this has to be repeated for all possible prompt/response pairs
Instruction Tuning
● In instruction tuning, the pre-trained LLM is provided with an instruction and
given a response
○ Instruction: Translate I love you into Italian
○ Response: te amo
● Typical use cases:
○ Language translation
○ Multiple choice tests
● Instruction fine tuning works like supervised fine tuning (the model is updated)
but is used
○ when there is insufficient labeled data
○ When the response is well defined
Parameter Efficient Fine Tuning (PEFT)
● Supervised fine tuning and instruction fine tuning update the entire model
● Since an LLM is huge (trillions of parameters), these methods are relatively resource
intensive
○ Though less intensive than a complete retraining!
● Parameter efficient fine tuning focuses on updating a small part (subset) of the model
○ Can work with less data (since the training is focused)
○ More efficient (since only a small subset of the model is being changed)
○ Less likely to be overfitted (since the LLM is largely unchanged)
○ However, not as reliable (since only a small subset of the LLM is retrained)
● PEFT is mostly used when
○ Resources are limited
○ Data availability is limited
Types of Parameter Efficient Fine Tuning
● Adapters
● Low rank adaption (LORA) and Quantized Low Rank Adaption (QLORA)
● Infused Adapter by Inhibiting and Amplifying Inner Activations (IA3)
● Layer freezing
● Prefix tuning
● Prompt tuning
Adapters
● Adapters are new submodules that are inserted into the transformer
architecture
● With each training case (labeled) only the weights in the adapter modules are
updated
● The original pre-trained LLM weights are not changed
● Since the adapters are relatively small (few weights) the resources required
are relatively low
te amo (OUTPUT!)
Transformer
Word Probabilities
Encoder n

Encoder Encoder 2 Decoder n

Stack
Encoder 1 Decoder 1
Insert
Adapters

Embedding Position Embedding Position

Layer Encoding Layer Encoding

I love you (INPUT) te amo (TARGET)

LORA: Low Rank Adaptation
● In between any two layers of a transformer, there are n x n weights
● LORA keeps two smaller matrices
○ n x a and a x n, where a << n
● As each case is passed through the transformer
○ The change in weights is computed
○ A lower dimension approximation of this change is used to update the
weights in the LORA adaptor
○ The original weights are unchanged
● The main advantage is the reduced memory and processing requirement
○ The LORA adaptor is many (many!) orders of magnitude smaller than the
pre-trained LLM
LORA Adaptor

n nodes Encoder i+1 Adaptor

axn
a << n
nxn weights
nxa

n nodes Encoder i
IA3
● IA3 is structurally similar to LORA
● But, the low rank vectors are directly learned rather than computed from the
weight changes of the original model
● This makes IA3 faster and more memory efficient than LORA
Layer Freezing
● Roughly
○ The early layers in a model are more general
■ Language elements
■ General knowledge
○ Later layers are more specialized
■ Domain specific knowledge
■ Derived knowledge
■ How to knowledge (classify, summarize, translate, etc.)
● Layer freezing attempts to freeze early layers and update weights only in later
layers when fine tuning a model
● Models can be trained to figure out which layers (or parameters) need to be
updated (beyond the scope of our class!)
Prefix Tuning
● A vector is prepended to the model, before the input
● The purpose of the vector is to provide an operational context to the LLM
● For example:
○ A prefix vector may guide the model to produce a summary of the input
○ A prefix vector may guide the model to produce a translation of the input
○ A prefix vector may guide the model to set the context to Olympics
○ A prefix vector may guide the model to set the context to the presidential
elections
● Advantages and disadvantages
○ Very memory efficient (a vector) and fast training (only the prefix vector is
updated)
○ Limited use since it is setting a context rather than building new information
into the LLM
Prompt Tuning
● The same prompt can be written in many different ways
○ How do I return my LCD TV
○ I bought an LCD TV from your store and now realize it is too big for the space and
want to return it. What should I do?
● The above two examples ask the same question but in different ways
● In prompt tuning, a preprocessing layer that is trained with sample prompts is inserted
between the input and the model
○ The preprocessing layer adds a set of embeddings to the prompt
○ These embeddings direct the prompt (sort of) toward a standard prompt
○ In the two examples above, both prompts will be directed toward the same
question
● No new knowledge is added but the model can be guided toward a specific purpose
Fine Tuning Example
How to create an Open AI API Key
Screenshots or b-roll

[Link]

Build Your First AI Agent Live Class
No ratings yet
Build Your First AI Agent Live Class
34 pages
LLM Interview Questions & Course Guide
No ratings yet
LLM Interview Questions & Course Guide
10 pages
Understanding RAG with Gemini Pro
No ratings yet
Understanding RAG with Gemini Pro
42 pages
Guide to LLMs, Agents, and Storage Systems
No ratings yet
Guide to LLMs, Agents, and Storage Systems
5 pages
LLM Project Essentials for Interns
No ratings yet
LLM Project Essentials for Interns
4 pages
Harnessing RAG with Foundational Models
100% (1)
Harnessing RAG with Foundational Models
20 pages
LLM Deployment Strategies and Insights
No ratings yet
LLM Deployment Strategies and Insights
5 pages
LLMs, Agents, and RAG Explained
No ratings yet
LLMs, Agents, and RAG Explained
4 pages
LLMs in Production
No ratings yet
LLMs in Production
46 pages
Understanding Retrieval-Augmented Generation
No ratings yet
Understanding Retrieval-Augmented Generation
14 pages
RAG and LLMs for AI Applications
No ratings yet
RAG and LLMs for AI Applications
12 pages
LLM Agents: Planning and Execution Insights
No ratings yet
LLM Agents: Planning and Execution Insights
76 pages
RAG: Enhancing Enterprise Data Value
No ratings yet
RAG: Enhancing Enterprise Data Value
7 pages
LLM Application Architecture Overview
No ratings yet
LLM Application Architecture Overview
15 pages
Foundations For Llms Integration
No ratings yet
Foundations For Llms Integration
53 pages
LLM Learning Roadmap Bootcamp Guide
No ratings yet
LLM Learning Roadmap Bootcamp Guide
12 pages
Lecture: Retrieval Augmented Generation
No ratings yet
Lecture: Retrieval Augmented Generation
12 pages
AI-Agent Engineering for Teaching Tools
No ratings yet
AI-Agent Engineering for Teaching Tools
169 pages
AI Product Management Essentials Guide
100% (1)
AI Product Management Essentials Guide
270 pages
Introduction to Large Language Models
No ratings yet
Introduction to Large Language Models
8 pages
LLMs and RAG in EDA AI-Agent Development
No ratings yet
LLMs and RAG in EDA AI-Agent Development
29 pages
Week3 Handout VectorDB Embeddings RAG Final
No ratings yet
Week3 Handout VectorDB Embeddings RAG Final
10 pages
LLM - Must Know Terms
No ratings yet
LLM - Must Know Terms
6 pages
Understanding Retrieval Augmented Generation
No ratings yet
Understanding Retrieval Augmented Generation
16 pages
RAG and LLMs in Semantic Search
No ratings yet
RAG and LLMs in Semantic Search
16 pages
RAG Comprehensive Guide
No ratings yet
RAG Comprehensive Guide
65 pages
AI Agents: Transforming Business Solutions
No ratings yet
AI Agents: Transforming Business Solutions
59 pages
Ai Agent
No ratings yet
Ai Agent
16 pages
A Gentle Introduction To Retrieval Augmented Generation
No ratings yet
A Gentle Introduction To Retrieval Augmented Generation
41 pages
LLM API RAG Mastercourse
No ratings yet
LLM API RAG Mastercourse
4 pages
RAG Variants: Graph, Light, Agentic
No ratings yet
RAG Variants: Graph, Light, Agentic
16 pages
AI Intern Interview Prep
No ratings yet
AI Intern Interview Prep
31 pages
Lessons from a Year with LLMs
No ratings yet
Lessons from a Year with LLMs
22 pages
Introduction to Large Language Models
No ratings yet
Introduction to Large Language Models
8 pages
RAG: Enhancing LLMs for AI Applications
No ratings yet
RAG: Enhancing LLMs for AI Applications
7 pages
660d79167ff162b823ed3869 Vector DB Whitepaper Compressed
No ratings yet
660d79167ff162b823ed3869 Vector DB Whitepaper Compressed
18 pages
Understanding Agentic RAG Technology
100% (1)
Understanding Agentic RAG Technology
18 pages
40 RAG Interview Questions Answers Beginner Advanced 1771871465
No ratings yet
40 RAG Interview Questions Answers Beginner Advanced 1771871465
41 pages
Building Local Agents with LangGraph
100% (3)
Building Local Agents with LangGraph
48 pages
Augmented Language Models in 2023
No ratings yet
Augmented Language Models in 2023
95 pages
Advanced RAG Techniques for LLMs
No ratings yet
Advanced RAG Techniques for LLMs
54 pages
Retrieval-Augmented Generation (RAG) : 2. The Limitations of Traditional Large Language Models
No ratings yet
Retrieval-Augmented Generation (RAG) : 2. The Limitations of Traditional Large Language Models
15 pages
AI - ML - Basics - Repaired
No ratings yet
AI - ML - Basics - Repaired
18 pages
Tata NeuSkills Mastering Generative AI Curriculum
No ratings yet
Tata NeuSkills Mastering Generative AI Curriculum
2 pages
Mastering Large Language Models Skills
No ratings yet
Mastering Large Language Models Skills
17 pages
Agentic AI Engineer Basics
No ratings yet
Agentic AI Engineer Basics
11 pages
Understanding LLM Systems End-to-End: AI Engineering Insider
No ratings yet
Understanding LLM Systems End-to-End: AI Engineering Insider
26 pages
Generative AI and LLMs in Business
No ratings yet
Generative AI and LLMs in Business
53 pages
LLM Rag
No ratings yet
LLM Rag
8 pages
RAG Based Academic Tutor
No ratings yet
RAG Based Academic Tutor
15 pages
Overview of AI Agents by Sam Bhagwat
100% (1)
Overview of AI Agents by Sam Bhagwat
12 pages
Introduction to Retrieval Augmented Generation
No ratings yet
Introduction to Retrieval Augmented Generation
47 pages
RAG: Enhancing AI with Data Integration
No ratings yet
RAG: Enhancing AI with Data Integration
8 pages
Ostad AI Bootcamp M7 C1
No ratings yet
Ostad AI Bootcamp M7 C1
45 pages
FastAPI RAG Chatbot Development Guide
100% (1)
FastAPI RAG Chatbot Development Guide
41 pages
Generative AI Engineer Training Program
No ratings yet
Generative AI Engineer Training Program
10 pages
April Documentation
No ratings yet
April Documentation
36 pages
Waste Management Challenges in Nigeria
No ratings yet
Waste Management Challenges in Nigeria
40 pages
1 Isnin
No ratings yet
1 Isnin
3 pages
AI Prompt Engineering Bootcamp Guide
No ratings yet
AI Prompt Engineering Bootcamp Guide
13 pages
Emerging Popular Arts in Asia
No ratings yet
Emerging Popular Arts in Asia
50 pages
2018 (Jachimowicz) Why Grit Requires Perseverance and Passion To Positively Predict Performance
No ratings yet
2018 (Jachimowicz) Why Grit Requires Perseverance and Passion To Positively Predict Performance
6 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
9 pages
Constitutional Values in Indian Education
No ratings yet
Constitutional Values in Indian Education
10 pages
Grant Writing Essentials for Nonprofits
No ratings yet
Grant Writing Essentials for Nonprofits
11 pages
Importance of Lesson Planning Explained
No ratings yet
Importance of Lesson Planning Explained
12 pages
Numerical Analysis Course Syllabus
No ratings yet
Numerical Analysis Course Syllabus
11 pages
Special Exam Schedule September 2025
No ratings yet
Special Exam Schedule September 2025
1 page
Mastering Google Tools for Research
No ratings yet
Mastering Google Tools for Research
2 pages
Research Grants for African Projects
No ratings yet
Research Grants for African Projects
2 pages
Course Details
No ratings yet
Course Details
1 page
2025-26 Examination Timetable
No ratings yet
2025-26 Examination Timetable
1 page
Psychosocial Readiness for College Assessment
No ratings yet
Psychosocial Readiness for College Assessment
27 pages
Turing's AI Test: Objections and Insights
No ratings yet
Turing's AI Test: Objections and Insights
1 page
Cambridge IGCSE: Isizulu As A Second Language 0531/02
No ratings yet
Cambridge IGCSE: Isizulu As A Second Language 0531/02
8 pages
Seven Philosophical Views on Critical Thinking
No ratings yet
Seven Philosophical Views on Critical Thinking
23 pages
Impact of Training on Work Outcomes
No ratings yet
Impact of Training on Work Outcomes
8 pages
Vimetco Extrusion Aluminium Profiles
No ratings yet
Vimetco Extrusion Aluminium Profiles
2 pages
DFIG Wind Turbine Modeling and Analysis
No ratings yet
DFIG Wind Turbine Modeling and Analysis
6 pages
Credit Card Fraud Detection Techniques
No ratings yet
Credit Card Fraud Detection Techniques
3 pages
CNA Gold 1 Midterm Test Answer Key
No ratings yet
CNA Gold 1 Midterm Test Answer Key
2 pages
Class 7 Maths Mid Term Exam Paper
No ratings yet
Class 7 Maths Mid Term Exam Paper
2 pages
Developing Critical Thinking in Students
No ratings yet
Developing Critical Thinking in Students
3 pages
7th Grade Science Fair Project Guide
No ratings yet
7th Grade Science Fair Project Guide
4 pages
An Interpersonal Approach To Classroom Management
No ratings yet
An Interpersonal Approach To Classroom Management
20 pages
E-Transfer Application for Teachers
No ratings yet
E-Transfer Application for Teachers
1 page
Efficient Appointment System for USTP
No ratings yet
Efficient Appointment System for USTP
67 pages

LLM Programming Tools and APIs Guide

Uploaded by

LLM Programming Tools and APIs Guide

Uploaded by

Module 2

OpenAI API Meta LLaMa 3 Google Gemini

❏ Write complex threaded queries

OpenAI API Docs: [Link]

Interact Build Embed

LLM Programming Tools

OpenAI API Notebook

Arrival time: 4:00 pm Arrival time: 4:10pm

By going back to school By being provided in-house training

● a graduate degree, ● an onboarding program at an

Instruction LLM Answer

Instruction LLM Answer

Instruction LLM Answer

Instruction LLM Answer

● Store a pre-constructed similarity graph of document chunks

M=2 2 ● A new chunk (from

M=2 2 ● Choose a random chunk

M=2 2 ● Check distance

● Hierarchical Navigable Small Worlds (HNSW) algorithm

Instruction LLM Answer

Instruction LLM Answer

● The pre-trained LLM is given specific labeled examples

Encoder Encoder 2 Decoder n

Embedding Position Embedding Position

I love you (INPUT) te amo (TARGET)

n nodes Encoder i+1 Adaptor

You might also like