0% found this document useful (0 votes)

18 views5 pages

Ollama Notes

The document classifies AI models into proprietary (closed source) and open source (open weight), detailing their characteristics and accessibility. It introduces Ollama, a tool for running open source LLMs locally, emphasizing benefits like privacy and zero cost, and explains the Modelfile used for customizing models. Additionally, it covers Tool Calling for real-time data access and the integration of Ollama with LangChain for enhanced functionality, concluding with the upcoming Ollama Cloud for running larger models while maintaining user experience.

Uploaded by

sudhansuparida977

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views5 pages

Ollama Notes

Uploaded by

sudhansuparida977

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

1.

Classification of AI Models
Models are classified based on their accessibility and level of control:

Proprietary Models (Closed Source)

Owned and controlled by specific companies (e.g., OpenAI, Google, Anthropic).

● Black Box Nature: The source code, training data, and internal weights are hidden.
Users cannot inspect the "why" behind a decision.
● Access Method: Usually via API (Application Programming Interface) or paid
subscriptions.
● Deployment: Cloud-based. You send your data to their servers, and they send back the
answer.

Open Source Models (Open Weight)

Core details are shared with the public (e.g., Meta’s Llama 3, Mistral, DeepSeek).

● Transparency: Architecture and weights are public, allowing for full inspection and local
hosting.
● Customization: Can be "fine-tuned" on private data to create specialized experts.
● Access: Downloadable for free from platforms like Hugging Face.

2. Ollama: The Local AI Orchestrator

Ollama is a tool that allows you to run these Open Source LLMs directly on your own hardware.
It simplifies the complex process of setting up and managing AI models locally.

Benefit Description

Privacy Sensitive data (legal files, medical records) never leaves your local
machine.

Zero Cost No "pay-per-token" fees. You only pay for your hardware and
electricity.
Offline Works without an internet connection once the model is
Access downloaded.

Simple CLI Manage models like a "Play Store" using commands like run, pull,
and rm.

No Lock-in You aren't tied to a specific cloud vendor's pricing or terms.

System Requirements

• Operating System: Windows, macOS, or Linux.

• Hardware: Laptop/PC with at least 8 GB of RAM (more RAM improves smoothness).

• Storage: Sufficient disk space is required, as models range from 3 GB to 15 GB.

• Connectivity: Internet is required only for the initial download of the tool and models.

• Skills: Basic knowledge of command line/terminal functions.

• GPU: Optional, but speeds up performance if available.

3. The Modelfile: The Blueprint of Your Model

A Modelfile is a configuration file that allows you to customize an LLM's personality, settings,
and behavior. It works similarly to a "Dockerfile"—it doesn't store the massive brain (the
weights), but it provides the instructions on how to use it.

Core Components of a Modelfile

The Modelfile uses specific directives to build a custom model:

● FROM (Required): Defines the base model you are building upon.
○ Theory: You take a generic model (like llama3.2) and use it as the foundation.
● SYSTEM: Sets the "Identity" or "System Prompt."
○ Theory: This defines the permanent rules the model must follow (e.g., "You are a
professional accountant who only speaks in bullet points").
● PARAMETER: Adjusts the technical "knobs" of the model.
○Theory: You can control Temperature (creativity vs. logic), Num_Ctx (how much
history it remembers), and Stop Sequences (where the model should stop
talking).
● TEMPLATE: Defines the interaction format.
○ Theory: It structures how the user's prompt and the system's response are
"wrapped" so the LLM understands who is speaking.
● MESSAGE: Pre-loads conversation history.
○ Theory: You can give the model "examples" of how to behave by providing a few
sample questions and answers within the blueprint.

The Creation Process

1. Write: You create a text file named Modelfile.

2. Build: You run a command (ollama create) which "packages" your instructions with
the base model.
3. Deploy: You now have a new model identity (e.g., legal-assistant) that appears
in your ollama list.

4. Tool Calling: Giving LLMs "Hands"

Tool Calling (or Function Calling) is the process where an LLM realizes it cannot answer a
question on its own and requests to use an external tool.

The Theoretical Workflow

Instead of just "chatting," tool calling follows a 4-step loop:

1. Declaration (The Menu):

○ You provide the LLM with a list of "Tools" (functions) described in JSON
Schema.
○ Theory: You aren't giving the model the code; you are giving it a "User Manual"
for the tool (e.g., "I have a tool called get_weather that needs a city name").
2. Recognition (The Decision):
○ The user asks: "What is the temperature in Paris?"
○ The LLM realizes it doesn't know live data. It looks at its "Menu" and decides: "I
need to call get_weather(city='Paris')."
○ Crucial Point: The LLM does not run the code. It simply outputs a "request" in
JSON format.
3. Execution (The Action):
○ Your local system (or LangChain) sees the LLM's request.
○ It runs the actual code (e.g., hits a weather API or checks a database).
○ It collects the result (e.g., "22°C").
4. Integration (The Final Answer):
○ The result ("22°C") is sent back to the LLM.
○ The LLM reads that result and finally answers the user: "The temperature in Paris
is currently 22°C."
Why use Tool Calling locally?
● Real-time Data: LLMs are frozen in time; tools let them see today's news or stock
prices.
● Accuracy: LLMs are bad at math; a tool can send a calculation to a Python script for a
100% correct answer.
● Action-Oriented: Tools allow the AI to actually do things, like sending an email or
saving a file to your desktop.

5. Ollama and LangChain: The Orchestration Layer

While Ollama acts as the engine (running the model), LangChain acts as the brain or the "glue"
that connects that engine to your data, tools, and workflows.

The Role of LangChain

In a standard setup, LangChain provides a standardized interface. This means you can write
your application logic once and switch between a local Ollama model and a cloud model (like
OpenAI) by changing just one line of configuration.

Key Integration Concepts

● Prompt Templating: LangChain manages complex prompts. Instead of sending raw
text to Ollama, LangChain structures it into "System," "AI," and "Human" messages to
ensure the local model follows instructions strictly.
● Chains: You can link multiple Ollama calls together. For example:
1. Chain 1: Use a small Ollama model (like Phi-3) to summarize a document.
2. Chain 2: Use a larger Ollama model (like Llama 3) to answer a specific question
based on that summary.
● RAG (Retrieval-Augmented Generation): This is the most popular use case.
LangChain "retrieves" relevant facts from your private files (PDFs, Excel) and feeds
them to the local Ollama model as "context" so it can answer questions about your
private data without that data ever leaving your machine.
● Memory: LangChain handles the conversation history. It stores previous turns of a chat
and resends them to Ollama so the local model "remembers" what you said earlier in the
conversation.

6. Ollama Cloud (Released in late 2025/2026)

Ollama Cloud is a hybrid expansion of the local Ollama tool. It allows users to run massive
models that a standard laptop cannot handle while maintaining the same simple user
experience.

How it Works
● Remote Inference: Instead of ollama run llama3, you can run ollama run
llama3-cloud. The command stays the same, but the heavy lifting happens on
Ollama’s high-performance servers.
● Hybrid "Bursting": You can develop and test locally on a 7B (7 billion parameter)
model. When you need "God-mode" reasoning for a complex task, you "burst" that
specific query to Ollama Cloud to run a 400B+ parameter model.
● Zero-Configuration Sync: Your local Modelfiles can be pushed to Ollama Cloud. This
ensures that the "personality" and "instructions" you built locally behave exactly the
same way in the cloud.

Core Benefits of the Cloud Tier

● Hardware Independence: You can run state-of-the-art models from a cheap
Chromebook or an old tablet.
● Battery Life: Local inference is heavy on power; using the cloud tier saves your laptop’s
battery during long sessions.
● Privacy-First Cloud: Unlike standard proprietary APIs, Ollama Cloud is designed with
"Stateless Inference"—meaning they process the request but do not store your data for
training.

Ollama RAG Deployment Guide
No ratings yet
Ollama RAG Deployment Guide
10 pages
Run and Customize Llms Locally With Ollama
No ratings yet
Run and Customize Llms Locally With Ollama
45 pages
Ollama: Setup and Python API Guide
No ratings yet
Ollama: Setup and Python API Guide
9 pages
Open-Source AI: Tools & Resources Guide
No ratings yet
Open-Source AI: Tools & Resources Guide
75 pages
Modelfile New
No ratings yet
Modelfile New
15 pages
Quick Setup for AI Development Environment
No ratings yet
Quick Setup for AI Development Environment
2 pages
LM Studio: Local LLMs for Privacy
No ratings yet
LM Studio: Local LLMs for Privacy
13 pages
Introduction to Ollama LLMs
No ratings yet
Introduction to Ollama LLMs
14 pages
Integrating Ollama with Langchain Guide
No ratings yet
Integrating Ollama with Langchain Guide
23 pages
Week 9 - RAG Deployment Componenets
No ratings yet
Week 9 - RAG Deployment Componenets
20 pages
LLMs for Smart Contract Coding Guide
No ratings yet
LLMs for Smart Contract Coding Guide
16 pages
Build Your Own Gen AI App: A Guide
No ratings yet
Build Your Own Gen AI App: A Guide
6 pages
Code
No ratings yet
Code
75 pages
Ollama & Langchain Integration Guide
No ratings yet
Ollama & Langchain Integration Guide
24 pages
Ollama & Langchain: Local AI Guide
No ratings yet
Ollama & Langchain: Local AI Guide
27 pages
LLMQuantization Document
No ratings yet
LLMQuantization Document
80 pages
LLMs in Production-MLC - GRC
No ratings yet
LLMs in Production-MLC - GRC
39 pages
Getting Started with Llama 2 Guide
No ratings yet
Getting Started with Llama 2 Guide
37 pages
Tool Use in Agent Function Calling
No ratings yet
Tool Use in Agent Function Calling
21 pages
Designing Retrieval Augmented Generation
No ratings yet
Designing Retrieval Augmented Generation
32 pages
Run Large Language Models Locally with Ollama
No ratings yet
Run Large Language Models Locally with Ollama
37 pages
Running LLMs Locally: A Practical Guide
No ratings yet
Running LLMs Locally: A Practical Guide
8 pages
LLM Rag
No ratings yet
LLM Rag
8 pages
Build Your Own Unrestricted AI Agent
No ratings yet
Build Your Own Unrestricted AI Agent
20 pages
LLMOps Tools for Open-Source Frameworks
No ratings yet
LLMOps Tools for Open-Source Frameworks
10 pages
Run LLMs Locally with Ollama Framework
100% (1)
Run LLMs Locally with Ollama Framework
11 pages
Ollama: Local AI for Privacy and Control
No ratings yet
Ollama: Local AI for Privacy and Control
6 pages
Creating a Personal LLM Agent
No ratings yet
Creating a Personal LLM Agent
30 pages
FastAPI RAG Chatbot Development Guide
100% (1)
FastAPI RAG Chatbot Development Guide
41 pages
Essential LLMOps Toolkit 2025 Guide
100% (2)
Essential LLMOps Toolkit 2025 Guide
12 pages
AI Fundamentals and Tools
No ratings yet
AI Fundamentals and Tools
25 pages
Hybrid AI: Integrating LLMs & Traditional Models
No ratings yet
Hybrid AI: Integrating LLMs & Traditional Models
15 pages
Build Your AI Assistant Locally
100% (1)
Build Your AI Assistant Locally
193 pages
AI Agents: Build & Deploy with Vertex AI
No ratings yet
AI Agents: Build & Deploy with Vertex AI
54 pages
Building Applications Using GenAI
No ratings yet
Building Applications Using GenAI
34 pages
Chapter 4 - Building A Tool-Based Agentic AI Framework - Design Multi-Agent AI Systems Using MCP and A2A
No ratings yet
Chapter 4 - Building A Tool-Based Agentic AI Framework - Design Multi-Agent AI Systems Using MCP and A2A
45 pages
LLM Automation Framework Guide
100% (2)
LLM Automation Framework Guide
23 pages
Ollama: Local LLM Runtime Explained
No ratings yet
Ollama: Local LLM Runtime Explained
2 pages
Ollama Meetup 2: Vision-Language Models
No ratings yet
Ollama Meetup 2: Vision-Language Models
40 pages
Deploying LLMs: Strategies & Costs
No ratings yet
Deploying LLMs: Strategies & Costs
186 pages
Generative AI Technology Ecosystem Explained
No ratings yet
Generative AI Technology Ecosystem Explained
3 pages
PDAI L4 LLMs
No ratings yet
PDAI L4 LLMs
55 pages
Generative AI and LLMs in Business
No ratings yet
Generative AI and LLMs in Business
53 pages
Working Locally With Open Source Llms
No ratings yet
Working Locally With Open Source Llms
88 pages
Ollama and Langchain Integration Guide
No ratings yet
Ollama and Langchain Integration Guide
35 pages
The Architecture of Tomorrow - How Digital Ecosystems, Automation, and Artificial Intelligence Are Rewriting The Human Script
No ratings yet
The Architecture of Tomorrow - How Digital Ecosystems, Automation, and Artificial Intelligence Are Rewriting The Human Script
11 pages
AI & ML Concepts for MLOps Engineers
No ratings yet
AI & ML Concepts for MLOps Engineers
10 pages
PTCL Business Analytics Overview
No ratings yet
PTCL Business Analytics Overview
23 pages
SketchUp 8 Tool Operations Guide
No ratings yet
SketchUp 8 Tool Operations Guide
7 pages
Synopsis
No ratings yet
Synopsis
2 pages
IP Office™ Basic Integration and Configuration 7720
No ratings yet
IP Office™ Basic Integration and Configuration 7720
31 pages
Voodoo 2 Processor Scaling Results
No ratings yet
Voodoo 2 Processor Scaling Results
66 pages
Overview of Operating System Functions
No ratings yet
Overview of Operating System Functions
3 pages
CF1200AT Cooling Tower Performance Data
No ratings yet
CF1200AT Cooling Tower Performance Data
8 pages
Mobile Security Surveillance Robot
No ratings yet
Mobile Security Surveillance Robot
41 pages
Cool Python Project Ideas for Beginners
100% (2)
Cool Python Project Ideas for Beginners
31 pages
IoT-Enabled Smart Parking System Design
No ratings yet
IoT-Enabled Smart Parking System Design
15 pages
SIWES Report at Global Communication
No ratings yet
SIWES Report at Global Communication
24 pages
Dell EMC PowerEdge R740 Installation and Service Manual
No ratings yet
Dell EMC PowerEdge R740 Installation and Service Manual
172 pages
Class 8 Computer Science Agenda
No ratings yet
Class 8 Computer Science Agenda
4 pages
Blockchain-Based Anomaly Detection Techniques
No ratings yet
Blockchain-Based Anomaly Detection Techniques
10 pages
Master Sage Line 100 Management Guide
No ratings yet
Master Sage Line 100 Management Guide
18 pages
CCP Project: Parallel Algorithm Design
No ratings yet
CCP Project: Parallel Algorithm Design
2 pages
A3000 A-SMGCS Operational Manual
100% (1)
A3000 A-SMGCS Operational Manual
57 pages
Procreate Installation Guide
No ratings yet
Procreate Installation Guide
11 pages
G70A Monitor Troubleshooting Guide
No ratings yet
G70A Monitor Troubleshooting Guide
23 pages
Dell Latitude Service Tag Results
No ratings yet
Dell Latitude Service Tag Results
3 pages
Online Graphic Design Principles
No ratings yet
Online Graphic Design Principles
49 pages
VirtualBox and Hadoop Installation Guide
No ratings yet
VirtualBox and Hadoop Installation Guide
76 pages
Year 5 Computing End-of-Year Test
100% (1)
Year 5 Computing End-of-Year Test
10 pages
IoT Fundamentals and Development Overview
No ratings yet
IoT Fundamentals and Development Overview
16 pages
Elementor Mastery: Build Websites Easily
100% (2)
Elementor Mastery: Build Websites Easily
196 pages
EP-700 Thermal Printer User Manual
No ratings yet
EP-700 Thermal Printer User Manual
26 pages
Telegram Desktop Log Report
No ratings yet
Telegram Desktop Log Report
4 pages
Product Database Template Someka V2F
No ratings yet
Product Database Template Someka V2F
8 pages
SS2 Computer Third Term Scheme
No ratings yet
SS2 Computer Third Term Scheme
10 pages
Hydrography Mapping in ArcGIS 10.8
No ratings yet
Hydrography Mapping in ArcGIS 10.8
2 pages