Generative AI Lab Manual for CSE

RAJARAJESWARI COLLEGE OF ENGINEERING

MYSORE ROAD, BANGALORE-560074

(An ISO 9001:2008 Certified Institute)

(2024-25)

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

[IoT, Cybersecurity including Blockchain Technology]

Generative AI Lab Manual

Prepared By
Ghouse Pasha
Assistant Professor
Dept of CSE(IC), RRCE
Lab 1: Explore pre-trained word vectors. Explore word relationships using
vector arithmetic. Perform arithmetic operations and analyze results.
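
The vector arithmetic named in this task can be sketched on a handful of toy vectors before loading a real model. The 2-D vectors below are invented for illustration (they are not taken from GloVe), and the helper names cosine and analogy are hypothetical. Library routines such as gensim's most_similar may normalise vectors differently, so treat this as a simplified picture of the idea rather than the library's exact procedure.

```python
import math

# Toy 2-D vectors invented for illustration only
# (axis 0 loosely "royalty", axis 1 loosely "gender").
toy_vectors = {
    "king":  [0.9, 0.8],
    "queen": [0.9, -0.8],
    "man":   [0.1, 0.8],
    "woman": [0.1, -0.8],
    "apple": [-0.7, 0.1],
}

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def analogy(a, b, c, vectors):
    """Return the word closest to vec(a) - vec(b) + vec(c), excluding the inputs."""
    target = [x - y + z for x, y, z in zip(vectors[a], vectors[b], vectors[c])]
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(target, vectors[w]))

print(analogy("king", "man", "woman", toy_vectors))  # prints "queen"
```

With these toy values, king - man + woman lands exactly on the queen vector, which is why queen is returned.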

Code:

# Requires: pip install gensim
import gensim.downloader as api

# Download (on first use) and load the 50-dimensional GloVe vectors
print("Loading the model, please wait...")
model = api.load('glove-wiki-gigaword-50')
print("Model loaded successfully!")

# Embedding of a single word
word_vector = model['king']
print(f"\nVector for 'king':\n{word_vector}")

# Vector arithmetic: king - man + woman
result = model.most_similar(positive=['king', 'woman'], negative=['man'], topn=1)
print(f"\n'king' - 'man' + 'woman' ≈ {result[0][0]} with similarity score {result[0][1]:.2f}")

# Cosine similarity between two words
similarity = model.similarity('king', 'queen')
print(f"\nSimilarity between 'king' and 'queen': {similarity:.2f}")

# Word that fits least with the others
odd_one = model.doesnt_match(['breakfast', 'lunch', 'dinner', 'car'])
print(f"\nOdd one out: {odd_one}")

Output:
Lab 2: Use dimensionality reduction (e.g., PCA or t-SNE) to visualize word
embeddings for Q1. Select 10 words from a specific domain (e.g., sports,
technology) and visualize their embeddings. Analyze clusters and
relationships. Generate contextually rich outputs using embeddings. Write
a program to generate 5 semantically similar words for a given input.
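
The last part of this task, generating 5 semantically similar words for a given input, can be sketched as a ranking by cosine similarity. The function name most_similar_words and the small 3-D vectors are assumptions made up for this example. With vectors loaded through gensim, the built-in most_similar(word, topn=5) method of the loaded model does this ranking directly.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))

def most_similar_words(word, vectors, topn=5):
    """Rank every other word in `vectors` by cosine similarity to `word`."""
    if word not in vectors:
        raise KeyError(f"'{word}' is not in the vocabulary")
    query = vectors[word]
    scored = [(other, cosine(query, vec))
              for other, vec in vectors.items() if other != word]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]

# Hypothetical 3-D embeddings, invented for illustration only
toy_vectors = {
    "computer":  [0.90, 0.10, 0.00],
    "laptop":    [0.85, 0.15, 0.05],
    "software":  [0.70, 0.30, 0.10],
    "hardware":  [0.75, 0.20, 0.00],
    "network":   [0.50, 0.50, 0.10],
    "algorithm": [0.40, 0.60, 0.20],
    "football":  [0.00, 0.10, 0.90],
}

for similar_word, score in most_similar_words("computer", toy_vectors, topn=5):
    print(f"{similar_word}: {score:.3f}")
```

The input word itself is excluded from the ranking, since it would always score 1.0.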

Code:

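# Requires: pip install gensim scikit-learn matplotlib
import gensim.downloader as api
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE

# Large download on first use
word_vectors = api.load("word2vec-google-news-300")

# Ten words from the technology domain
words = ["computer", "laptop", "AI", "machine", "robot", "software", "hardware",
         "algorithm", "network", "cybersecurity"]
# Keep only the words the model knows, so the lookup below cannot raise KeyError
words = [word for word in words if word in word_vectors]
vectors = np.array([word_vectors[word] for word in words])

def plot_embeddings(vectors, words, method="PCA"):
    # Project the 300-dimensional vectors down to 2-D
    if method == "PCA":
        reduced = PCA(n_components=2).fit_transform(vectors)
    else:
        # perplexity must be smaller than the number of words plotted
        reduced = TSNE(n_components=2, perplexity=5,
                       random_state=42).fit_transform(vectors)
    plt.figure(figsize=(8, 6))
    plt.scatter(reduced[:, 0], reduced[:, 1])
    for i, word in enumerate(words):
        plt.annotate(word, (reduced[i, 0], reduced[i, 1]), fontsize=12)
    plt.title(f"Word Embedding Visualization using {method}")
    plt.show()

plot_embeddings(vectors, words, method="PCA")
plot_embeddings(vectors, words, method="t-SNE")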

Output:
Google colab

Lab 3: Train a custom Word2Vec model on a small dataset. Train
embeddings on a domain-specific corpus (e.g., legal, medical) and analyze
how embeddings capture domain-specific semantics.
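
To see what a Word2Vec model learns from, it can help to list the (center word, context word) pairs that a context window produces. The sketch below is a simplified illustration with a hypothetical helper context_pairs. The actual gensim training loop does more (for example, it can shrink the window at random), so this is not a description of the library's internals.

```python
def context_pairs(sentence, window=3):
    """Return (center, context) pairs for words at most `window` positions apart."""
    pairs = []
    for i, center in enumerate(sentence):
        start = max(0, i - window)
        end = min(len(sentence), i + window + 1)
        for j in range(start, end):
            if j != i:  # a word is not its own context
                pairs.append((center, sentence[j]))
    return pairs

sentence = ['patient', 'diagnosed', 'cancer', 'treatment', 'chemotherapy']

# With window=1 only direct neighbours are paired
for center, context in context_pairs(sentence, window=1):
    print(center, "->", context)
```

Words that repeatedly share contexts (here, words from the same medical sentence) are pushed toward similar vectors, which is how a domain corpus shapes the embeddings.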

Code:

!pip install gensim matplotlib scikit-learn

import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE
from gensim.models import Word2Vec

# Tiny tokenized medical corpus: one list of tokens per sentence
medical_sentences = [
    ['patient', 'diagnosed', 'cancer', 'treatment', 'chemotherapy'],
    ['doctor', 'prescribes', 'medication', 'therapy', 'recovery'],
    ['hospital', 'surgery', 'nurse', 'care', 'treatment'],
    ['virus', 'infection', 'vaccine', 'immune', 'system'],
    ['diabetes', 'insulin', 'blood', 'sugar', 'health'],
    ['heart', 'disease', 'cardiac', 'attack', 'stroke'],
    ['brain', 'neuroscience', 'mental', 'health', 'psychology'],
    ['radiology', 'MRI', 'X-ray', 'diagnosis', 'scan'],
    ['nutrition', 'diet', 'exercise', 'wellness', 'fitness'],
    ['epidemic', 'pandemic', 'COVID', 'quarantine', 'vaccine']
]

# min_count=1 keeps every word, which a corpus this small needs
model = Word2Vec(sentences=medical_sentences, vector_size=100, window=3,
                 min_count=1, workers=4)

similar_words = model.wv.most_similar('treatment', topn=5)
print("\nTop 5 words similar to 'treatment':")
for word, score in similar_words:
    print(f"{word}: {score:.4f}")

words = list(model.wv.index_to_key)  # Get vocabulary
word_vectors = np.array([model.wv[word] for word in words])

# Reduce the 100-dimensional vectors to 2-D for plotting
tsne = TSNE(n_components=2, random_state=0, perplexity=3)
word_vectors_2d = tsne.fit_transform(word_vectors)

plt.figure(figsize=(8, 6))
for i, word in enumerate(words):
    plt.scatter(word_vectors_2d[i, 0], word_vectors_2d[i, 1])
    plt.text(word_vectors_2d[i, 0] + 0.05, word_vectors_2d[i, 1] + 0.05, word, fontsize=12)
plt.title("t-SNE Visualization of Custom Medical Word Embeddings")
plt.show()

Output:
Lab 4: Use word embeddings to improve prompts for a Generative AI
model. Retrieve similar words using word embeddings. Use the similar
words to enrich a GenAI prompt. Use the AI model to generate
responses for the original and enriched prompts. Compare the outputs
in terms of detail and relevance.
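
One rough way to compare two responses "in terms of detail and relevance" is to count words as a proxy for detail and count matched keywords as a proxy for relevance. Both proxies, the function name compare_responses, and the two sample responses are assumptions invented for this sketch. A real comparison would normally also involve reading the outputs.

```python
def compare_responses(original, enriched, keywords):
    """Return word counts and keyword hits for two responses.

    `keywords` is a set of lowercase words treated as relevant.
    """
    def stats(text):
        # Lowercase each token and strip surrounding punctuation
        tokens = [t.strip(".,!?;:()").lower() for t in text.split()]
        tokens = [t for t in tokens if t]
        hits = sorted({t for t in tokens if t in keywords})
        return {"word_count": len(tokens), "keyword_hits": hits}

    return {"original": stats(original), "enriched": stats(enriched)}

# Invented sample responses, used only to exercise the function
original_response = "Dogs are loyal."
enriched_response = "Dogs are loyal pets; this mammal is a friendly animal."

report = compare_responses(original_response, enriched_response,
                           keywords={"pet", "animal", "mammal"})
print(report)
```

Exact matching is deliberately naive here: "pets" does not count as a hit for "pet", which shows why stemming or embedding-based matching may be worth adding.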

Code:

!pip install sentence-transformers

from sentence_transformers import SentenceTransformer, util
import torch

model = SentenceTransformer('all-MiniLM-L6-v2')

# Fixed candidate vocabulary; similar words are chosen from this list only
candidate_words = ['dog', 'cat', 'animal', 'pet', 'mammal', 'food']

def get_similar_words(word, top_k=5):
    """
    Finds similar words using word embeddings.

    Args:
        word: The word to find similar words for.
        top_k: The number of similar words to return.

    Returns:
        A list of similar words.
    """
    embeddings = model.encode([word], convert_to_tensor=True)
    candidate_embeddings = model.encode(candidate_words, convert_to_tensor=True)
    cosine_scores = util.pytorch_cos_sim(embeddings, candidate_embeddings)
    top_results = torch.topk(cosine_scores[0], k=top_k)
    similar_words = []
    for score, idx in zip(top_results[0], top_results[1]):
        similar_words.append(candidate_words[idx.item()])
    return similar_words

def enrich_prompt(prompt):
    """
    Enriches a prompt with similar words.

    Args:
        prompt: The original prompt.

    Returns:
        The enriched prompt.
    """
    words = prompt.split()
    enriched_prompt = ""
    for word in words:
        similar_words = get_similar_words(word)
        enriched_prompt += word + " (" + ", ".join(similar_words) + ") "
    return enriched_prompt

original_prompt = "Describe the characteristics of a dog."
enriched_prompt = enrich_prompt(original_prompt)

def generate_response(prompt):
    """
    Generates a response from a GenAI model.

    Args:
        prompt: The prompt to use.

    Returns:
        The generated response.
    """
    # Placeholder: replace this line with a call to an actual GenAI model
    response = f"Response for prompt: {prompt}"
    return response

original_response = generate_response(original_prompt)
enriched_response = generate_response(enriched_prompt)

print(f"Original Prompt: {original_prompt}")
print(f"Original Response: {original_response}")
print(f"Enriched Prompt: {enriched_prompt}")
print(f"Enriched Response: {enriched_response}")

Output:
Lab 5: Use word embeddings to create meaningful sentences for creative
tasks. Retrieve similar words for a seed word. Create a sentence or story
using these words as a starting point. Write a program that: Takes a seed
word. Generates similar words. Constructs a short paragraph using these
words.

Code:

from sentence_transformers import SentenceTransformer, util
import torch

model = SentenceTransformer('all-MiniLM-L6-v2')

# Fixed candidate vocabulary; similar words are chosen from this list only
candidate_words = ['dog', 'cat', 'animal', 'pet', 'mammal', 'food',
                   'happy', 'sad', 'excited', 'angry']

def get_similar_words(word, top_k=5):
    """
    Finds similar words using word embeddings.

    Args:
        word: The word to find similar words for.
        top_k: The number of similar words to return.

    Returns:
        A list of similar words.
    """
    embeddings = model.encode([word], convert_to_tensor=True)
    candidate_embeddings = model.encode(candidate_words, convert_to_tensor=True)
    cosine_scores = util.pytorch_cos_sim(embeddings, candidate_embeddings)
    top_results = torch.topk(cosine_scores[0], k=top_k)
    similar_words = []
    for score, idx in zip(top_results[0], top_results[1]):
        similar_words.append(candidate_words[idx.item()])
    return similar_words

def create_sentence(seed_word):
    """
    Creates a short paragraph using similar words.

    Args:
        seed_word: The seed word to start with.

    Returns:
        A short paragraph.
    """
    similar_words = get_similar_words(seed_word)
    # Fill a fixed template with the four most similar words
    sentence = (
        f"The {seed_word} was {similar_words[0]}, and it made me feel "
        f"{similar_words[1]}. I wondered if it was like a {similar_words[2]}, "
        f"or maybe more like a {similar_words[3]}."
    )
    return sentence

seed_word = "sunrise"
paragraph = create_sentence(seed_word)
print(paragraph)

Output:
Lab 6: Use a pre-trained Hugging Face model to analyze sentiment in text.
Assume a real-world application. Load the sentiment analysis pipeline and
analyze the sentiment of the sentences given as input.
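
The label and confidence score reported by a sentiment classifier typically come from turning the model's raw output scores (logits) into probabilities with softmax and picking the largest. The sketch below shows that step in isolation. The logit values are invented, and the label order ("NEGATIVE", "POSITIVE") is an assumption about a two-class model, not something read from a specific checkpoint.

```python
import math

def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    largest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - largest) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def to_prediction(logits, labels=("NEGATIVE", "POSITIVE")):
    """Pick the most probable label and report its probability as the score."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"label": labels[best], "score": probs[best]}

# Invented logits for a clearly positive sentence
prediction = to_prediction([-2.0, 3.0])
print(f"Sentiment: {prediction['label']} (Confidence: {prediction['score']:.2f})")
```

A score close to 0.5 on a two-class model means the classifier is nearly undecided, which is worth keeping in mind for mixed sentences such as "okay, but nothing special".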

Code:
from transformers import pipeline

# With no model named, the pipeline downloads its default English sentiment model
sentiment_pipeline = pipeline("sentiment-analysis")

def analyze_sentiment(text):
    result = sentiment_pipeline(text)[0]  # Get the first result
    label = result["label"]
    confidence = result["score"]
    return f"Sentiment: {label} (Confidence: {confidence:.2f})"

# Sample inputs, e.g. customer reviews
texts = [
    "I love this product! It's amazing.",
    "This is the worst experience I've ever had.",
    "The movie was okay, but nothing special.",
    "I'm extremely happy with my new laptop!",
    "This service is so frustrating and disappointing."
]

for text in texts:
    print(f"Text: {text}")
    print(analyze_sentiment(text))
    print("-" * 50)
Output:
Lab 7: Summarize long texts using a pre-trained Hugging Face
summarization model. Load the summarization pipeline. Take a
passage as input and obtain the summarized text.
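
Summarization models accept only a limited amount of input, so a genuinely long text usually has to be split before it is summarized. The sketch below splits by word count and summarizes each piece. The helper names chunk_text and summarize_long_text and the 400-word default are assumptions for illustration (models count tokens, not words), and the demo uses a stand-in summarizer so that it runs without downloading a model.

```python
def chunk_text(text, max_words=400):
    """Split text into consecutive chunks of at most `max_words` words."""
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]

def summarize_long_text(text, summarize, max_words=400):
    """Summarize each chunk with the callable `summarize` and join the results."""
    return " ".join(summarize(chunk) for chunk in chunk_text(text, max_words))

# Stand-in summarizer for the demo: keeps the first three words of a chunk
def first_three_words(chunk):
    return " ".join(chunk.split()[:3])

demo_text = "one two three four five six seven eight nine ten"
print(chunk_text(demo_text, max_words=4))
print(summarize_long_text(demo_text, first_three_words, max_words=4))
```

In the lab itself, any function that maps a string to its summary could be passed as the summarize argument in place of the stand-in.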

Code:
from transformers import pipeline

# With no model named, the pipeline downloads its default summarization model
summarizer = pipeline("summarization")

def summarize_text(text, max_length=130, min_length=30):
    """
    Summarizes a long text using a pre-trained summarization model.

    Args:
        text: The text to summarize.
        max_length: The maximum length of the summary.
        min_length: The minimum length of the summary.

    Returns:
        The summarized text.
    """
    summary = summarizer(text, max_length=max_length, min_length=min_length,
                         do_sample=False)[0]['summary_text']
    return summary

passage = """
The Gemini API gives you access to Gemini models created by Google DeepMind. Gemini
models are built from the ground up to be multimodal, so you can reason seamlessly across
text, images, code, and audio.
"""

summary = summarize_text(passage)
print(summary)
Output:
Lab 9: Take the institution name as input. Use Pydantic to define the schema
for the desired output and create a custom output parser. Invoke the chain
and fetch the results. Extract the following institution-related details from
Wikipedia: the founder of the institution, when it was founded, the current
branches of the institution, how many employees work in it, and a brief
4-line summary of the institution.
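
As a small illustration of what a custom parser for one of these fields might look like, the sketch below pulls a founding year out of free text with a regular expression. The pattern, the function name extract_founding_year, and the sample sentence are all assumptions invented for this example. Keyword heuristics like this are brittle, so it is a starting point rather than a reliable extractor.

```python
import re
from typing import Optional

# Hypothetical heuristic: a 4-digit year that follows "founded" or
# "established" within the same sentence.
FOUNDING_PATTERN = re.compile(
    r"\b(?:founded|established)\b[^.]*?\b(1[0-9]{3}|20[0-9]{2})\b",
    re.IGNORECASE,
)

def extract_founding_year(text: str) -> Optional[int]:
    """Return the first founding year found in `text`, or None."""
    match = FOUNDING_PATTERN.search(text)
    return int(match.group(1)) if match else None

# Invented sample text, not taken from Wikipedia
sample = ("Example Institute of Technology is a private college. "
          "It was established in 1998 by a charitable trust.")
print(extract_founding_year(sample))  # prints 1998
```

The returned integer (or None) fits a schema field declared as an optional integer, so a failed extraction leaves the field empty instead of raising an error.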

Code:

!pip install wikipedia
!pip install pydantic

import wikipedia
from pydantic import BaseModel, Field
from typing import List, Optional

class InstitutionDetails(BaseModel):
    """
    Pydantic schema for institution details.
    """
    founder: Optional[str] = Field(None, description="Founder of the institution")
    founded: Optional[int] = Field(None, description="Year of founding")
    branches: Optional[List[str]] = Field(None, description="Current branches of the institution")
    num_employees: Optional[int] = Field(None, description="Number of employees")
    summary: Optional[str] = Field(None, description="A brief summary of the institution")

def parse_wikipedia_page(page_title: str) -> InstitutionDetails:
    """
    Builds the institution details for a Wikipedia page.

    Args:
        page_title (str): The title of the Wikipedia page.

    Returns:
        InstitutionDetails: Parsed institution details.
    """
    details = InstitutionDetails()
    try:
        # Basic parsing - replace with more robust methods for production.
        # wikipedia.summary() expects a page title, not the page text.
        details.summary = wikipedia.summary(page_title, sentences=4, auto_suggest=False)
        # Further parsing would require more advanced NLP techniques like NER or
        # dependency parsing, as simply searching for keywords is unreliable.
    except Exception as e:
        print(f"Error parsing Wikipedia page: {e}")
    return details

if __name__ == "__main__":
    institution_name = input("Enter the institution name: ")
    try:
        page = wikipedia.page(institution_name)
        details = parse_wikipedia_page(page.title)
        print(details.model_dump_json(indent=2))  # Pydantic v2 method
    except wikipedia.exceptions.PageError:
        print(f"Wikipedia page not found for '{institution_name}'")
    except wikipedia.exceptions.DisambiguationError as e:
        print(f"Disambiguation error: {e.options}")
    except Exception as e:
        print(f"An unexpected error occurred: {e}")

Output:

Enter the institution name: Harvard Law School
Error parsing Wikipedia page: Expecting value: line 1 column 1 (char 0)
{
  "founder": null,
  "founded": null,
  "branches": null,
  "num_employees": null,
  "summary": null
}
Lab 10: Build a chatbot for the Indian Penal Code. We'll start by
downloading the official Indian Penal Code document, and then we'll create
a chatbot that can interact with it. Users will be able to ask questions about
the Indian Penal Code and have a conversation with it.
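
The core of such a chatbot is retrieval: given a question, find the section of the document most likely to answer it. A minimal sketch using plain word overlap follows. The function names tokenize and best_section are hypothetical, and the two section entries are placeholders written for this example, not actual wording from the Indian Penal Code.

```python
import re

def tokenize(text):
    """Lowercase the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))

def best_section(question, sections):
    """Return the (title, text) sharing the most words with the question, or None."""
    question_words = tokenize(question)
    best, best_score = None, 0
    for title, text in sections.items():
        score = len(question_words & tokenize(title + " " + text))
        if score > best_score:
            best, best_score = (title, text), score
    return best

# Placeholder entries for illustration only (not real statute text)
sections = {
    "Section A (placeholder)": "Text about theft of movable property.",
    "Section B (placeholder)": "Text about forgery of documents.",
}

print(best_section("What is the punishment for theft?", sections))
```

Word overlap ignores synonyms and treats common words like "the" as evidence, so an embedding-based similarity (as used in the earlier labs) would be a natural next step.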

Code:

!pip install wikipedia
!pip install pydantic

import re
from typing import List, Optional

import wikipedia
from pydantic import BaseModel, Field

class IPCSection(BaseModel):
    """
    Pydantic schema for an IPC section.
    """
    section_number: str = Field(..., description="Section number of the IPC")
    description: Optional[str] = Field(None, description="Description of the section")
    punishment: Optional[str] = Field(None, description="Punishment prescribed for the offence")

def parse_ipc_section(section_text: str) -> IPCSection:
    """
    Parses the text of an IPC section to extract relevant details.
    """
    section = IPCSection(section_number=section_text.split(". ")[0])
    # Use regular expressions to extract description and punishment.
    # The flags are an assumption: DOTALL lets "." span line breaks and
    # IGNORECASE matches "shall" as well as "Shall".
    flags = re.DOTALL | re.IGNORECASE
    description_match = re.search(r"(?<=Whoever).*?(?=\sShall be punished)",
                                  section_text, flags)
    punishment_match = re.search(r"(?<=Shall be punished).*(?=\.)",
                                 section_text, flags)
    section.description = (description_match.group(0).strip()
                           if description_match else "Description not found")
    section.punishment = (punishment_match.group(0).strip()
                          if punishment_match else "Punishment not found")
    return section

def search_ipc(query: str) -> List[IPCSection]:
    """
    Searches the IPC for a given query.
    """
    try:
        # auto_suggest=False looks up the exact title given
        page = wikipedia.page("Indian Penal Code", auto_suggest=False)
        content = page.content
        # Find passages that start with the query and run up to the next
        # numbered line; re.escape treats the user's text literally
        sections = []
        matches = re.findall(rf"{re.escape(query)}.*?(?=\n\d+\.)", content, re.DOTALL)
        for match in matches:
            sections.append(parse_ipc_section(match.strip()))
        return sections
    except wikipedia.exceptions.PageError:
        print("Wikipedia page not found for 'Indian Penal Code'")
        return []
    except Exception as e:
        print(f"An unexpected error occurred: {e}")
        return []

if __name__ == "__main__":
    while True:
        user_query = input("Ask a question about the Indian Penal Code (or type 'exit'): ")
        if user_query.lower() == 'exit':
            break
        results = search_ipc(user_query)
        if results:
            for section in results:
                print(section.model_dump_json(indent=2))
        else:
            print("No matching sections found.")

Output:

Ask a question about the Indian Penal Code (or type 'exit'): ospds

Wikipedia page not found for 'Indian Penal Code'

No matching sections found.

Ask a question about the Indian Penal Code (or type 'exit'): exit
