TON DUC THANG UNIVERSITY
FACULTY OF ELECTRICAL AND
ELECTRONICS ENGINEERING
404133
DIGITAL IMAGE
PROCESSING AND
COMPUTER VISION
Nguyen Hoang Nam, PHD
Phan Le son, PHD
COURSE OBJECTIVES
THIS COURSE INTRODUCES TO STUDENTS:
Digital Image Processing (Fundamental Knowledge)
● Camera imaging systems and the human visual system
● Color spaces and color representation
● Sampling and quantization
● 2D Discrete Fourier Transform (DFT) and Discrete Cosine Transform (DCT)
● Digital image filtering in the spatial and frequency domains
● Edge detection techniques
Computer Vision (Applications)
● Understanding computer vision problems and evaluation metrics
● Characteristics, strengths, and applications of different AI models
26/12/2025 404133 – Digital Image Processing and Computer Vision 2
LEARNING OUTCOMES
● Remember the basic knowledge of image processing
and computer vision.
● Understand the problems of computer vision.
● Choose & apply the relevant methods for each
problem.
● Analyze the image in spatial domain and frequency
domain.
● Apply deep learning model to practical applications
26/12/2025 404133 – Digital Image Processing and Computer Vision 3
GRADING POLICY
In-class test: 20%
Homework: 20%
Final project: 60%
Bonus points for students who actively participate in extra
projects (research project, hackathon, .etc) or understand new
research approaches (paper, dataset, source code)
26/12/2025 404133 – Digital Image Processing and Computer Vision 4
COURSE CONTENT
● Introduction about Digital Image Processing:
applications digital image processing, basic step
in digital image processing
● Camera and human visual system: how image
is captured by camera, what is human visual
system and what it consists of?
● Represent a digital image and color space:
represents digital image in to 2d matrix, model
color using different color space
26/12/2025 404133 – Digital Image Processing and Computer Vision 5
COURSE CONTENT (cont.)
● Sampling and quantization: why need sampling
and quantization, basic steps of sapling and
quantization a digital image processing.
● The relationship between pixels: known
neighboring pixels, distance between pixels.
● Image compression: why need compression,
information redundant, the entropy of image,
Huffman coding, digital image compression steps,
jpeg standard
26/12/2025 404133 – Digital Image Processing and Computer Vision 6
COURSE CONTENT (cont.)
● Digital filter: digital filter in spatial domain, the linear
filter and nonlinear filter, spatial mask, digital filter in
frequency domain
● Basic intensity transformation: histograms
processing, log transform, power law transform
● Digital image enhancement and restore: noise types,
remove noise modes, Wiener filter
● Advance digital image processing: temple matching,
line detection, geometry shape finding, image pattern
recognition
26/12/2025 404133 – Digital Image Processing and Computer Vision 7
COURSE CONTENT (cont.)
Computer vision
● CNN backbones
● Based-RCNN architectures
● YOLO
● Encoder-Decoder (U-Net, Transformer)
26/12/2025 404133 – Digital Image Processing and Computer Vision 8
REQUIREMENTS
▪ Students must attend at least 80% of class lectures.
▪ Read and master the information given in Course
syllabus (download from Library web site)
▪ Read lecture notes/slides, textbook and references
before coming to class. Frequently use library
resources.
▪ Take the lecture seriously, actively participate in class
activities proposed by the instructor.
▪ Submit assigned homework and projects in time.
26/12/2025 404133 – Digital Image Processing and Computer Vision 9
TEXTBOOKS AND REFERENCES
TEXTBOOKS:
[1]. Rafael C. Gonzalez, Richard E. Woods [2010], Digital image
processing, 3e, Prentice Hall, New Jersey.
RECOMMENDED REFERENCES:
■[2]. Rafael C. Gonzalez, Richard E. Woods , Steven L. Eddins
[2009], Digital image processing using MATLAB, 2e., Prentice
Hall, New Jersey.
■[3]. I. Pitas, [2000], Digital image processing algorithms and
applications John Wiley & Sons, New York.
26/12/2025 404133 – Digital Image Processing and Computer Vision 10
TEXTBOOKS AND REFERENCES
OTHER REFERENCES:
■[4] Phạm Hồng Liên, Đặng Ngọc Khoa, Trần Thanh
Phương [2010], MatLab và ứng dụng trong viễn
thông. NXB Đại học quốc gia TPHCM, TPHCM.
■ [5]. TS. Hồ Văn Sung [2009], Xử lý ảnh số - Lý
thuyết và thực hành với MatLab, NXB Khoa học kỹ
thuật, Hà Nội.
26/12/2025 404133 – Digital Image Processing and Computer Vision 11
TEXTBOOKS AND REFERENCES
Papers:
CNN: Y. Lecun, L. Bottou, Y. Bengio and P. Haffner, "Gradient-based learning applied to document
recognition,"
ResNet: K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition,"
RCNN: R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich Feature Hierarchies for Accurate Object
Detection and Semantic Segmentation,"
Fast-RCNN: [Link]
MASK-RCNN: [Link]
UNET: Ronneberger, O., Fischer, P., Brox, T, “U-Net: Convolutional Networks for Biomedical Image
Segmentation”
Transformer: [Link]
YOLO:J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You Only Look Once: Unified, Real-Time
Object Detection,"
26/12/2025 404133 – Digital Image Processing and Computer Vision 12