0% found this document useful (0 votes)
7 views2 pages

Database Systems: DDL, DML, and More

Uploaded by

garas47896
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views2 pages

Database Systems: DDL, DML, and More

Uploaded by

garas47896
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Unit 1

What is DDL (Data Definition Language)?


What are the primary DDL commands and their functions?
What is DML (Data Manipulation Language)?
What are the key DML commands and their purposes?
What is DML (Data Manipulation Language)?
What are the primary DML commands and their functions?
What are the key DDL commands and their purposes?
Write the SQL commands for creating database, creating table with syntax and examples.
Write the SQL commands for inserting records in a table with syntax and example.
What are views in SQL? Write the syntax for creating view in SQL
What are primary structured data types in object database systems?

Unit 2
What is a data model in the context of database systems?
What are the main components of a data model?
How does the relational data model represent data?
How do conceptual, logical, and physical data models differ?
What are the challenges in implementation of ORDBMS?
Distinguish between RDBMS and ORDBMS.
Explain the working of Parallel Databases?
Explain shared Disk Architecture.
Explain parallel query mechanism.
Explain shared memory Architecture.

Unit 3
Distinguish between Distributed Database and Centralized Database
What is Data Fragmentation in Distributed Databases
Explain Client-Server Architecture in Distributed Databases.
What are Distributed Database Access Primitives ?
Write the major database access primitives of distributed database.
Write the Integrity Constraints in distributed databases.
Explain Share Nothing Architecture in Distributed Databases.
What is horizontal fragmentation in distributed database?
Explain any two transparencies in distributed databases.
Explain Peer to Peer Architecture in Distributed Databases.
Explain hybrid fragmentation with example in distributed database.

Unit 4
Explain the framework for distributed database design.
Explain with an overview of Database Fragmentation.
What is query equivalence ? What are rules of query equivalence transformations ?
Explain the rules of transforming a global query into fragmented query.
What are distributed grouping functions in distributed database.
What are parametric queries in distributed databases.
Explain the framework for distributed database design.
What are common aggregate functions used in distributed databases

Unit 5
Explain the basic query optimization framework in detail .
What are the strategies for query optimizations.
What are cost components of Query Execution.
Write a note on Information Retrieval System in DBMS.
Describe IR Model in DBMS.
Distinguish Database Management System over Information Retrieval System.
Explain Query Processing in DBMS.
Explain how the query processing is being optimized.
Write a note on Information Retrieval System in DBMS.
What are cost components of Query Execution?
Explain the basic query optimization framework in detail .
Explain rule based and cost based query optimization.
What is Indexing in DBMS ? What are the indexing methods ?

Unit 6
What is Data mining ? What are various data mining application ?
What are advantages of Data Mining ?
Explain how classification is done in data mining classification technique.
Apply k-means clustering algorithm to cluster the datapoints {2,4,10,12,3,20,30,11,25} into two clusters.
Take M1=4, M2=11

Form cluster with Ɛ=3.5 and min pts=3 for the following set of data points S1(5,7), S2(8,4), S3(33),
S4(4,4), S5(3,7), S6(6,7), S7(6,1), S8(5,5). Use DBSCAN Algorithm.
What is Knowledge Discovery Data process in data mining ?

What is clustering in data mining ? How does k mean clustering algorithm work ?
What are parameters required for DBSCAN algorithm ?
What is Ranking of Web pages in data mining ? Explain
Explain centroid based clustering algorithm in data mining.
What are decision trees in data mining ? Explain

Common questions

Powered by AI

Implementation of ORDBMS is challenging due to the complexity of integrating object-oriented features into the relational models. These features include handling complex data types and supporting inheritance and encapsulation, which can complicate storage and retrieval processes. Additionally, maintaining performance and ensuring backward compatibility with existing RDBMS functionalities are significant difficulties .

Indexing significantly speeds up data retrieval operations by providing a quicker access path to the data, avoiding full table scans, which are resource-intensive. Common indexing methods include B-tree indexes, which are balanced and suitable for range queries, and hash indexes, which offer fast lookups in equality searches. Other methods like bitmap indexes are effective for low-cardinality columns . Indexes enhance performance but require maintenance overhead as they need updating with every data modification.

Hybrid fragmentation in distributed databases combines both horizontal and vertical fragmentation techniques to divide a database schema into smaller, manageable fragments. For example, suppose a company database table has columns for EmployeeID, Name, Department, and Salary. Horizontal fragmentation may first divide the table based on the Department column, and then vertical fragmentation can separate Name and Salary into different fragments. The goal is to localize access based on user needs while ensuring data availability and optimizing performance .

Query equivalence transformations facilitate the optimization of global queries into fragmented queries by allowing a higher-level query to be rewritten in terms consistent with the fragmented data distribution. This involves applying transformation rules to restructure queries without changing their semantic meaning, thus enhancing execution efficiency by taking advantage of data locality and reducing data movement across the network . These transformations are essential for achieving optimal performance in distributed database environments.

In a client-server architecture, a centralized server holds all the data and services clients' requests, creating a clear flow of data control and management. This model excels in security and centralized data integrity but can become a bottleneck . In contrast, peer-to-peer architecture distributes data and request-handling across multiple nodes, promoting redundancy and resilience. Each node can act as both client and server, thus improving scalability and fault tolerance but increasing complexity in ensuring consistency and synchronization across nodes .

The relational data model represents data in a structured tabular format with relationships expressed through foreign keys, but it has limitations in handling more complex data types and relationships. The object-relational database model extends this by integrating object-oriented database concepts such as inheritance, complex data types, and user-defined types, allowing it to represent more intricate data structures and relationships . This augmentation enables better management of complex data, surpassing the rigidity of traditional relational databases .

DBSCAN (Density-Based Spatial Clustering of Applications with Noise) differs from K-Means by identifying clusters based on the density of data points rather than predefined centroids. It groups points that are closely packed together while marking points in less dense regions as noise. DBSCAN requires two parameters: epsilon (ɛ), which specifies the radius for neighborhood points, and minPts, indicating the minimum number of points needed to form a dense region . This algorithm naturally handles noise, unlike K-Means which can be sensitive to outliers and noise as it relies on the initial random choice of centroids and often leads to spherical clusters .

DDL (Data Definition Language) focuses on defining, altering, or dropping data structures such as database schemas and tables. Key DDL commands include CREATE, ALTER, and DROP, which are used to create, modify, and delete databases and tables, respectively . On the other hand, DML (Data Manipulation Language) is used for managing data within these structures, providing commands like SELECT, INSERT, UPDATE, and DELETE to query, add, modify, and remove data . The primary difference lies in DDL's focus on structure and DML's focus on the data inside those structures.

Query optimization strategies include cost-based optimization, where different execution plans are evaluated for their estimated cost, and rule-based optimization, which applies predefined transformation rules to rewrite queries more efficiently. Other strategies involve indexing, partitioning data, utilizing parallel query processing, and caching interim results to minimize input/output operations, processing time, and resource consumption . These strategies collectively aim to execute queries with the lowest computational expense while maintaining query accuracy and response time.

Classification using a decision tree involves splitting the dataset into homogenous subsets based on attribute values that maximize the separation of different classes. Starting from a root node, data is filtered through branches based on decision rules, leading to terminal nodes that predict the class label. Decision trees are advantageous as they are easy to interpret, require no domain knowledge, and handle both numerical and categorical data effectively. They are also non-parametric, making them versatile for various types of data .

You might also like