0% found this document useful (0 votes)
35 views4 pages

Installing Pentaho Data Integration Guide

This document provides instructions for installing Pentaho Data Integration and a database driver for either Oracle or MySQL. It describes downloading and unzipping Pentaho, verifying the installation by launching Spoon, and downloading and copying the appropriate JDBC driver file (ojdbc7.jar for Oracle or mysql-connector-java-5.1.36-bin.jar for MySQL) to the lib folder to enable database connections.

Uploaded by

Paco
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views4 pages

Installing Pentaho Data Integration Guide

This document provides instructions for installing Pentaho Data Integration and a database driver for either Oracle or MySQL. It describes downloading and unzipping Pentaho, verifying the installation by launching Spoon, and downloading and copying the appropriate JDBC driver file (ojdbc7.jar for Oracle or mysql-connector-java-5.1.36-bin.jar for MySQL) to the lib folder to enable database connections.

Uploaded by

Paco
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Installation of Pentaho Data Integration and Database

Driver for Module 5


This document provides details about installing Pentaho Data Integration and a database driver.
You will need the database driver for either Oracle or MySQL to complete the guided tutorial
and assignment in module 5.

Installing Pentaho Data Integration


You should install the community edition of Pentaho. The latest stable version (5.0.1) is
available from the Source Forge website ([Link]

To install Pentaho, you should follow the steps below. It is highly recommended that you use
the community edition from SourceForge as the instructions in this document follow the
community edition interface.

 The latest stable version is 5.0.1 although the latest version on SourceForge is 5.4. You
should be able to use either version to complete the tutorial and assignment although the
guided tutorial is written for the latest stable version 5.0.1.

 Go to [Link] and click on Files - Data Integration –


5.0.1-stable

 Download the [Link]

 Unzip the downloaded zip file to any folder.

 Windows uses should copy the folder data-integration to the folder C:\Program
Files\Pentaho. Mac and Linux users (including Oracle Virtual Box) may move the file to any
folder. Note that the Oracle Virtual Box is a Linux environment.

 If you are using the Oracle Database Virtual Box Appliance, you should download and
unzip the PDI zip file inside the Virtual Box. If you download the PDI zip file in Windows, you
may have difficulty making a connection to an Oracle database in the module 5 assignment.

To ensure that the installation worked, you should launch Pentaho Data Integration.

 Run the file [Link] by double clicking on it. You may want to create a shortcut to the
[Link] file so starting data integration is easier. If you get a permission error or cannot
execute the bat file, you should right click and select “Run as Administrator”. For Mac and
Linux users, run the [Link] from terminal (./[Link]). Note that the Oracle Virtual Box is
a Linux environment.

 After you launch Pentaho Data Integration, you will see the Welcome window (Figure 1)
and then the Spoon designer (Figure 2).

 Exit Spoon before installing the database driver file in the next part of the instructions.
22 September 2021 Installation of Pentaho Data Integration Page 2

Figure 1: Pentaho Data Integration Welcome Window


22 September 2021 Installation of Pentaho Data Integration Page 3

Figure 2: Spoon Opening Window

Installing JDBC Drivers


In the guided tutorial and assignment in module 5, you will need to connect to either Oracle or
MySQL server. Before you can connect to a database, you must install the appropriate database
driver. Pentaho uses the Java Database Connectivity (JDBC) to connect to databases.

You need to install the JDBC driver for the specific version of the DBMS that you previously
installed.

 For Oracle 12C, the JDBC driver is “[Link]”. You can download it from the following page.
You can also find JDBC files for other Oracle server versions on this page.

[Link]

 For MySQL, the JDBC driver is “[Link]”. You can download


it using this page:

[Link]
22 September 2021 Installation of Pentaho Data Integration Page 4

The JDBC for MySQL comes in two formats: .zip and .msi. Each format has a different
extraction method. Mac and Linux users (including Oracle Virtual Box) cannot use the
.msi format.

For the .zip file, the unzipped folder has many files. You only need to copy the mysql-
[Link] file.

For the .msi file, you should double click to extract it to the folder C:\Program
Files(x86)\MySQL\MySQL Connector J. In this folder, you will find many files and the
only needed one is [Link].

For Windows users, after downloading the JDBC file, you should copy the file to the following
folder:

C:\Program Files\Pentaho\data-integration\lib

For Mac and Linux users (including Oracle Virtual Box), you should copy the driver to the data-
integration/lib directory.

After copying the JDBC file to the specified folder, you should restart Pentaho Data Integration.

Common questions

Powered by AI

The recommended process for downloading the Oracle 12C JDBC driver involves obtaining the 'ojdbc7.jar' file from the Oracle technology network. After downloading, it should be placed in the `data-integration/lib` directory of Pentaho for successful database connections .

JDBC (Java Database Connectivity) acts as an interface facilitating connection between Pentaho Data Integration and database management systems like Oracle and MySQL. It allows Pentaho to execute SQL statements and retrieve/manage data within the database systems, requiring a specific version driver matching the DBMS .

Restarting Pentaho Data Integration after installing a JDBC driver file is crucial to ensure that the software can recognize and use the newly added driver. This step applies changes that support database connectivity .

The MySQL JDBC driver is available in two formats: .zip and .msi. Windows users can use either format, but Mac and Linux users (including those using Oracle Virtual Box) must use the .zip format due to compatibility issues with .msi files. The essential file in both cases is 'mysql-connector-java-5.1.36-bin.jar' .

For Windows, the 'data-integration' folder is copied to 'C:\Program Files\Pentaho', and 'Spoon.bat' is run with administrator privileges. In Mac/Linux, the folder can be moved to any directory, and 'Spoon.sh' is executed via terminal using './spoon.sh' .

To run 'Spoon.bat' smoothly on Windows without permission issues, right-click the file and select 'Run as Administrator', which provides the necessary permissions. Creating a shortcut can facilitate easier future access, reducing repeated permission prompts .

Extracting the Pentaho Data Integration ZIP file outside of Oracle Virtual Box may lead to challenges in establishing database connections with Oracle, as the different environments might prevent proper driver recognition and integration .

It is recommended to use the community edition of Pentaho Data Integration from SourceForge as the guided tutorials are specifically written for version 5.0.1, ensuring compatibility and instructions match the software's interface .

Users of Oracle Virtual Box should download and unzip the Pentaho Data Integration zip file directly inside the Virtual Box to avoid connection issues with the Oracle database. Additionally, when installing the JDBC driver, the 'ojdbc7.jar' file should be placed appropriately in the `data-integration/lib` directory .

To install Pentaho Data Integration on a Windows machine, follow these steps: 1) Download the community edition of Pentaho Data Integration (version 5.0.1) from SourceForge. 2) Unzip the downloaded file to any folder. 3) Copy the 'data-integration' folder to 'C:\Program Files\Pentaho'. 4) Launch Pentaho Data Integration by running 'Spoon.bat'. Ensure you use 'Run as Administrator' if you encounter permission errors .

You might also like