Course : Talend Open Studio: Implementing Data Integration

Talend Open Studio: Implementing Data Integration

Download in PDF format Share this course by email 2


Talend Open Studio (TOS) is an open-source ETL application that makes it possible to perform massive data synchronizations from one database to another. This course will teach you how to design, model, and develop TOS jobs in order to handle different challenges.


Inter
In-house
Custom

Practical course in person or remote class

Ref. TOT
Price : 2230 CHF E.T.
  3d - 21h00




Talend Open Studio (TOS) is an open-source ETL application that makes it possible to perform massive data synchronizations from one database to another. This course will teach you how to design, model, and develop TOS jobs in order to handle different challenges.

Teaching objectives
At the end of the training, the participant will be able to:
  • Design and develop jobs in the application ETL Talend
  • Optimize the jobs developed by using contexts and datasets.
  • Carrying out more complex transformations by using variables, expressions, and joins
  • Execute and debug a job, track execution statistics

Intended audience
Developers, project managers, decision-making IT consultants, database administrators.

Prerequisites
Good knowledge of RDBMs and SQL. Knowledge of the Java language is a plus.

Course schedule

Presentation

  • Data integration ETL solutions.
  • TOS: Installation, user preferences. Reference documentation.
  • Product philosophy. Designing a job.
Hands-on work
Getting started with the tool. Overview of the specifications of the generic project.

Model the need, design a first job

  • Business Modeler. Job Designer.
  • Main connections. CSV and XML components.
  • Simple transformation components.
  • Viewing the generated code, executing a job.
Exercise
Developing a job that sorts a CSV source, filtering data, and storing the result in an XML file.

Optimize job development

  • Configuring reusable connections using metadata.
  • Updating the metadata and propagating it to jobs, importing/exporting metadata.
  • Configuring jobs with contexts.
  • Externalizing context variables in “.properties” and “.ini” files.
  • Creating and managing your own variables.
  • Generating data sets for tests.
Exercise
Refactoring a job using metadata and contexts. Generating a test data set for this job.

Working with databases

  • Supported databases and main components.
  • Configuring operations on tables.
  • Metadata and context of connecting to a database schema.
  • Sharing connections and managing transactions.
  • Creating queries using SQLBuilder.
Exercise
Reading and updating a data repository hosted on a MySQL server.

Working with multisource data

  • Overview of the tMap component.
  • Configuring input flows, creating joins.
  • Carrying out more transformations using variables, expressions and joins.
  • Qualifying data using filters.
  • Generating multiple outputs, managing rejects.
Exercise
Consolidating multisource data and generating a warehouse.

Add-ons

  • Breaking a job down into sub-jobs, using tRunJob. Launching jobs in the command line. Periodic execution.
  • Debugging a job, tracking execution statistics
  • tJasperOutput reporting.
Exercise
Generating a Jasper Report from a warehouse.


Practical details
Hands-on work
Series of mini-projects used to design increasing elaborate Talend jobs.
Teaching methods;
Theoretical contributions illustrated with example implementations, then consolidated by creating mini-Talend projects.

Customer reviews
4,3 / 5
Customer reviews are based on end-of-course evaluations. The score is calculated from all evaluations within the past year. Only reviews with a textual comment are displayed.


Dates and locations
From 19 to 21 June 2024 *
FR
Remote class
Guaranteed session
Registration
From 23 to 25 September 2024
FR
Remote class
Registration
From 13 to 15 November 2024
FR
Remote class
Registration