Datastage Interview Questions And Answers

Datastage Interview Questions And Answers

1) What is Datastage?

Datastage is an ETL tool given by IBM which utilizes a GUI to design data integration solutions. This was the first ETL tool that gave
parallelism concept.

It is available in following 3 different editions

Server Edition
Enterprise Edition
MVS Edition

2) Highlight the main features of Datastage?

The main features of Datastage are highlighted below

It is the data integration component of IBM Infosphere information server.
It is a GUI based tool.We just need to drag and drop the Datastage objects and we can convert it to Datastage code.
It is used to perform the ETL operations (Extract, transform, load)
It provides connectivity to multiple sources & multiple targets at the same time
Provides partitioning and parallel processing techniques which enable the Datastage jobs to process a huge volume of data quite faster.
It has enterprise-level connectivity.

3) What are the primary usages of Datastage tool?

Datastage is an ETL tool which is primarily used for extracting data from source systems, transforming that data and finally loading it to target systems.

4)What are the different layers in the information server architecture?

Unified user interface
Common services
Unified parallel processing
Unified Metadata
Common connectivity

5)What could be a data source system?

It could be a database table, a flat file, or even an external application like people soft.

6)On which interface you will be working as a developer?

As a Datastage developer, we work on Datastage client interface which is known as a Datastage designer that needs to be installed on the local system. In the backend, it is connected to the Datastage server.

7)What are the different common services in Datastage?

Metadata services
Unified service deployment
Security services
Looping and reporting services.

8)How do you start developing a Datastage project?

The very first step is to create a Datastage job on the Datastage server. All the Datastage objects that we create are stored in the Datastage project. A Datastage project is separated environment on the server for jobs, tables, definitions, and routines.

A Datastage project is separated environment on the server for jobs, tables, definitions, and routines.

9)Where do the Datastage jobs get stored?

The Datastage jobs get stored in the repository. We have various folders in which we can store the Datastage jobs.

10)Where do you see different stages in the designer?

All the stages are available within a window called as ‘Palette’. It has various categories depending upon the kind of function that the stage provides.

The various categories of stages in the Palette are – General, Data Quality, Database, Development, File, Processing, etc.

11)What are the Processing stages?

The processing stages allow us to apply the actual data transformation.

12)Name the different sorting methods in Datastage?

Link sort
Inbuilt Datastage Sort

13)How do you import and export the Datastage jobs?

Import: dsimport.exe
Export: dsexport.exe

14)What are routines in Datastage? Enlist various types of routines.

Routine is a set of functions that are defined by DS manager. It is run via transformer stage.

There are 3 kinds of routines

Parallel routines
Mainframe routines
Server routines

15)What are the different kinds of views available in a Datastage director?

Log view
Status view
Job view

16)What is the difference between passive stage and active stage?

Passive stages are utilized for extraction and loading whereas active stages are utilized for transformation.

17)Is the value of staging variable stored temporarily or permanently?

Temporarily. It is a temporary variable.

18)What is the use of Datastage director?

Through Datastage director, we can schedule a job, validate the job, execute the job and monitor the job.

19)What are the various kinds of the hash file?

Static hash file
Dynamic hash file

20)what are the different type of jobs in Datastage?

Server jobs (They run in a sequential manner)
Parallel jobs (They get executed in a parallel way)

For more   Click Here

For Course Content   Click Here

Top ↑