Table of contents

Overview of InfoSphere DataStage

IBM® InfoSphere® DataStage® is a data integration tool for designing, developing, and running jobs that move and transform data.

InfoSphere DataStage is the data integration component of IBM InfoSphere Information Server. It provides a graphical framework for developing the jobs that move data from source systems to target systems. The transformed data can be delivered to data warehouses, data marts, and operational data stores, real-time web services and messaging systems, and other enterprise applications. InfoSphere DataStage supports extract, transform, and load (ETL) and extract, load, and transform (ELT) patterns. InfoSphere DataStage uses parallel processing and enterprise connectivity to provide a truly scalable platform.

With InfoSphere DataStage, your company can accomplish these goals:
  • Design data flows that extract information from multiple source systems, transform the data as required, and deliver the data to target databases or applications.
  • Connect directly to enterprise applications as sources or targets to ensure that the data is relevant, complete, and accurate.
  • Reduce development time and improve the consistency of design and deployment by using prebuilt functions.
  • Minimize the project delivery cycle by working with a common set of tools across InfoSphere Information Server.