TPump is a Teradata utility designed to continuously move data from data sources into Teradata tables without locking the affected table. TPump provides near-real-time data into data warehouses. TPump can be used to insert, update, upsert, and delete data in the Teradata Database. TPump uses row hash locks, meaning users can run queries while it’s updating the Teradata Warehouse.
Overview
TPump uses standard Teradata SQL to achieve moderate to high data loading rates to the Teradata RDBMS. Multiple sessions and multistatement requests are typically used to increase throughput. TPump provides an alternative to MultiLoad for the low-volume batch maintenance of large databases under control of a Teradata system. Instead of updating Teradata databases overnight, or in batches throughout the day, TPump updates information in real time, acquiring data from the client system with low processor utilization. It does this through a continuous feed of data into the data warehouse, rather than the traditional batch updates. Continuous updates result in more accurate, timely data.
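The idea of combining multistatement requests with multiple sessions can be sketched in a few lines of Python. This is only an illustration of the general technique, not TPump's actual implementation: the function names `pack_rows` and `assign_to_sessions`, the pack size, and the round-robin distribution are all assumptions made for the example.

```python
from itertools import islice


def pack_rows(rows, pack_size):
    """Group rows into lists of at most pack_size rows.

    Each list stands for one multistatement request (hypothetical model).
    """
    it = iter(rows)
    while True:
        pack = list(islice(it, pack_size))
        if not pack:
            return
        yield pack


def assign_to_sessions(packs, session_count):
    """Distribute packs round-robin over session_count sessions."""
    sessions = [[] for _ in range(session_count)]
    for i, pack in enumerate(packs):
        sessions[i % session_count].append(pack)
    return sessions


# Ten made-up rows, three statements per request, two sessions.
rows = [f"row{i}" for i in range(10)]
packs = list(pack_rows(rows, pack_size=3))
sessions = assign_to_sessions(packs, session_count=2)
```

Larger packs mean fewer round trips per row, and more sessions mean more requests in flight at once; both are trade-offs against the load placed on the database.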
And, unlike most load utilities, TPump uses row hash locks rather than table level locks. This allows users to run queries while TPump is running. This also means that TPump can be stopped instantaneously.
TPump also provides a dynamic throttling feature that enables it to run “all out” during batch windows, but within limits when it may impact other business uses of the Teradata RDBMS. Operators can specify the number of statements run per minute, or may alter throttling minute-by-minute, if necessary.
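A statements-per-minute limit that can be changed while a job runs might look roughly like the following Python sketch. It is a conceptual model only: the class `MinuteThrottle` and its methods are hypothetical names, and the fixed 60-second window is an assumption rather than a description of how TPump enforces its rate internally.

```python
import time


class MinuteThrottle:
    """Allow at most `limit` statements per 60-second window (hypothetical helper)."""

    def __init__(self, limit, clock=time.monotonic):
        self.limit = limit
        self.clock = clock              # injectable so the demo needs no real waiting
        self.window_start = clock()
        self.sent = 0

    def set_limit(self, limit):
        """Change the rate while the job is running, as an operator might."""
        self.limit = limit

    def try_send(self):
        """Return True if a statement may be sent now, False if the window is full."""
        now = self.clock()
        if now - self.window_start >= 60:
            # A new minute has started: reset the counter.
            self.window_start = now
            self.sent = 0
        if self.sent < self.limit:
            self.sent += 1
            return True
        return False


# Demonstration with a fake clock so the example runs instantly.
fake_now = [0.0]
throttle = MinuteThrottle(limit=2, clock=lambda: fake_now[0])
results = [throttle.try_send() for _ in range(3)]  # third call exceeds the limit
fake_now[0] = 60.0                                 # a new minute starts
after_reset = throttle.try_send()
throttle.set_limit(1)                              # operator lowers the rate
after_lowering = throttle.try_send()               # one already sent this minute
```

Setting a very high limit corresponds to running "all out" during a batch window, and lowering it with `set_limit` corresponds to an operator tightening the throttle when other workloads need the system.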
TPump’s main attributes are:
- Simple, hassle-free setup – doesn’t require staging of data, intermediary files, or special hardware.
- High-end portability – supports IBM mainframes; UNIX MP-RAS; AIX; HP-UX; Windows 98, Windows NT, Windows 2000, and Windows XP; and Solaris SPARC.
- Efficient, time-saving operation – jobs can continue running in spite of database restarts, dirty data, and network slowdowns. Jobs can restart with no operator intervention.
- Flexible data management – accepts a wide variety of data formats from many kinds of data sources, including direct feeds from other databases. TPump is also able to transform that data on the fly before sending it to Teradata. SQL statements and conditional logic are usable within the utility, making it unnecessary to write wrapper jobs around it.
Features
- Fast, scalable continuous data loads
- Row hash lock enables concurrent queries
- Dynamic throttling feature
- Best for small data volumes
Supported Platforms
- NCR UNIX SVR4 MP-RAS
- IBM z/OS (MVS and USS)
- z/OS VM
- Microsoft Windows 2000, XP, and Server 2003
- Sun Solaris SPARC
- IBM AIX
- HP-UX
TPump example
The sample script below loads a fixed-length-column extract into a Teradata database using Teradata Parallel Data Pump (TPump).
Contents of a ggclients.tpump script:
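/* Restart log table and logon credentials */
.logtable tpumplogtable;
.logon tdpid/username,password;
/* Start the load task with 4 sessions */
.BEGIN LOAD SESSIONS 4;
/* Fixed-length input record: field name, start position, data type */
.layout ggclilayout;
.field wh_cust_no 1 char(10);
.field cust_name 12 char(200);
.field bal_amt 213 char(18);
/* DML applied to each input record */
.dml label insertclidml;
insert into gg_cli.*;
/* Read insert.input with the layout above and apply the DML */
.IMPORT INFILE insert.input
    layout ggclilayout
    apply insertclidml;
.END LOAD;
.logoff;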
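To make the layout concrete, the Python sketch below builds and parses one record with the same field positions as the .field lines (1, 12, and 213, which gives a 230-character record). The helper names `build_record` and `parse_record` and the sample values are made up for illustration, and filling the unused positions 11 and 212 with spaces is an assumption. Whether the input file needs line terminators or another record format depends on the FORMAT option of .IMPORT, which the script does not set.

```python
# (field name, 1-based start position, length), copied from the .field lines
LAYOUT = [
    ("wh_cust_no", 1, 10),
    ("cust_name", 12, 200),
    ("bal_amt", 213, 18),
]
RECORD_LENGTH = 230  # the last field ends at 213 + 18 - 1


def build_record(values):
    """Return one fixed-length record; unused positions are filled with spaces."""
    buf = [" "] * RECORD_LENGTH
    for name, start, length in LAYOUT:
        text = str(values[name])
        if len(text) > length:
            raise ValueError(f"{name} is longer than {length} characters")
        # Left-justify and pad so the slice keeps its exact width.
        buf[start - 1:start - 1 + length] = text.ljust(length)
    return "".join(buf)


def parse_record(record):
    """Slice a record back into its fields, dropping trailing padding."""
    return {
        name: record[start - 1:start - 1 + length].rstrip()
        for name, start, length in LAYOUT
    }


# Hypothetical sample values, for illustration only.
record = build_record(
    {"wh_cust_no": "C000000001", "cust_name": "Jane Doe", "bal_amt": "1250.75"}
)
fields = parse_record(record)
```

Keeping the positions in one table such as `LAYOUT` makes it easier to keep an extract program and the TPump layout in step when a field width changes.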