Fast data loading with DIA

posted Sep 14, 2012, 5:54 AM by Sachchida Ojha   [ updated Sep 14, 2012, 6:09 AM ]
The DIA servers are preloaded with RedHat Enterprise Linux operating systems, currently at version 5.5. It is also preloaded with a Greenplum utility called gpfdist. gpfdist is the Greenplum parallel file server utility used for facilitating fast data loading, making use of the DCA database’s MPP architecture.

Since the DIA servers are RedHat Linux hosts, they can also be configured as hosts for data integration software, such as Informatica, Talend, and Pentaho.

Now let's discuss about common questions asked by the customers such as,

1. How gpfdist is used in the DIA servers for data loading
2. How we can install the Informatica Integration Services on the DIA servers. 
3. How the DIA servers can be configured as a grid for the Informatica Enterprise Grid option.