Parallel processing, or parallelism, is the simultaneous use of more than one CPU or processor core to execute a program or multiple computational threads. Ideally, parallel processing makes programs run faster because more engines (CPUs or cores) are running them. DataStage supports two types of parallelism:
1. Pipeline parallelism
2. Partition parallelism
Pipeline Parallelism:
In pipeline parallelism, as soon as a row or set of rows has been processed at one stage, it is passed on to the next stage for further processing or storage, without waiting for the rest of the data to finish. The image below illustrates this.
Suppose the source holds a set of rows and 1k rows are read in a single segment. As soon as those rows have been processed at TRANSFORM, they are sent to ENRICH, and from there to LOAD. This keeps the processors busy and reduces the disk usage needed for staging.
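The flow described above can be sketched in Python. This is only an illustration of the idea, not how DataStage is implemented: the stage logic, the names `stage` and `run_pipeline`, and the use of threads with queues are all assumptions made for the example. Each stage runs in its own thread and hands a row downstream as soon as it has processed it, so nothing is staged on disk between stages.

```python
import queue
import threading

SENTINEL = object()  # marks the end of the row stream


def stage(func, inbox, outbox):
    """Apply func to each row as it arrives and pass the result downstream."""
    while True:
        row = inbox.get()
        if row is SENTINEL:
            outbox.put(SENTINEL)  # tell the next stage that input is finished
            break
        outbox.put(func(row))


def run_pipeline(rows):
    """Push rows through TRANSFORM -> ENRICH -> LOAD, all stages running at once."""
    q_transform, q_enrich, q_load = queue.Queue(), queue.Queue(), queue.Queue()

    # Hypothetical stage logic standing in for TRANSFORM and ENRICH.
    transform = threading.Thread(
        target=stage, args=(lambda r: r * 2, q_transform, q_enrich))
    enrich = threading.Thread(
        target=stage, args=(lambda r: (r, f"row-{r}"), q_enrich, q_load))
    transform.start()
    enrich.start()

    # SOURCE: feed rows in; downstream stages start work immediately.
    for row in rows:
        q_transform.put(row)
    q_transform.put(SENTINEL)

    # LOAD: collect rows as they come out of ENRICH.
    loaded = []
    while True:
        item = q_load.get()
        if item is SENTINEL:
            break
        loaded.append(item)

    transform.join()
    enrich.join()
    return loaded
```

Calling `run_pipeline([1, 2, 3])` returns the rows in their original order, because each stage is a single worker reading from a first-in, first-out queue. Python threads share one interpreter lock, so this sketch shows the structure of a pipeline rather than a real speed-up.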
Partition Parallelism:
Partition parallelism depends on dividing a large dataset into smaller subsets (partitions) across resources. The goal is to distribute the data evenly, although some transforms require all rows within the same group to be in the same partition. The same transform must be applied to all partitions.
With partition parallelism, the same job is effectively run simultaneously by several processors, each handling a separate subset of the total data. Each partition is independent of the others; there is no concept of “global” state.
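As a rough sketch of this idea in Python (not DataStage code; the function names and the upper-casing transform are made up for illustration), the data is split into partitions and the same function is applied to each one independently. A thread pool is used here only to keep the example short; a real parallel engine would typically run partitions in separate processes or on separate nodes.

```python
from concurrent.futures import ThreadPoolExecutor


def split_evenly(rows, n_partitions):
    """Divide rows into n_partitions subsets of nearly equal size."""
    return [rows[i::n_partitions] for i in range(n_partitions)]


def transform_partition(partition):
    """The same logic runs on every partition and sees only its own rows."""
    return [row.upper() for row in partition]


def run_partitioned(rows, n_partitions=3):
    """Apply transform_partition to all partitions at the same time."""
    partitions = split_evenly(rows, n_partitions)
    with ThreadPoolExecutor(max_workers=n_partitions) as pool:
        # map() returns results in the same order as the input partitions
        return list(pool.map(transform_partition, partitions))
```

For example, `run_partitioned(["a", "b", "c", "d", "e"], 2)` produces two result lists, one per partition. Because `transform_partition` never looks outside its own partition, there is no shared state to coordinate.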
DataStage combines partition and pipeline parallelism to implement ETL solutions.
Partitioning techniques are either key-based or keyless.
Key-based techniques are:
a) Hash
b) Modulus
c) Range
d) DB2
Keyless techniques are:
a) Same
b) Entire
c) Round Robin
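To make the difference between the techniques concrete, here is a minimal Python sketch of how some of them could assign rows to partitions. It is not DataStage's internal implementation: the function names, the CRC32 hash, and the `boundaries` list used for Range are assumptions chosen for the example. Same (keep the existing partitioning) and DB2 (follow the partitioning of a DB2 table) are left out because they depend on state outside a small example.

```python
import zlib
from bisect import bisect_right


def hash_partition(key, n):
    """Hash: rows with the same key value always land in the same partition."""
    # CRC32 is used only because it is deterministic across runs (assumption).
    return zlib.crc32(str(key).encode("utf-8")) % n


def modulus_partition(key, n):
    """Modulus: partition number is the numeric key modulo the partition count."""
    return key % n


def range_partition(key, boundaries):
    """Range: partition is chosen by where the key falls among sorted boundaries."""
    return bisect_right(boundaries, key)


def round_robin(rows, n):
    """Round Robin: rows are dealt out to the partitions in turn, ignoring keys."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def entire(rows, n):
    """Entire: every partition receives a complete copy of the data."""
    return [list(rows) for _ in range(n)]
```

With `n = 4`, `modulus_partition(10, 4)` places key 10 in partition 2, and with hypothetical boundaries `[10, 20]`, `range_partition(25, [10, 20])` places key 25 in the last of three partitions. The key-based functions keep related rows together, while `round_robin` favours an even spread.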