1. Establish baselines (especially with I/O), use copy with no output
2. Avoid the use of only one flow for tuning/performance testing. Prototyping can be a powerful tool.
3. Work in increments...change 1 thing at a time.
4. Evaluate data skew: repartition to balance the data flow
5. Isolate and Solve - determine which stage is causing a problem.
7. Do NOT involve the RDBMS in initial testing.
8. Understand and evaluate the tuning knobs available
https://www.facebook.com/datastage4you
https://twitter.com/datagenx
https://plus.google.com/+AtulSingh0/posts
https://datagenx.slack.com/messages/datascience/
No comments:
Post a Comment