Agile SSIS series (12/12): Future of FLOGS

Agile SSIS series (12/12): Future of FLOGS

So project FLOGS is over. The task was to design comprehensive and high performing logging platform for SSIS. Also I wanted to improve my knowledge of agile (SCRUM). I believe that I have achieved both goals. However, during the design I had couple more ideas that I did not have time to implement.

How to improve FLOGS?

While testing final version of FLOGS, I suggests following areas to be developed further:

  • Improved error logging in data flow – ability to redirect rows to a generic data structure and achieve non interruptible loads. This would mean that the SSIS load would carry on and not fail even though error in data flow would occur (for example wrong data type got into the pipeline).
  • Sophisticated reporting solution on top of FLOGS – develop OLAP cubes, reports and dashboard based on data collected from FLOGS. These could further support goals G1 – G3.
  • Improve performance of logging – as suggested in previous text the performance of the logging can be significantly improved by storing the log data temporarily to in-memory structure and flush the data to the database once in a while. Performance impact of such logging would be close to nothing.
  • Align FLOGS data with Windows Management Instrumentation data (CPU or memory utilization etc.) - analysis can be taken even further and determine hardware bottlenecks when the SSIS packages execute.

Few last words...

Thanks for reading this series, I am looking forward to your comments as well as improvements to FLOGS. Please download FLOGS here (from Dropbox).

Previous articles in the series

  • Agile SSIS series (1/12): High performance SSIS logging platform – introduction to the project FLOGS; what do I want to achieve and who is the target audicence?
  • Agile SSIS series (2/12): Top 4 features that SSIS logging platform must have – definition of 4 basic requirements and justification why I see these as the most important criteria.
  • Agile SSIS series (3/12): Why do we REALLY use agile approach? - justification of agile; one needs to know what are the reasons to use agile.
  • Agile SSIS series (4/12): Different ways how to do SSIS logging – review of existing solutions or approaches how to log execution of DTSX packages; this review also gives a hint which way to go.
  • Agile SSIS series (5/12): What is the smartest way to log SSIS execution? - data flow diagram defining the architecture of the solution.
  • Agile SSIS series (6/12): Well designed data structures for SSIS logging – entity relationship diagram and snow flake schema which was developed in the first design iteration.
  • Agile SSIS series (7/12): Prototyping the agile way! - first prototype for testing; implemented the dataflow as well as the entity relationship diagram and build test DTSX packages.
  • Agile SSIS series (8/12): Performance testing... - the prototype is tested in order to determine the impact on the performance of SSIS execution; it proved that the design is flawed and can impact performance of rapid executing packages.
  • Agile SSIS series (9/12): Back to the drawing board – based on the performance testing, the architecture of the solutio needs to be changed.
  • Agile SSIS series (10/12): Performance testing of the final prototype – after the architecture change, has the impact on performance been mitigated?
  • Agile SSIS series (11/12): Log data analysis – once the data has been logged how do I interrogate it? How can one improve performance of SSIS packages using extensive logging?

Want to know more about Agile BI? Please read my agile BI series...

Stéphane Boinon

Lead Data Quality Analyst @Rexel

7 年

hi petr, nice feedbacks and reviews about your flogs series, by the way your dropbox link concerning FLOGS.zip file is apparently no longer viable

回复

要查看或添加评论,请登录

Petr Podrouzek的更多文章

  • Innovating with Emplifi Unified Analytics ??

    Innovating with Emplifi Unified Analytics ??

    Last week marked a significant milestone for Emplifi with the launch of Unified Analytics ??. This feature symbolizes…

    3 条评论
  • My goals for 2022

    My goals for 2022

    My goal for January was to come up with what I would like to achieve in 2022 professionally. I asked myself how can I…

    2 条评论
  • Top 3 things I did in 2021

    Top 3 things I did in 2021

    I truly believe it is important to reflect on past experiences and learnings. Sometimes we are so preoccupied with the…

    3 条评论
  • Data warehouse release nightmares (2/2)

    Data warehouse release nightmares (2/2)

    In the previous text, I have discussed the issues I have encountered when releasing DWH based on my 10 years of…

  • Data warehouse release nightmares (1/2)

    Data warehouse release nightmares (1/2)

    I have been a BI/DWH developer for nearly 10 years now and most of the projects I have worked on had one particular…

    1 条评论
  • What value can MDM bring to organisations and at what cost? (3/3)

    What value can MDM bring to organisations and at what cost? (3/3)

    In the previous articles, I have discussed the benefits of implementing MDM but also the costs it can bring. Now let's…

  • What value can MDM bring to organisations and at what cost? (2/3)

    What value can MDM bring to organisations and at what cost? (2/3)

    In the previous article, I discussed the benefits of implementing MDM. As with any technology, it does come with a cost…

  • What value can MDM bring to organisations and at what cost? (1/3)

    What value can MDM bring to organisations and at what cost? (1/3)

    There are many applications supporting various processes in organizations - there is not a single process that would…

    3 条评论
  • Snowflakes and why not to use them (4/4): Conclusion

    Snowflakes and why not to use them (4/4): Conclusion

    In the previous articles, I have introduced simple snowflake utilizing SCD2 model. I have also build a simple star…

    1 条评论
  • Snowflakes and why not to use them (3/4): The problem

    Snowflakes and why not to use them (3/4): The problem

    As stated in the previous article, snowflakes can be more efficient in using storage compared to stars. And believe me,…

    2 条评论

社区洞察

其他会员也浏览了