Agile SSIS series (5/12): What is the smartest way to log SSIS?

Agile SSIS series (5/12): What is the smartest way to log SSIS?

Sprint 1, which was described in the previous text, the focus was on understanding how data centric applications can be developed using agile approach. Besides that possible solutions to the logging issues have been reviewed. In sprint 2, all this knowledge will be incorporated into the architecture of the solution. Two diagrams will be produced as the result of work in sprint 2. “Data flow diagram capturing architecture of the solution” shows the architecture of the solution using data flow diagram. The second chart “Entity-relationship diagram (logical)” captures the proposed data structures in which the logging data will be stored. Please note that only the first diagram will be part of this text, second will come later on.

Data flow diagram capturing architecture of the solution

As mentioned above, the logging will capture data into a database. However, the SSIS packages will not be writing the data to the database directly using INSERT statements, but will use stored procedure to do that. The stored procedure provides abstraction and also supports code reuse as the logic will be part of the procedure rather part of every single SSIS package. Once the data is saved into the database, users will be able to report on it. To make the logging as robust as possible, fail over capability will be added and this is why the packages will still log into the log file as well. Connection to a database can time out or be interrupted as it is likely that the SSIS packages will be executing on a different server than the logging database will reside. So having a log file on the very same server where the DTSX package runs is beneficial as it will work as a log of a last instance in case of some significant failure. However, while logging into the database will be very low level, descriptive and detailed, only errors, warnings and custom events will be written into the log file. Finally, one has to address the issue of the daily maintenance. The problem here is that logging is likely to produce substantial amounts of data and it is required to have some maintenance processes in place to delete off old data. In case of the database, stored procedure will be deleting off old data and for the log file, PowerShell script will be used to trim older data.

Data flow diagram

Few last words...

In this text I have described the data flow diagram of FLOGS which was designed in the sprint 2. In the next article I will carry on describing the work which was done also as part of sprint 2 – the entity-relationship diagram so stay tuned!

Previous articles in the series

Want to know more about Agile BI? Please read my agile BI series...

Raghunath Ramkumar

Distinguished Engineer at CFS

8 年

Good one Petr.

回复
Mustapha Ichahane

Operations Delivery Manager

9 年

Thanks for sharing

回复
Dimitri HAPPI

Value Creation | CVM & Analytics | Data Science

9 年

Thanks for sharing. Very helpful !!!

回复

要查看或添加评论,请登录

Petr Podrouzek的更多文章

  • Innovating with Emplifi Unified Analytics ??

    Innovating with Emplifi Unified Analytics ??

    Last week marked a significant milestone for Emplifi with the launch of Unified Analytics ??. This feature symbolizes…

    3 条评论
  • My goals for 2022

    My goals for 2022

    My goal for January was to come up with what I would like to achieve in 2022 professionally. I asked myself how can I…

    2 条评论
  • Top 3 things I did in 2021

    Top 3 things I did in 2021

    I truly believe it is important to reflect on past experiences and learnings. Sometimes we are so preoccupied with the…

    3 条评论
  • Data warehouse release nightmares (2/2)

    Data warehouse release nightmares (2/2)

    In the previous text, I have discussed the issues I have encountered when releasing DWH based on my 10 years of…

  • Data warehouse release nightmares (1/2)

    Data warehouse release nightmares (1/2)

    I have been a BI/DWH developer for nearly 10 years now and most of the projects I have worked on had one particular…

    1 条评论
  • What value can MDM bring to organisations and at what cost? (3/3)

    What value can MDM bring to organisations and at what cost? (3/3)

    In the previous articles, I have discussed the benefits of implementing MDM but also the costs it can bring. Now let's…

  • What value can MDM bring to organisations and at what cost? (2/3)

    What value can MDM bring to organisations and at what cost? (2/3)

    In the previous article, I discussed the benefits of implementing MDM. As with any technology, it does come with a cost…

  • What value can MDM bring to organisations and at what cost? (1/3)

    What value can MDM bring to organisations and at what cost? (1/3)

    There are many applications supporting various processes in organizations - there is not a single process that would…

    3 条评论
  • Snowflakes and why not to use them (4/4): Conclusion

    Snowflakes and why not to use them (4/4): Conclusion

    In the previous articles, I have introduced simple snowflake utilizing SCD2 model. I have also build a simple star…

    1 条评论
  • Snowflakes and why not to use them (3/4): The problem

    Snowflakes and why not to use them (3/4): The problem

    As stated in the previous article, snowflakes can be more efficient in using storage compared to stars. And believe me,…

    2 条评论

社区洞察

其他会员也浏览了