Automated Data Collection with NiFi

Automated Data Collection with NiFi

Introduction

Manufacturing is a field that is undergoing a complete transformation in the era of faster and more available data. Machines no longer work in isolation but instead transmit their data in real time to centralized monitoring and analytic hubs. Companies who successfully adopt this level of technology will have a substantial edge over their competitors in terms of scrap rate, efficiency analysis, and even thruput. Calculated Systems and Incite Informatics delivered a system that did just that. We designed and piloted a system on Azure that enabled detecting a malfunctioning machine in minutes instead of hours.

Architecture & Solution

If you want to learn more about Apache NiFi in general download Apache NiFi for dummies

Centralizing and automating manufacturing analysis can be broken into 5 stages:

5 Steps of Automated Data Collection
  • Data Collection - This stage is the most dependent on external variables. The specifics of the machines affects storage formats, logging frequency, and even connectivity. Normally this would be a bottleneck in any production pipeline as the hardware teams will need to provide a stream to test the rest of the system. We selected Apache NiFi to be the log transmission agent as it has the ability to serve the production purpose of streaming and the development purpose of simulating a stream. The same architecture and flow(code-SEE HERE) can be used to either as the device monitor or development simulator
  • Landing Data on the Cloud - An important principle in designing a reliable, resilient architecture is decoupling ingest and processing. This is particularly true on cloud where scalable resources can be utilized to reduce cost. The goal is to be able to land messages regardless of frequency and then scale processing to match. A message queue or event bus are ideal for filling this role in an easy to use, cheap manner. We used Event Hubs due to its serverless nature and compatibility with the rest of the Azure ecosystem.
  • Processing - Batch & Stream - Processing data is a multi-headed problem that requires much consideration. A good rule of thumb for figuring out the right tool is the 3 Vs - Volume, Velocity, Variety. It is important to consider velocity a sliding scale rather than a binary condition of batch or stream. Modern cloud systems consider it a greyscale between ‘instant’ and ‘infinite’. In our case we decided to tap two separate processing engines, Azure Functions and NiFi. Functions is native to azure and is truly serverless enabling processing to be run cheaply and sustainably. NiFi adds a high degree of flexibility and agility but does require a server to be configured.
  • Automated Action - The ability to automate actions is the most important step. Insights and knowledge do not help the business unless you know what to do with them. In this case study we were focused on scrap rate reduction. The initial implementation focused on sending an alert to an alert que and trigger an email off that. As the solution enters phase two automated shutdown signals along with suggested remedial action can be added.
  • Post-Analysis & Storage - After the events are processed and acted on the information is still valuable. Making it accessible and available enables further analysis and review. This allows further analysis and new processing & actions to be discovered. Following the best practice principles of big data we landed the data without modifying or parsing it. This ensures that the data can be replayed if an error in processing was found. Event hubs has a built in storage connector to blob storage that can be easily configured. We also sent data to a SQL database for analysis. Although this requires more parsing it is much more accessible to the end user.
Manufacturing + Cloud Architecture

If you want to learn more about Apache NiFi in general download Apache NiFi for dummies

Conclusion

Were able to quickly automated data and factory handling. A cloud-first approach enables companies to scale up an entire project in weeks which would previously take months. Cloud platforms such as Azure offer phenomenal building blocks in which the right open source technology can augment existing capabilities. This enables companies to launch sweeping technology upgrades within weeks instead of years.

Check out a more practical lesson to moving data to AWS


Prithvi Raj Vadlamudi

Principal at BCG | C-Suite Advisor | Digital Strategy and Consulting | Data Strategy | Architecture | Robotics & AI

5 年

Great articles How do I download for offline read

回复

要查看或添加评论,请登录

Chris Gambino的更多文章

  • NiFi and Retrieval Augmented Generation

    NiFi and Retrieval Augmented Generation

    Phase 1 – “Basic Knowledge” We built a real time slackbot to help answer NiFi questions. To build and host this…

    1 条评论
  • Cloud First IoT with Syft

    Cloud First IoT with Syft

    Introduction Syft Technologies is a leading scientific equipment manufacturer specializing in chemical analysis. To…

  • A Crash Course for Amazon Natural Language Processing

    A Crash Course for Amazon Natural Language Processing

    Over the past few years we have seen a rise in cloud native “machine learning” models. These general use models are…

  • What I Learned from 2.75 Million Bike Rides

    What I Learned from 2.75 Million Bike Rides

    What do you think is the most popular bicycle spot is in San Francisco? I’ll give you a hint, over 129,000 people…

  • Moving Data to the Cloud - A Practical Guide

    Moving Data to the Cloud - A Practical Guide

    Moving data to the cloud is one of the cornerstones of any cloud migration. Having worked with both on-premise and…

    2 条评论
  • Create A Restful API for Nifi, Walmart Case Study

    Create A Restful API for Nifi, Walmart Case Study

    I was recently tinkering with the walmart rest-api. This is publicly available interface and can be used for a quick…

  • Windows Share + Nifi + HDFS – A Practical Guide

    Windows Share + Nifi + HDFS – A Practical Guide

    Recently I had a client ask about how would we go about connecting a windows share to Nifi to HDFS, or if it was even…

    1 条评论
  • Parsing XML Logs With Nifi – Part 1 of 3

    Parsing XML Logs With Nifi – Part 1 of 3

    I have a plan to write a 3 part “intro” series as to how to handle your XML files. The subjects will be: Basic XML and…

    1 条评论
  • Integrating Nifi with Graylog

    Integrating Nifi with Graylog

    Graylog is gaining popularity as a log exploration tool. So this begs the question, how do you intelligently route your…

    1 条评论
  • Building a Smarter Home with Nifi and Spark

    Building a Smarter Home with Nifi and Spark

    I submitted an abstract for the hadoop world summit. Check it out and vote for it here Join us as we discuss what life…

    2 条评论

社区洞察

其他会员也浏览了