Data Logging for Audit, Debugging, and Fraud Detection
As we divide large applications into smaller “microservices,” we run into common problems that go back to the Apollo project days. Every development team must solve these problems, so it makes sense for the system architects to specify solutions in advance. This post discusses logging, an area where shared, system-wide solutions matter even more than they do elsewhere.
Back when computer memory cost 10 cents per bit, memories were so small and network capacity so low that it wasn’t possible to do much logging in the sense we use the term today. That made debugging a lot harder than it is now. Legacy systems generally implemented logging pretty casually – developers added print statements as the spirit moved them. To be able to monitor and debug when thousands of different microservice instances are cooperating to solve problems, logging needs a lot more thought than in the old days of wooden ships and iron men.
Given that any user-initiated action can be processed by any number of microservice calls, the variety of bugs and logic flaws that can bedevil a service-based system is richer than in a monolithic application. Each user request should be assigned a user transaction ID which is included in all logging messages. Tracking user requests helps untangle errors related to flow and helps identify normal activity patterns that can be distinguished from fraudulent activity. Request IDs can also be used to support Once and Only Once semantics, which are discussed in a separate post.
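For illustration only, here is a minimal Python sketch of one way to attach a transaction ID to every log message, using a context variable and a logging filter. The names request_id_var, RequestIdFilter, and handle_request are invented for this example, and the convention for forwarding the ID to downstream services (for example, an HTTP header) is left out.

    import contextvars
    import logging
    import uuid

    # Holds the transaction ID for the request currently being handled.
    request_id_var = contextvars.ContextVar("request_id", default="-")

    class RequestIdFilter(logging.Filter):
        # Copy the current transaction ID into every log record so the
        # formatter can print it.
        def filter(self, record):
            record.request_id = request_id_var.get()
            return True

    logger = logging.getLogger("order-service")
    handler = logging.StreamHandler()
    handler.addFilter(RequestIdFilter())
    handler.setFormatter(logging.Formatter("%(asctime)s %(request_id)s %(name)s %(message)s"))
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)

    def handle_request(payload):
        # A new transaction ID is assigned at the system edge; downstream
        # service calls would forward the same ID so every service logs it.
        request_id_var.set(uuid.uuid4().hex)
        logger.info("received order %s", payload["order_id"])

    handle_request({"order_id": "A-1001"})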
The architecture team should choose a common logging library and supply a standard logger configuration file so that messages are formatted consistently across all projects and services. Each message should start with a GMT date and time in yyyymmddhhmmss format. This supports merging logs from different systems spread all over the world.
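One possible shape for such a shared configuration, sketched here with Python’s standard logging module: the formatter is forced onto GMT and uses the yyyymmddhhmmss date format. A real shared library would add the extra fields described in the next paragraph.

    import logging
    import time

    formatter = logging.Formatter(fmt="%(asctime)s %(levelname)s %(message)s",
                                  datefmt="%Y%m%d%H%M%S")
    formatter.converter = time.gmtime   # always GMT, never the host's local zone

    root = logging.getLogger()
    handler = logging.StreamHandler()
    handler.setFormatter(formatter)
    root.addHandler(handler)
    root.setLevel(logging.INFO)

    root.info("service started")        # e.g. 20240301114205 INFO service started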
Many issues arise because of unexpected interactions between calls to microservices. Merging log files from different sources and keeping them in time sequence is helpful, particularly if each message also includes the user request ID. Log messages should also include a hostname, a service name, and a trigger field for the SCADA system, which is explained later.
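A time-sequenced merge is straightforward once every service uses the same fixed-width GMT timestamp prefix. The sketch below assumes exactly that; the file names are illustrative.

    import heapq

    def merge_logs(paths, out_path):
        # heapq.merge needs each input already sorted; individual log files
        # are written in time order, and the fixed-width leading timestamp
        # makes plain string order the same as time order.
        files = [open(p, "r", encoding="utf-8") for p in paths]
        try:
            with open(out_path, "w", encoding="utf-8") as out:
                for line in heapq.merge(*files, key=lambda line: line[:14]):
                    out.write(line)
        finally:
            for f in files:
                f.close()

    merge_logs(["gateway.log", "billing.log", "inventory.log"], "merged.log")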
All times should be stored as raw GMT milliseconds instead of using database-specific time formats. Airlines learned long ago that any other approach leads to chaos.
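A minimal sketch of the idea, assuming times are kept as integer GMT milliseconds (for example, in a BIGINT column) and converted to readable form only at the edges:

    import time
    from datetime import datetime, timezone

    def now_millis() -> int:
        # Raw GMT milliseconds: what gets stored, logged, and compared.
        return int(time.time() * 1000)

    def to_display(millis: int) -> str:
        # Convert to a readable GMT timestamp only for display.
        return datetime.fromtimestamp(millis / 1000, tz=timezone.utc).strftime("%Y%m%d%H%M%S")

    created_at = now_millis()       # e.g. 1700000000123, stored as an integer
    print(to_display(created_at))   # e.g. 20231114221320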
It is important to be able to associate log messages with individual users, either by including the user ID in the log messages or by being able to associate the transaction ID with a specific user as the need arises. The Target breach occurred because their IT systems let a hacker who had stolen HVAC vendor credentials access their point-of-sale systems. This was a permissions issue, not an authentication failure. Target should never have permitted the attacker who impersonated the vendor to escalate privileges at all, to say nothing of escalating to the point of accessing critical data. If all requests had identified the originating user, it would have been easier to protect the more sensitive systems.
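One simple way to keep that association is to record the transaction-ID-to-user mapping once, at the system edge, so any later log line can be traced back to a person. The sketch below uses SQLite purely for illustration; the table and column names are assumptions, not a prescription.

    import sqlite3

    conn = sqlite3.connect("audit.db")
    conn.execute("""CREATE TABLE IF NOT EXISTS request_user (
                        request_id TEXT PRIMARY KEY,
                        user_id    TEXT NOT NULL,
                        start_ms   INTEGER NOT NULL)""")

    def record_request(request_id: str, user_id: str, start_ms: int) -> None:
        # One row per user request, written at the edge; any log line that
        # carries request_id can later be tied to the responsible user.
        conn.execute("INSERT OR REPLACE INTO request_user VALUES (?, ?, ?)",
                     (request_id, user_id, start_ms))
        conn.commit()

    def user_for_request(request_id: str):
        row = conn.execute("SELECT user_id FROM request_user WHERE request_id = ?",
                           (request_id,)).fetchone()
        return row[0] if row else None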
The only known way to detect such internal attacks is activity pattern analysis. Mr. Snowden had root access to the NSA computers, but there was no job-related need for him to actually read gigabytes of data. The fact that he was collecting, moving, and copying large volumes of data should have set off alarms, or at least spurred someone to ask questions. We saw the same “invade, collect, compress, export” sequence in the Capital One data breach.
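Pattern analysis does not have to be sophisticated to be useful. As a deliberately simple sketch, the following flags any user whose daily data volume jumps far above their own recent baseline; the record format and the 10x threshold are illustrative assumptions, not recommendations.

    from collections import defaultdict

    def flag_unusual_volume(daily_bytes_by_user, threshold=10.0):
        # daily_bytes_by_user: {user_id: [bytes_day_1, ..., bytes_today]}
        flagged = []
        for user, series in daily_bytes_by_user.items():
            if len(series) < 2:
                continue
            baseline = sum(series[:-1]) / len(series[:-1])
            if baseline > 0 and series[-1] > threshold * baseline:
                flagged.append((user, series[-1], baseline))
        return flagged

    history = defaultdict(list)
    history["analyst-17"].extend([120e6, 150e6, 110e6, 9.5e9])   # sudden multi-GB day
    history["analyst-42"].extend([90e6, 100e6, 95e6, 105e6])
    print(flag_unusual_volume(history))   # flags analyst-17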
Banks require that all actions with financial effect be associated with some responsible individual. Activity logs support both auditing and activity pattern analysis, but only if all log file formats are consistent.
Logging also feeds our Supervisory Control and Data Acquisition (SCADA) system, which makes sure that thousands of microservice instances are working as expected and that overall system performance meets customer expectations. SCADA is discussed in its own post.
AWS has services which merge logs and send alarms as needed. Is depending on AWS acceptable, or should we adopt and maintain some other log management system? If we already have a well-organized log management system in our legacy data center, using it for the cloud makes sense. If we don’t, getting into the cloud is a golden opportunity to configure one.
Musician at Be?+ · 5 years
Bill Taylor, what’s your input on this matter, pros and cons? Could universities or education systems help brainstorm ideas in classrooms?
IPT-Lead at Aitech Systems Ltd. · 5 years
I do this for a living. When I first started 15 years ago, we barely had much data coming in; I’m talking a few hundred MBs. Today we are talking tens of GBs of data. With new services coming online from our customers, they are capturing more and more data. We are playing catch-up with data servicing/processing and debugging. We are working on finding lost data and why it was missing, down to a second of lost data or less. It’s difficult when you process the data and there are a million lines or more you have to sift through to find the issue. So we have to create the test/processing tools.