Digital Transformation - Modern DB
Tech debt also lives in the database! I did an MRP transformation a few years ago, and one of the shocking things was the huge amount of totally invalid data that should never have gotten into the database. Of course, I'm a firm believer in testing with a copy of full production data, to find the sediment that will cause issues when things go to production. So how do you actually get a database transformed from the old to the new?
First, where are we coming from? What db is in use, and what is the schema for its data? A test environment will need to be created with the original data, and both the schema and the data need to be extracted. In my case, the db was proprietary, so I had to contact the company and get a library that would let me access the data. Since the db was "ancient", there was no modern support for it, so I had to write a minimal driver for it. From there we need to transform the data into a new db, preferably something more current.
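Go's database/sql package makes the "minimal driver" approach fairly painless: implement a handful of interfaces and the rest of your tooling can talk to the old store like any other database. Here is a rough, read-only sketch of what that can look like; the vendor* functions are hypothetical placeholders for whatever access library the vendor supplies, not a real API:

```go
// Package legacydb is a minimal, read-only sketch of a database/sql driver
// for an old proprietary store. The vendor* functions stand in for the
// access library the vendor supplied; they are hypothetical placeholders.
package legacydb

import (
	"database/sql"
	"database/sql/driver"
	"errors"
	"io"
)

func init() {
	// Makes the driver usable as sql.Open("legacydb", dsn).
	sql.Register("legacydb", &Driver{})
}

type Driver struct{}

func (Driver) Open(dsn string) (driver.Conn, error) {
	h, err := vendorOpen(dsn)
	if err != nil {
		return nil, err
	}
	return &conn{h: h}, nil
}

type conn struct{ h vendorHandle }

func (c *conn) Prepare(query string) (driver.Stmt, error) { return &stmt{c: c, q: query}, nil }
func (c *conn) Close() error                              { return vendorClose(c.h) }
func (c *conn) Begin() (driver.Tx, error)                 { return nil, errors.New("read-only driver") }

type stmt struct {
	c *conn
	q string
}

func (s *stmt) Close() error  { return nil }
func (s *stmt) NumInput() int { return -1 } // the old engine cannot report parameter counts

func (s *stmt) Exec(args []driver.Value) (driver.Result, error) {
	return nil, errors.New("read-only driver")
}

func (s *stmt) Query(args []driver.Value) (driver.Rows, error) {
	cols, data, err := vendorQuery(s.c.h, s.q)
	if err != nil {
		return nil, err
	}
	return &rows{cols: cols, data: data}, nil
}

type rows struct {
	cols []string
	data [][]driver.Value
	i    int
}

func (r *rows) Columns() []string { return r.cols }
func (r *rows) Close() error      { return nil }

func (r *rows) Next(dest []driver.Value) error {
	if r.i >= len(r.data) {
		return io.EOF
	}
	copy(dest, r.data[r.i])
	r.i++
	return nil
}

// Hypothetical shims around the vendor's access library.
type vendorHandle struct{}

func vendorOpen(dsn string) (vendorHandle, error) { return vendorHandle{}, nil }
func vendorClose(h vendorHandle) error            { return nil }
func vendorQuery(h vendorHandle, q string) ([]string, [][]driver.Value, error) {
	return nil, nil, nil
}
```

Even a driver this minimal is enough for extraction work, because the migration only ever needs to read from the old store.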
One of the concepts that I love for data migration/transformation is the pairing of an ORM and migrations. This originally comes from Ruby on Rails, and it works really well. An Object-Relational Mapper gives standard semantics to the underlying db and allows your code to be independent of it. Literally, you could use SQLite for test and Oracle or Postgres for production. In today's environment I would use golang and a framework called Buffalo, which includes "fizz". Fizz is a Domain Specific Language for migrating databases. It tries to be as database-agnostic as possible and to simplify the conversion, change, and modification of data. Effectively it gives us an easy way of "scripting" database transformations, as the small example below shows.
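To give a feel for the DSL, here is a small fizz migration sketch; the customers table and its columns are purely illustrative, not taken from the real MRP schema:

```
create_table("customers") {
	t.Column("id", "integer", {"primary": true})
	t.Column("name", "string", {})
	t.Column("credit_limit", "decimal", {"null": true})
}

add_index("customers", "name", {})
```

The same file can then be applied to SQLite, Postgres, or any other engine the framework supports, which is exactly the independence we want.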
The next problem was getting all the tables and everything else into the "fizz" files we need to recreate the schema in the new system. In my case, I was transforming a production application that had been in use for more than 10 years and had a large number of tables. So for this component I would write a bit of golang that reads the schema of the original db and generates fizz files that allow the db to be recreated in any db supported by the framework. This also future-proofs our db choices: once we have our fizz files, we can change the target db with little to no pain. A sketch of such a generator follows.
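This is a rough sketch of that generator. It assumes the minimal driver from earlier is registered as "legacydb" and that the old engine exposes a system.columns catalog; both are assumptions for illustration, and a real run would use whatever catalog the vendor library actually provides:

```go
package main

import (
	"database/sql"
	"fmt"
	"log"
	"os"

	_ "example.com/legacydb" // hypothetical import path for the minimal driver
)

// Very small mapping from legacy column types to fizz column types.
var typeMap = map[string]string{
	"CHAR":    "string",
	"NUMERIC": "decimal",
	"INT":     "integer",
	"DATE":    "timestamp",
}

func main() {
	if len(os.Args) < 2 {
		log.Fatal("usage: schema2fizz <dsn>")
	}
	db, err := sql.Open("legacydb", os.Args[1])
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()

	// The catalog query is an assumption; adjust it to the real engine.
	rows, err := db.Query(`SELECT table_name, column_name, data_type FROM system.columns ORDER BY table_name`)
	if err != nil {
		log.Fatal(err)
	}
	defer rows.Close()

	// Group columns by table, preserving catalog order.
	tables := map[string][]string{}
	var order []string
	for rows.Next() {
		var tbl, col, typ string
		if err := rows.Scan(&tbl, &col, &typ); err != nil {
			log.Fatal(err)
		}
		if _, ok := tables[tbl]; !ok {
			order = append(order, tbl)
		}
		fizzType := typeMap[typ]
		if fizzType == "" {
			fizzType = "string" // fall back to string for anything unrecognised
		}
		tables[tbl] = append(tables[tbl], fmt.Sprintf("\tt.Column(%q, %q, {})", col, fizzType))
	}
	if err := rows.Err(); err != nil {
		log.Fatal(err)
	}

	// Emit one fizz create_table block per legacy table.
	for _, tbl := range order {
		fmt.Printf("create_table(%q) {\n", tbl)
		for _, line := range tables[tbl] {
			fmt.Println(line)
		}
		fmt.Println("}")
	}
}
```

In practice you would write each table's block to its own numbered migration file, but the core idea is just this: read the old catalog once and let fizz carry the schema forward.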
The next choice is: what db do we use? The intention is to use something that works in a Kubernetes or OpenShift environment, can scale horizontally and vertically, and supports a modern cloud native environment. It should also take advantage of today's SSD and memory sizes and really handle the level of performance that is needed. Traditional solutions just do not offer the level of scalability needed for modern applications, and will incur a huge amount of technical debt right from the start. So the better solution is to choose a cloud native database as our target. NuoDB is already integrated with OpenShift, and gives us SQL with ACID-compliant transactions, full redundancy, and scalability. So in my 2020 technical vision, using NuoDB really solves a huge number of problems.
So now we have the migration tools, the source, and the target, but there are a few more points. One of the most important is to make sure we can run the migration multiple times without corrupting data. In my own experience, migrations can take a huge amount of time to run, and you will often want or have to restart them due to bad input data. So in designing your transformation, error recovery and restart are very important elements; the sketch below shows one way to make a table copy restartable.
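One simple pattern is to checkpoint progress in the target db inside the same transaction as each batch of copied rows, so a rerun resumes at the checkpoint instead of duplicating data. The customers and migration_progress tables, driver names, and DSNs below are all illustrative assumptions, and the target is shown as Postgres via lib/pq only to keep the sketch self-contained; the same pattern applies to NuoDB or any other SQL target:

```go
package main

import (
	"database/sql"
	"log"

	_ "example.com/legacydb" // hypothetical legacy driver from earlier
	_ "github.com/lib/pq"    // target shown as Postgres purely for the sketch
)

const batchSize = 1000

func main() {
	src, err := sql.Open("legacydb", "file:/data/mrp.dat")
	if err != nil {
		log.Fatal(err)
	}
	dst, err := sql.Open("postgres", "postgres://localhost/newdb?sslmode=disable")
	if err != nil {
		log.Fatal(err)
	}

	// Where did the last run stop? Zero means we are starting fresh.
	var lastID int64
	if err := dst.QueryRow(
		`SELECT COALESCE(MAX(last_id), 0) FROM migration_progress WHERE table_name = 'customers'`,
	).Scan(&lastID); err != nil {
		log.Fatal(err)
	}

	for {
		// A real job would also LIMIT the source query; the syntax depends on the old engine.
		rows, err := src.Query(`SELECT id, name FROM customers WHERE id > ? ORDER BY id`, lastID)
		if err != nil {
			log.Fatal(err)
		}

		tx, err := dst.Begin()
		if err != nil {
			log.Fatal(err)
		}
		n := 0
		for n < batchSize && rows.Next() {
			var id int64
			var name string
			if err := rows.Scan(&id, &name); err != nil {
				log.Fatal(err)
			}
			if _, err := tx.Exec(`INSERT INTO customers (id, name) VALUES ($1, $2)`, id, name); err != nil {
				log.Fatal(err)
			}
			lastID = id
			n++
		}
		rows.Close()

		if n == 0 {
			tx.Rollback()
			break // nothing left to copy
		}
		// Commit the batch and the checkpoint together, so a crash never
		// leaves copied rows without a matching checkpoint.
		if _, err := tx.Exec(
			`INSERT INTO migration_progress (table_name, last_id) VALUES ('customers', $1)`, lastID); err != nil {
			log.Fatal(err)
		}
		if err := tx.Commit(); err != nil {
			log.Fatal(err)
		}
	}
}
```

If the job dies halfway through, running it again simply picks up from the last committed checkpoint, which is exactly the "run it multiple times without corruption" property we want.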
During this transformation, the other area to be concerned with is matching the hardware runtime environment of the new infrastructure to current industry standards. While this could be a long article by itself, modern db's can use huge RAM footprints and PCI Express based SSDs to offer huge performance. In an OpenShift environment it may be worthwhile to have nodes that are configured for databases and labeled for that application, so your production db pods are scheduled onto them and run significantly better.
The DB is one of the important layers in moving to a 2020 cloud native environment. Looking at the APIs, the framework, the total flow of the app, and where it runs are additional parts of the total solution. Making sure we get the most done in the least time (Agile/Scrum) is equally important.
I will explore these elements further in coming articles.