登录查看更多内容

20 years of Open Source from Grid to Cloud Computing

Paul Brebner

Open Source Technology Evangelist at Instaclustr by NetApp

发布日期: 2024年12月17日

Given that it's coming to the end of 2024 I was thinking back to what I was up to 20 years ago, in 2004. That feels like a long time ago - Myspace was the social media site, my kids loved playing Runescape, I bought my first (small) mobile phone - previously I had an analogue "brick" phone, I bought the fastest laptop around - the Acer Ferrari - and I looked a lot younger! (The Acer ran red hot, just like the colour!)

But I was also involved in an EPSRC project running out of UCL (with Professor Wolfgang Emmerich) in London to evaluate a new type of open source software for Grid computing. It was called the Open Grid Services Architecture (OGSA), and I (and the family) had to move from Canberra to London for the year (arriving in a dark freezing cold London Xmas was an experience). Grid computing is a type of distributed computing for scientific (and other) workloads, and was designed to enable multiple heterogeneous computing resources (typically provided for free by universities and other research bodies) to cooperate in solving problems that would be too large to run locally.

Some of the problems we had to address included how to set up the distributed OGSA infrastructure at each location securely and enable it to provide the ability for end users to deploy and run arbitrary code (packaged as web services) securely across all the available infrastructure.

What did we learn? That it was harder than it looked - there were lots of complications around security, scale, and the fact that most Grid workloads were batch-oriented and the only way you could get access to computing resources was through complex and heavy-weight resource schedulers. We discovered that traditional users were very good at "gaming" the system to ensure they could "reserve" resources well in advance (often by submitting dummy jobs which they swapped for the real ones at the last minute). On the other hand, we were interested in dynamic, real-time, workflows that needed access to resources on-demand for minutes to hours, not days to weeks.

I was pleasantly surprised to find the UCL OGSA project page is still around! Check it out here.

As well as evaluating distributed open source Grid software, the project was distributed across 4 universities, so it was fun to travel around the UK for work (including my 1st International Conference on Software Engineering, in Edinburgh).

领英推荐

SUSE's open source digest - November 2024

SUSE 4 个月前

Autumn Bytes

Rancher 1 年前

Mirantis is a Challeger in Gartner Magic Quadrant for…

Mirantis 5 个月前

After 20 years what did we learn? Grid was an early type of cloud computing - given the problems with finite and fair resource sharing, an obvious solution was an economic dimension - this was a fundamental aspect of Cloud computing - resources cost real $ and you only have them for a finite period of time. Service discovery was an essential part of the Grid, and that's still the case today - AWS (e.g.) has zillions of services, all readily discoverable. Instaclustr also has a whole bunch of services, but the difference is that they are all open source and available on multiple cloud providers. The heterogeneous nature of the available Grid resources corresponds (loosely) to different instance sizes, and the ability to deploy arbitrary code corresponds to early approaches for component deployment, which involved into serverless computing, Kubernetes and more.

These days I'm the technology evangelist at NetApp Instaclustr, and I write about diverse Big Data and Streaming open source software running on Cloud platforms.

The main resource contribution from this project was this paper (available here)

Brebner, Paul & Emmerich, Wolfgang. (2006). Two Ways to Grid: The Contribution of Open Grid Services Architecture (OGSA) Mechanisms to Service-Centric and Resource-Centric Lifecycles. J. Grid Comput.. 4. 115-131. 10.1007/s10723-005-9008-2. Service Oriented Architectures (SOAs) support service lifecycle tasks, including Development, Deployment, Discovery and Use.

Abstract: We observe that there are two disparate ways to use Grid SOAs such as the Open Grid Services Architecture (OGSA) as exemplified in the Globus Toolkit (GT3/4). One is a traditional enterprise SOA use where end-user services are developed, deployed and resourced behind firewalls, for use by external consumers: a service-centric (or ‘first-order’) approach. The other supports end-user development, deployment, and resourcing of applications across organizations via the use of execution and resource management services: A Resource-centric (or ‘second-order’) approach. We analyze and compare the two approaches using a combination of empirical experiments and an architectural evaluation methodology (scenario, mechanism, and quality attributes) to reveal common and distinct strengths and weaknesses. The impact of potential improvements (which are likely to be manifested by GT4) is estimated, and opportunities for alternative architectures and technologies explored. We conclude by investigating if the two approaches can be converged or combined, and if they are compatible on shared resources.

要查看或添加评论，请登录

Paul Brebner的更多文章

Load Testing - of a bridge, by lots of trains!

2025年3月3日

Load Testing - of a bridge, by lots of trains!

Finally, an opportunity to combine software performance engineering with trains in a way that's not too far-fetched! I…
Three decades of laptop computers

2025年2月23日

Three decades of laptop computers

I was tidying up the garage on the weekend and came across a stack of old laptops that I've been "accidentally"…
Open Source Performance Engineering: Blogs – Part 1

2025年2月19日

Open Source Performance Engineering: Blogs – Part 1

I recently needed to track down and summarise some of my Performance Engineering blogs (covering performance…
Kafka Connect: Build and Run Data Pipelines - Book Review, Paul Brebner

2024年11月22日

Kafka Connect: Build and Run Data Pipelines - Book Review, Paul Brebner

Kafka Connect: Build and Run Data Pipelines, by Mickael Maison and Kate Stanley, O'Reilly September 2023, 400 pages. I…

2 条评论
Summary of the 6th Community over Code Performance Engineering Track (October 7, 2024, Denver, Colorado, USA)

2024年10月23日

Summary of the 6th Community over Code Performance Engineering Track (October 7, 2024, Denver, Colorado, USA)

After much anticipation, the 6th Community over Code Performance Engineering track was held on October 7 2024 in…

2 条评论
Seven Years of Open Source DevRel Technology Fun With Instaclustr

2024年8月6日

Seven Years of Open Source DevRel Technology Fun With Instaclustr

Seven years ago tomorrow I joined Instaclustr as the first Technology Evangelist to help explain multiple open source…

4 条评论
The Fourth Community over Code Performance Engineering Track (Bratislava, Slovakia, 5 June 2024)

2024年6月17日

The Fourth Community over Code Performance Engineering Track (Bratislava, Slovakia, 5 June 2024)

The 4th Community over Code Performance Engineering track was on recently in Bratislava. Thanks to everyone who made it…
Kafka Summit Bangalore 2024 - Interesting Talks

2024年5月9日

Kafka Summit Bangalore 2024 - Interesting Talks

Last week I attended the Apache Kafka Summit Bangalore (India, along with thousands of other speakers and attendees -…
What Do Hanoi Intersections And Water Puppets Have In Common With Distributed Cloud Systems?

2024年4月22日

What Do Hanoi Intersections And Water Puppets Have In Common With Distributed Cloud Systems?

Last week I presented at FOSSASIA which was held in Hanoi, Vietnam. During my time in Hanoi, I had two experiences that…

3 条评论
Connecting to Instaclustr Managed PostgreSQL? and Apache Kafka? from Payara Cloud

2024年3月14日

Connecting to Instaclustr Managed PostgreSQL? and Apache Kafka? from Payara Cloud

Paul Brebner, Instaclustr Technology Evangelist https://www.instaclustr.

See all articles

20 years of Open Source from Grid to Cloud Computing

Paul Brebner

Open Source Technology Evangelist at Instaclustr by NetApp

领英推荐

Paul Brebner的更多文章

社区洞察

其他会员也浏览了

AWS vs. Traditional Server Administration: What you need to know

Serverless Computing: How Serverless Computing is Changing the Game for Software Engineers

OpenStack installation on Ubuntu

Red Hat announces Matt Hicks as its new CEO

Open source framework technical characteristics and applicable scenarios （4）

Raft: The Unsung Hero of Distributed Systems

Serverless in September

Isolating Services with Docker: Simplifying Infrastructure and Deployment

CEPH Why? & How?

Scaling Web Application

领英推荐

Paul Brebner的更多文章

Load Testing - of a bridge, by lots of trains!

Three decades of laptop computers

Open Source Performance Engineering: Blogs – Part 1

Kafka Connect: Build and Run Data Pipelines - Book Review, Paul Brebner

Summary of the 6th Community over Code Performance Engineering Track (October 7, 2024, Denver, Colorado, USA)

Seven Years of Open Source DevRel Technology Fun With Instaclustr

The Fourth Community over Code Performance Engineering Track (Bratislava, Slovakia, 5 June 2024)

Kafka Summit Bangalore 2024 - Interesting Talks

What Do Hanoi Intersections And Water Puppets Have In Common With Distributed Cloud Systems?

Connecting to Instaclustr Managed PostgreSQL? and Apache Kafka? from Payara Cloud

社区洞察

其他会员也浏览了

AWS vs. Traditional Server Administration: What you need to know

Serverless Computing: How Serverless Computing is Changing the Game for Software Engineers

OpenStack installation on Ubuntu

Red Hat announces Matt Hicks as its new CEO

Open source framework technical characteristics and applicable scenarios （4）

Raft: The Unsung Hero of Distributed Systems

Serverless in September

Isolating Services with Docker: Simplifying Infrastructure and Deployment

CEPH Why? & How?

Scaling Web Application