Gremlin

Gremlin

软件开发

San Jose,California 11,511 位关注者

The Reliability Management Platform for high-velocity engineering teams

关于我们

Gremlin’s Reliability Management Platform enables high-velocity engineering teams to standardize and automate reliability across their organizations without slowing down software delivery. Gremlin's Reliability Score sets the standard for reliability so there's no guesswork, and an automated suite of Reliability Management tools makes it easy to integrate reliability throughout the software lifecycle so there's no slowdown.

网站
http://www.gremlin.com
所属行业
软件开发
规模
51-200 人
总部
San Jose,California
类型
私人持股
创立
2016
领域
Distributed Systems、Resilience、Failures as a Service、DevOps和Chaos Engineering

地点

  • 主要

    55 S Market St

    Ste 1205

    US,California,San Jose,95113

    获取路线
  • 555 Montgomery St

    Ste 811

    US,California,San Francisco,94111

    获取路线

Gremlin员工

动态

  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    If you spent 2024 improving your service’s reliability, but aren't sure how to demonstrate it, this week's Office Hours session is for you. Join us this Thursday at 11 AM PST to learn how Gremlin can help you analyze and understand your organization’s reliability posture for a seamless start to the new year. Register at the link in the comments. P.S. Can't make it? Register anyway and we'll send you the link to the recording.

    • 该图片无替代文字
  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    Instead of waiting for January to tackle improvements, take advantage of end of year code freezes to set your team up for success in 2025. Here’s how: 1️⃣ Reflect on incident trends from 2024. What patterns stand out? 2️⃣ Review your system health. Run critical tests to catch hidden risks early. 3️⃣ Plan your 2025 reliability goals. Where can you move from reactive to proactive? Take it one step further by starting a regular reliability management cadence. Small actions now mean fewer surprises and smoother scaling tomorrow. Ready to make the most of your downtime? Explore Gremlin’s 7 essential reliability tests at the link in the comments. https://lnkd.in/gipz7Wnz

    Seven tests to measure and improve reliability: what matters and how it works

    Seven tests to measure and improve reliability: what matters and how it works

    gremlin.com

  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    📈 72% of organizations were using AI at the start of 2024, and that number is likely to keep growing throughout 2025. With expectations around AI reaching historic highs, the reliability of these systems is more important than ever. Enter, Gremlin GPU- the easiest way to build more resilient machine learning and AI models, and test and validate the scalability of your systems. Learn more at the link in the comments.

  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    2025 is fast approaching, which means it’s the perfect time to look at how you can build a culture of reliability within your organization. Urgent troubleshooting burns out teams and creates a reactive environment.  Introduce proactive reliability efforts and you start to build a calm, collaborative culture. So where do you start? See how Ritchie Bros. created a culture of reliability and drove innovation within their team: https://lnkd.in/gmcj26jH

    How Ritchie Bros Creates a Culture of Reliability

    How Ritchie Bros Creates a Culture of Reliability

    gremlin.com

  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    2025 is just a few weeks away–how are you going to change your reliability efforts going into the new year? 🚀 Where to start: run reliability tests to test for cracks in your system. And while you’re at it, make regular reliability management part of your team’s workflow (we recommend a weekly cadence). Get ahead of potential issues and scale effectively with Gremlin’s Reliability Management Quick Start Guide- link in the comments. ⬇️ 🚀 Bonus: Once you’ve run the essential tests, put those insights to work. Make a plan for addressing any uncovered risks, and create reports for stakeholders in your organization. A more reliable 2025 starts today. 

  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    No matter what type of  AI model you use, they all have one thing in common: they need to crunch a lot of data, and GPUs are the most effective tool. 💡 But what happens when those GPUs are busy? Can your infrastructure scale to meet changing demand? ➡️ Enter, Gremlin GPU experiments, the best way to test and validate the scalability of your systems. If you have an LLM deployed, run a GPU experiment alongside it to simulate heavy loads or additional workloads. While the experiment is running, monitor the performance, throughput, and availability of your LLM to determine what (if any) impact there is. It’s easy to get started- find it in the Gremlin web app, or learn more (and get started with your free trial) at the link in the comments. 

    • 该图片无替代文字
  • 查看Gremlin的公司主页,图片

    11,511 位关注者

    Even the most robust systems can experience downtime. This year’s CrowdStrike outage—and smaller outages across the industry—served as a warning: no system is immune. So, how do you prepare for the unpredictable? 1️⃣ Run reliability tests regularly. Identify weak spots before they become critical failures. 2️⃣ Simulate outages. Test your disaster recovery plan with realistic simulations.  3️⃣ Automate where possible. Reduce human error and recover faster. These outages aren’t just wake-up calls—they’re opportunities. The companies that proactively invest in reliability today will minimize customer impact and protect their reputations tomorrow. 💡 See how the team at Ritchie Bros. uses Chaos Engineering to prepare for the unexpected and ensure resilience: https://lnkd.in/gmcj26jH

    How Ritchie Bros Creates a Culture of Reliability

    How Ritchie Bros Creates a Culture of Reliability

    gremlin.com

相似主页

查看职位

融资

Gremlin 共 3 轮

上一轮

B 轮

US$18,000,000.00

投资者

Redpoint
Crunchbase 上查看更多信息