Leverage AIX lsmpio to discover some details about issues on SAN networks

Ivan Rakus

ICT & Cloud consultant at Aliter Technologies

Published: February 22, 2023

While working in a customer environment on AIX 7.2 (7.2 TL5 SP3) systems, I came across an interesting problem on the SAN network. A running AIX system whose disks were presented from one SAN storage cluster (V7K-1) redundantly via NPIV (dual VIOS, two fabrics, etc.) showed no problems.

When another SAN LUN from a second SAN storage cluster (V7K-2; note: different storage ports) was mapped to the same AIX client system and a standard discovery ( cfgmgr ) was run, the system reported read-only status for rootvg, and the paths to both the old and the new disks went into a degraded / failed state (path status "Deg,Fai" in the output of lsmpio and lsmpio -ar).

The -a flag lists parent Fibre Channel adapter information.

The -r flag adds information about remote ports.
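
On a system with many disks, the affected paths can be picked out of the listing by filtering on the path status. What follows is a minimal sketch, not a definitive tool: the function name `failed_paths` is hypothetical, and it assumes that the default lsmpio listing carries the path status (for example "Sel,Opt" or "Deg,Fai") in the fourth whitespace-separated column, which should be verified against the output on your own system.

```shell
# Hypothetical helper: read an lsmpio-style path listing on stdin and print
# only the rows whose path status column (ASSUMED to be field 4) contains
# Fai (failed) or Deg (degraded).
failed_paths() {
    awk '$4 ~ /(^|,)(Fai|Deg)(,|$)/'
}
```

Typical use would be `lsmpio | failed_paths`, or `lsmpio -l hdisk0 | failed_paths` for a single disk.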

As soon as the new SAN LUN was unmapped, the system recovered and the previous functional state was restored automatically. The key finding was that the problem occurred only when disks were presented from two different V7K storage clusters; with disks from a single storage cluster, the problem did not appear.

Originally, the virtual FC (vFC) ports of the client AIX partition (dyntrk=yes, fc_err_recov=fast_fail) were mapped via NPIV to the individual V7K-1 storage ports as follows:

LPAR

V7K-1

After the disk configuration of the AIX client partition was expanded (SAN zoning was done automatically and correctly by PowerVC automation), the following connections to the V7K-2 storage ports were added:

V7K-2

After the next standard discovery ( cfgmgr ) on the client AIX partition, the system entered the read-only state ("read permission only"). The reported system error corresponds to the error message and value defined in the errno.h system header file. The only question was: why?

LPAR

The next picture shows that the adapter WWPN is 0, the paths are failed, and the SAN IDs are N/A for each port.

LPAR

Discussing the above with IBM Support led to the following conclusion:

Decoding errlog file, we see The VFC4_ERR15 with VFC_ERR_LOC_248 indicating the VIO servers forwarded a link down event due to a SCN received either for Fabric or Domain, however we don't get any related issue on the VIOSes.

It looks something happened on the Storage leading to this SCN, as there were no real link down issue on the physical ports, and the VIOS did not report any link down.

IBM Support, vio — vio server errors VFC4_ERR15, VFC_ERR_LOC_248

The entries match a known issue when using NPIV, described in HIPER APAR IJ31604 and APAR IJ32895, both of which are currently missing on the host.

Summary

The combo fix IJ32895m2a ( devices.vdevice.IBM.vfc-client.rte ), covering APAR IJ32895 (DOMAIN RSCN CAUSE IO PATH FAILURE) and APAR IJ31604 (FABRIC FORMATTED RSCN CAUSE IO PATH FAILURE), was issued for this specific problem on the SAN network. Applying the combo fix requires a reboot.
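
Whether a given host already carries the fix can be checked from the command line. The sketch below is only an illustration under stated assumptions: the helper names `ifix_installed` and `apar_installed` are hypothetical, the first assumes the interim fix label appears in the `emgr -l` listing, and the second assumes `instfix -ik` exits with a non-zero status when the APAR is not installed (the case to expect until a service pack containing it has been applied).

```shell
# Hypothetical helper: succeed if an interim fix whose label contains the
# given string appears in the `emgr -l` listing (ASSUMPTION about the output).
ifix_installed() {
    emgr -l 2>/dev/null | grep -q "$1"
}

# Hypothetical helper: succeed if `instfix -ik` reports the APAR as installed
# (ASSUMES a non-zero exit status when it is not).
apar_installed() {
    instfix -ik "$1" >/dev/null 2>&1
}
```

For example: `ifix_installed IJ32895m2a || apar_installed IJ32895 || echo "fix for IJ32895 not found"`.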

References:

# IJ31604: FABRIC FORMATTED RSCN MAY CAUSE IO PATH FAILURE APPLIES TO AIX 7200-05, 06 December 2022

https://www.ibm.com/support/pages/apar/IJ31604

# IJ32895: DOMAIN RSCN MAY CAUSE IO PATH FAILURE WHICH NEVER RECOVERS

https://www.ibm.com/support/pages/apar/IJ32895
