登录查看更多内容

Does your Algorithmic auditor know how to code ?

Sri Krishnamurthy, CFA, CAP

CEO, QuantUniversity | AI Expert | Educator | Author | TedX Speaker |

发布日期: 2023年12月7日

I started my career 25 years ago as an ISO 14001 auditor. At that time, the primary tools I had was Microsoft Excel and Powerpoint. I had the domain knowledge and I would do audits based on the expectations and requirements.

Fast forward 25 years, I just finished teaching a graduate course entitled "Big Data and Intelligent Analytics" at Northeastern University. This is an intense boot-camp styled course, where over 15 weeks, students build full stack AI-applications. Along the way, they learn the problems in working with messy datasets, scaling and deployment and the pain in using multiple components and technologies.

As a summary, here are some of the projects my students worked on!

Data Engineering with PDF Summary Tool: Create a Streamlit app to summarize PDFs, comparing nougat and PyPDF libraries, and integrate architectural diagrams.
Large Language Models for SEC Document Summarization: Develop a tool for summarizing PDF documents, evaluating different libraries, and creating Jupyter notebooks and APIs for Streamlit integration.
Document Summarization with LLMs and RAG: Focus on automating embedding creation, data processing, and developing a client-facing application with secure login and search functionalities.
Data Engineering with Snowpark Python: Reproduce data pipeline steps, analyze datasets, design architectural diagrams, and integrate Streamlit with OpenAI for SQL query generation using natural language.
Project Redesign and Re-architecture: Review existing architecture and redesign using open-source components and enterprise alternatives, focusing on flexible, scalable, and cost-effective solutions.

More interestingly, the technologies used to build these projects involved:

1. Data Processing and Visualization:

- Streamlit

- Nougat or PyPDF libraries

- pandas-profiling

- greatexpectations

- Diagrams tool for architecture visualization

2. APIs and Integration Tools:

- FAST API

- OpenAI APIs

- Docker for containerization

3. Database and Cloud Technologies:

- Airflow

- Pinecone

- JWT (JSON Web Token)

- SQL Database

领英推荐

DABL

360DigiTMG 1 年前

Issue #216 - THE ML ENGINEER ??

Alejandro Saucedo 1 年前

Issue #192 - THE ML ENGINEER ??

Alejandro Saucedo 2 年前

4. Machine Learning and Language Models:

- Huggingface

- LLAMA from Meta

5. Enterprise Solutions:

- Snowpark Python

- Snowflake Marketplace

- Google Bard

- Anthropic

- Cohere

Each student team spent close to 600 hours collectively over 16 weeks to build these applications. Many enterprises spend comparable time and comparable tools to build AI-based applications! With all the hoopla and the emerging profession of AI auditing,

What does this mean if you are an Algorithmic auditor invited to audit such applications ?

Well, first, Algorithmic auditing is not an arm-chair exercise, where you sit back in a conference room and critique the application. Understanding the domain and reviewing the complete architecture holistically is crucial in AI-based applications to ensure the solution is contextually relevant, scalable, and integrates well with existing systems.

Second, Algorithmic auditors often focus narrowly on the algorithm due to its direct impact on decision-making and its complexity. However, considering all components of AI-based applications is crucial for comprehensive risk evaluation. This includes data sources, infrastructure, user interfaces, and integration points. Understanding the domain ensures the solutions are contextually appropriate, addressing specific needs and challenges of the field.

Moreover, a deep understanding of the tech stack is essential for auditors to identify potential risks and vulnerabilities in AI systems, which often arise from interactions between different components rather than solely from the algorithm.

Algorithmic auditors need to know how to code and must have done extensive domain-specific work and application development themselves! So the next time, you are hiring an algorithmic auditor, show them your system architecture of your AI-based system and ask them to explain what they understand before giving them the contract!!

Sri Krishnamurthy

Promotional note: Upcoming events from QuantUniversity :

If you are interested in learning how to build LLM-based applications, QuantUniversity & CFA Society Boston is hosting a 1-day workshop next week on December 14th. The course can be attended online or in-person in Boston! Check https://www.quantuniversity.com/course-details/GENAI.html for details!

AI&Risk Management Newsletter

4,771 位关注者

Paul Starrett

11 个月

I would agree that AI auditing for the purposes of risk spans a broad spectrum. I greatly appreciate the breadth and depth of this article. Other comments to this article touch on critical aspects such as what I like to refer to as public-policy aspects of AI risk (e.g. privacy, bias/fairness) often grounded in laws. One can always refer to NIST's framework for completeness. https://nvlpubs.nist.gov/nistpubs/ai/NIST.AI.100-1.pdf. One thing I do wonder about is the top-level guidance where technical aspects (such as this article) and compliance/policy dimensions, to say nothing of internal enterprise governance (e.g. training, a culture of privacy/cybersecurity awareness, etc.), come together. No single person can be an expert in all areas. What shall be required of this high-level professional's credentials and skill set, knowledge, etc.? This is, for me, a most pressing and underrepresented issue. Replies here are very welcomed! Thank you for a thorough, relevant and informative article!

1 次回应

Christophe Rougeaux

Risk Executive | Ex-McKinsey

11 个月

For a sound risk management of the AI, the AI auditor role (aka model validator or model risk manager) should have an expertise that is not limited to the technical / software development aspects only. At the end of the day, you would also like your AIto be fit for purpose and deployed with limited risk (in terms of data, bias / fairness, cyber risk, conceptual soundness, esg etc.)

3 次回应

David Blaszkowsky

Leader and innovator in data privacy, financial data and data's governance, strategy, regulation, and standardization.

11 个月

Excellent piece, amazing projects. Many thanks for posting! We have not caught up in a while, Sri ... I look forward to the next opportunity!

1 次回应

Patrick Hall

Machine Learning & AI Risk Management

11 个月

Broad expertise is required to audit AI systems--certainly technical AI expertise and domain expertise are needed. Social science expertise and legal expertise are often needed as well.

9 次回应

Joseph X Ng

11 个月

I've dedicated the past few months to a deep dive into algorithmic programming and engineering. Hopefully, we can exchange notes very soon.???

2 次回应

查看更多评论

要查看或添加评论，请登录

查看全部

Does your Algorithmic auditor know how to code ?

Sri Krishnamurthy, CFA, CAP

CEO, QuantUniversity | AI Expert | Educator | Author | TedX Speaker |

领英推荐

AI&Risk Management Newsletter

4,771 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

Integrating Python Pandas with ChatGPT: A new frontier

Types of Sampling in Machine Learning

The Unofficial Guide to Picking the Right Coding AI Assistant for Software Developers

Personal Knowledge Graphs. Semantic Entity Persistence in DataLog. Deductive databases

Streamlit, The Magic of Data Storytelling

Unlocking the Power of Data Science with DSPy: Your Gateway to AI Mastery

Microservices Design IV: Distributed Tracing, Python in Excel and ChatGPT Enterprise

KDnuggets 16:n42: Python Machine Learning Open Source Projects; Facebook Groups for Big Data & Data Science

ML Model Deployment Considerations

Korvus: The Future of Efficient AI Workflows with In-Database RAG

领英推荐

AI&Risk Management Newsletter

4,771 位关注者

Navigating Model Risk Management in the Age of AI

2024年11月7日

QU AI school Lecture 4: Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigation

2024年10月30日

Part 2: Unlocking AI's Potential in Finance: Insights from QuantUniversity's Boston Fintech Week Workshop

2024年10月23日

AI in FinTech: Opportunities,Adoption and the future

2024年10月22日

The Case for Specialized Agentic AI Architectures: Moving Beyond Generic AI Agentic architectures

2024年10月13日

Use of Generative AI in Large Financial Institutions

2024年10月9日

Testing and Validating Generative AI Applications

2024年10月3日

What do you do when your AI system in production fails?

2024年5月21日

AI News from Colorado!

2024年5月18日

10 things you need to know about the EU AI ACT

2024年3月14日

社区洞察

其他会员也浏览了

Integrating Python Pandas with ChatGPT: A new frontier

Types of Sampling in Machine Learning

The Unofficial Guide to Picking the Right Coding AI Assistant for Software Developers

Personal Knowledge Graphs. Semantic Entity Persistence in DataLog. Deductive databases

Streamlit, The Magic of Data Storytelling

Unlocking the Power of Data Science with DSPy: Your Gateway to AI Mastery

Microservices Design IV: Distributed Tracing, Python in Excel and ChatGPT Enterprise

KDnuggets 16:n42: Python Machine Learning Open Source Projects; Facebook Groups for Big Data & Data Science

ML Model Deployment Considerations

Korvus: The Future of Efficient AI Workflows with In-Database RAG