A Technologist Perspective on the AI Position Paper of The Law Society of Hong Kong
Ryan Purkey
Managing Consultant @ LexiTech Consulting specializing in Digital Transformation of the Hong Kong Legal Industry
To kickoff 2024,?The Law Society of Hong Kong?released a Position Paper:?The Impact of Artificial Intelligence on the Legal Profession. It joins work from an array of organizations seeking to define and scope Artificial Intelligence (AI), especially within the professional services sector. At just 14 pages, it is a quick read, seeking to address the far-reaching impact of AI with adequate urgency, a challenge that is being felt across economies and industries as AI output continues to improve at light speed in quality, quantity and variety. It marks a strong beginning in a journey to?educate, engage and work with the Hong Kong legal sector in addressing developments in AI.
EXECUTIVE SUMMARY?
Before diving in, there is one notable misstep in the paper, calling AI models which use data sourced from the public internet, "Open AI".?And while this review is technology-focused and not from the legal perspective, it is worth noting Quinn Emanuel, representing OpenAI, Inc. is in a trademark infringement lawsuit against Guy Ravine and Open Artificial Intelligence, Inc.[1] regarding the usage of those same terms: OpenAI / Open AI. Back to the technologist's point of view, go to a search engine looking for "Open AI" and see which way the 100s of millions of results lean.
The "Open AI" misstep does help introduce a major issue all facets of the industry are dealing with: definitions and standardization. Major standards bodies--such as ISO, NIST, IAB Tech Lab, IAPP and others--are still working on standards with most at the beginning stages of building out published and reviewed definitions in relation to AI[2]. Still, "Open AI" as a term is in direct conflict with a major player in the industry and should not define a model that, "..goes out into the open internet and learns all the word patterns it can find."[3] Rather it should be replaced in upcoming Law Society documents with terms that are both more descriptive--ethical AI, transparent AI, AI governance--model specific, and less conflicted.?
Apart from the trademark considerations, the Law Society's definition combines two facets of AI that are often addressed separately: models and datasets. The current magic in Large Language Models (LLMs) made famous by ChatGPT is found in the combination of those two aspects:
As a simple example, if ChatGPT only trained on Cantonese, it would only 'understand' and 'reply' in Cantonese--even though the model could be the same as one that can produce answers in English. To continue our example, English writing would need to be added to the training data before ChatGPT could interact in English, again without the model ever being re-engineered.
Similarly, an unchanged dataset can produce dramatically different results by changing the type of model that interacts with that dataset. These model types include things like: computer vision, speech-to-text, natural language processing, multi-modal, etc. Next, within a given model type there are different "weights" and "temperatures" that can be scaled in order to affect output, not to mention other engineering factors. As an example of how many models can be created within one type, see the hundreds of entries on the Open LLM Leaderboard at Hugging Face.[4]?
The relationship between models and datasets circles back together though for training. Often the best scores for weights and temperatures are those derived from a large enough and sufficiently focused dataset. as Stephen Wolfram writes in his treatise, What is ChatGPT doing and why does it work?, "...weights are normally determined by 'training' the neural net using machine learning from examples of the outputs we want."[5]?Clear-eyed lawyers reading this far may be thinking, "If only ChatGPT had the entire written judgments of the Chief Justice McJudgy Judge[6], and trained on outputs where my firm won the case..." Indeed, if you are thinking along those lines, you're well on your way to understanding AI "weights" "temperatures" "training data" and "outputs".?
Hopefully the above breakdown, albeit brief, emphasizes the point of having clarity and commonality in terms.
Now that the "Open AI" hurdle is passed, the strengths of the Law Society Position Paper come through.
STRENGTHS (emphasis mine)
Prescriptions?
Cautions?
If a six-point highlight of a 14 page paper isn't succinct enough for you, here's the TL;DR: continue to learn about AI, be vigilant when AI output requires facts, and be extra careful to protect confidential data. To this end, the Law Society leans in to another strength with its plans to inform, engage and implement continuing professional development, promote ethical standards and best practices, and work as a hub for AI law-related resources in Hong Kong.
The weaknesses of the paper are much more open to interpretation than the usage of "Open AI", and are mostly the views and bias of your author here. That said, some quibbles to consider:
QUIBBLES (emphasis and commentary mine)
?
RECOMMENDATIONS
Educate. I wholly agree with Law Society plans to offer educational events and seminars on AI to the legal community in Hong Kong. As already seen even in the technology-focused standards organizations, there is a struggle to keep up with the pace of change brought on by AI. At a basic level, understand the factors that contribute to how an AI functions--prompts, weights, temperature, tokens, context windows and parameters--and begin to get a rough idea of how these factors compare in the performance of different models.
Partner. If not formally--at least in general alignment--the Law Society and lawyers should seek to define their AI terminology inline--and complimentary to--other global organizations.
领英推荐
Strengthen. If you or your firm have a data secure way to use AI, then lean into its strengths. Does your firm have a style guide? Make that style guide the training and output format instructions of the AI. Does your workflow benefit from summaries or in reframing and restructuring documents? See what AI can achieve in speed and efficiency. Wondering what your AI knows about the data its ingested? Ask it. LLMs tailor output to the structure and questions asked within a context window, so a well-formed prompt--or series of prompts--can make a dramatic difference in the reliability and repeatability of output.
CONCLUSION
Apart from the shoehorned use of "Open AI" into a mangled and constrained definition, the AI position paper of the Law Society is a solid beginning.?Its forward-looking approach to the next phases of educating, engaging, and participating with the Hong Kong legal sector and developments in AI are its strengths. It openly acknowledges the challenges faced by sole proprietorships and small firms. From this beginning, the Law Society can shore up a few weaknesses, improve alignment with the technology community at large, and better address the day-to-day technology needs of Hong Kong lawyers. We at LexiTech look forward to being a part of these coming changes and are happy to answer any questions you may have.?
REQUEST FOR COMMENTS (RFC)
In a nod to the origin of the Internet, please comment on this article below.*? Your time and constructive, polite opinions are greatly appreciated.
*"Request For Comments (RFC’s) documents were invented by Steve Crocker in 1969 to help record unofficial notes on the development of the ARPANET. They have since become the official record for Internet specifications, protocols, procedures, and events." [Source]
[1]?https://www.pacermonitor.com/public/case/49838878/OPENAI,_INC_v_Open_Artificial_Intelligence,_Inc_et_al?for more details.
[2]?https://www.iso.org/standard/81230.html - As of April 2024, ISO/IEC 42001 the international standard that specifies requirements for establishing, implementing, maintaining, and continually improving an Artificial Intelligence Management System (AIMS) is published, but still under review.
[3]?The Impact of Artificial Intelligence on the Legal Profession, Position Paper of The Law Society of Hong Kong, January 2024, p. 7, Item 19
[4]?Also: https://huggingface.co/models where rankings can be sorted by "most likes" "most downloads" "trending" and so on.
[5]?https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ - "The basic idea is to supply lots of “input → output” examples to “learn from”—and then to try to find weights that will reproduce these examples."
[6]?Fictional
[7]?The Impact of Artificial Intelligence on the Legal Profession, p. 11, Item 40
[8]?Ibid., p. 5, Item 15
[9]?Ibid., p. 8, Item 33, (g)
[10]?Ibid., p. 7, Item 27
[11]?Ibid., p. 5, Item 11
[12]?Ibid., p.6, Item 20
[13]?Ibid., p. 5, Item 18
[14]?Ibid., p. 6, Item 25
[15]?The integration of Copilot, powered by OpenAI and GPT-4, into Microsoft 365 was announced on 16 March, 2023.?
This article originally appeared on LexiTech Consulting. Published here with permission.