AI meets Crypto, a view from crypto and infra background
Special thanks to Lei Tang Davide Crapis Leo Chen Siyuan Han Bo Yu for feedback and review.
I left Coinbase in March after spending 3 years there building the crypto platform for its 100M+ customers, primarily on Identity and Onboarding solutions for both web2 and web3 products.?
In the past two months, without a full time job, I got more time with my family, doing gardening, testing Tesla FSD edge cases and spending hundreds of hours exploring AI, particularly LLMs. This current AI wave seems more real than ever, given my background in crypto, I ponder what it means when AI meets Crypto. I asked myself several questions and below are the answers I got so far.
1: Will AGI Be A Reality? If Yes, When??
It depends on how we define AGI (Artificial General Intelligence). To me AGI means human-like reasoning, learning, problem solving and continuous improvements. It’s a matter of when not if, though I won’t spend too much time speculating about the timing.?
As a matter of fact, in the past 18 months or so, one branch of AI - Large Language Models (LLMs) - have made major technology and adoption breakthroughs. ChatGPT reached 100M users in just 2 months, the prior fastest one was Tiktok in 9 months. On the flip side, hallucination remains a major technical gap, besides all these ethical, societal, regulatory and geo-political concerns.?
I fine-tuned and run a few open source models myself locally on my Apple M2 silicon, even the most performing one - LlaMa3 (8B) - often responded to me with “logical but wrong” hallucinating answers.
Obviously there is more work to do before AGI, it might take years or even decades. Nevertheless AI gets more powerful everyday, it can do more types of work and do them more intelligently, see a diagram from The Internet of Agents by Davide Crapis.
2: What Are The Roadblocks To AGI??
?We often hear from people like Sam Altman and Elon Musk that AI scaling law will soon or eventually be constrained by compute, energy and/or training data before AGI. For sure we need a longer term vision and roadmap to revolutionize the GPU and energy industry (e.g. fusion energy), and (using AI to) generate unlimited, diversified, high quality synthetic training data.?
However, these seemingly new problems can potentially be solved or at least mitigated by existing solutions. In the short to mid term, to me this is an efficiency game in every step of the AI process. We should and must do performance profiling and optimization on training, fine-tuning and inference, e.g. more efficient data and model parallelization, cutting every waste in GPU memory and bandwidth consumptions, quantization, batch and offline inference processing etc. The good news is, Meta’s recent well received LlaMa3 LLM is a perfect example of efficiency meets performance.?
Another seemingly new problem around the globe GPU / energy shortage, my understanding is it’s more of a distribution problem than an overall supply problem. We should and can find an existing solution to take advantage of all those idle GPUs across data centers and energy across regions, without a longer term GPU rearchitect and revamping of the globe energy network. Crypto could help here (see later discussion).?
In addition, there are broader ethical, social, regulatory, geopolitical and interdisciplinary implications and challenges, which I will leave to future discussions.??
3: Large Or Small Language Model?
Large Language Models are getting larger and larger. GPT4 has 1.75 trillion parameters, fed with 15 trillion tokens and was trained by 25,000 NVIDIA A100 GPUs. The soon to come GPT5 will certainly out number these, people were speculating massive $2.5B training cost for GPT5.?
领英推荐
However, models don't always have to go bigger. Instead of a monolithic approach, we could adopt a modular approach just like any other modern computer systems. We could use a handful of medium/large size models, e.g. LlaMa3 70B, as base models, and build much smaller domain specific incremental models on top of them. During inference, these incremental models can share and connect through the same base model. In this way, one application can access multiple incremental models and base models at the same time. Hence not all these companies have to build their own super, costly and largely overlapped generalized model, instead we can keep expanding the overall model capability with an open community, building and sharing base models and incremental domain specific models on top of each other. (see the below diagram)
Moreover, companies like Apple and Microsoft are experimenting in a different direction. Instead of LLMs, they are building SLMs (Small Language Models), e.g. Apple OpenELM (270M, 450M, 1.1B, 3B) and Microsoft Phi-3 Mini (3.8B), both trying to balance model size and performance. They may not be as powerful and general as LLMs like ChatGPT 4, Claude 3 Opus, let alone the incoming GPT 5/6, but they could be very suitable for cases like edge AI on smaller IoT devices without reliable internet connection, or privacy sensitive scenarios like Amazon Alexa or Google Home or certain highly regulated industry where data needs to be kept on premises. SLMs inference can be 100% hosted on edge like a mobile device, without touching any remote servers or internet at all.
4: Does AI Need Crypto? And Vice Versa.?
While AI does not inherently "need" crypto to function, and vice versa, the integration of these technologies could lead to significant advances in both fields. Each can enhance the capabilities and address the limitations of the other, potentially leading to more robust, efficient, and trustworthy systems.
What can Crypto do for AI?
What can AI do for Crypto?
5: What Are The Challenges To Bring AI On-chain??
Let’s start with a quick comparison between AI and Crypto for their default settings:
As you can see, AI and Crypto defaults are different or even quite opposite in most dimensions. Bringing AI on-chain presents a set of unique challenges that stem from the fundamental differences in how these technologies are designed and operated. Here are some of the key challenges and possible solutions:?
Conclusion: (Generated by chatGPT)
As we stand at the crossroads of two revolutionary technologies, AI and crypto, the potential for transformative change is immense. The integration of AI's deep learning capabilities with blockchain's secure and decentralized framework promises to redefine how we handle data, execute transactions, and build trust in digital interactions. However, as outlined, challenges such as scalability, verifiability, data privacy and regulatory compliance remain formidable but not insurmountable.
While the technical hurdles are significant, the opportunities—for efficiency, security, and new business models—are profound. Innovations like intent-based smart contracts and enhanced security for crypto platforms highlight just the beginning of what's possible when these technologies converge. However, as we push forward, a balanced approach that considers ethical, social, and economic implications is crucial.
Ethereum | PIN AI
10 个月Amazing to be a coreviewer with Lei Tang , our latest coauthorship is a US patent for the AI system for Lyft driver marketplace few years back ??