Author: Zeke, YBB Capital Researcher
1. Starting with the love of new and old for attention
In the past year, due to the lack of narrative at the application layer and the inability to match the speed of infrastructure explosion, the crypto field has gradually become a game for attention resources. From Silly Dragon to Goat, from Pump.fun to Clanker, the love of new and old for attention has made this battle involute all the way. Starting with the most clichéd eye-catching monetization, it quickly evolved to a platform model that unifies attention demanders and suppliers, and then silicon-based biology became a new content provider. Among the various carriers of Meme Coin, there finally appeared a kind of existence that can make retail investors and VCs reach a consensus: AI Agent.
Attention is ultimately a zero-sum game, but speculation can indeed make things grow wildly. In our article about UNI, we reviewed the beginning of the last golden age of blockchain. The rapid growth of DeFi originated from the LP mining era opened by Compound Finance. Going in and out of thousands or even tens of thousands of mining pools in Apy was the most primitive way of gambling on the chain at that time, although the final situation was that various mining pools collapsed. But the crazy influx of gold miners did leave unprecedented liquidity for the blockchain. DeFi eventually broke away from pure speculation and formed a mature track, meeting the financial needs of users in payment, trading, arbitrage, pledge and other aspects. AI Agent is also experiencing this barbaric stage at this stage. What we are exploring is how Crypto can better integrate AI and ultimately promote the application layer to a new height.
2. How intelligent agents are autonomous
In the previous article, we briefly introduced the origin of AI Meme: Truth Terminal, and the future prospects of AI Agent. This article focuses first on AI Agent itself.
Let's start with the definition of AI Agent. Agent is an old but unclear term in the field of AI. It mainly emphasizes Autonomous, that is, any AI that can perceive the environment and make reflections can be called Agent. In today's definition, AI Agent is closer to intelligent agent, that is, a system that imitates human decision-making is set for the big model. In academia, this system is regarded as the most promising way to AGI (general artificial intelligence).
In the early version of GPT, we can clearly perceive that the big model is very similar to humans, but when answering many complex questions, the big model can only give some plausible answers. The essential reason is that the big model at that time was based on probability rather than causality. Secondly, it lacked the ability of humans to use tools, memory, planning, etc., and AI Agent can make up for these defects. So to summarize it with a formula, AI Agent (intelligent agent) = LLM (big model) + Planning (planning) + Memory (memory) + Tools (tools). The large model based on prompt words is more like a static person. It comes to life only when we input. The goal of the intelligent agent is to be a more real person. The current intelligent agents in the circle are mainly fine-tuned models based on Meta's open source Llama 70b or 405b versions (the two have different parameters). They have the ability to remember and use API access tools. In other aspects, they may need human help or input (including interactive collaboration with other intelligent agents). Therefore, we can see that the main intelligent agents in the circle today still exist on social networks in the form of KOLs. To make intelligent agents more human-like, it is necessary to access planning and action capabilities, and the sub-item thinking chain in the planning is particularly critical.
3. Chain of Thought (CoT)
The concept of Chain of Thought (CoT) first appeared in the paper "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models" published by Google in 2022. The paper points out that the reasoning ability of the model can be enhanced by generating a series of intermediate reasoning steps, helping the model to better understand and solve complex problems.
A typical CoT Prompt It consists of three parts: clear instructions, task description, logical basis, theoretical basis or principle to support task solution, examples, and specific solution demonstrations. This structured approach helps the model understand the task requirements and gradually approach the answer through logical reasoning, thereby improving the efficiency and accuracy of problem solving. CoT is particularly suitable for tasks that require in-depth analysis and multi-step reasoning, such as math problem solving, project report writing, and other simple tasks. CoT may not bring obvious advantages, but for complex tasks, it can significantly improve the performance of the model, reduce the error rate through a step-by-step solution strategy, and improve the quality of task completion.
When building AI Agents, CoT plays a key role. AI Agents need to understand the information they receive and make reasonable decisions based on it. CoT helps Agents effectively process and analyze input information by providing an orderly way of thinking, and converts the parsing results into specific action guidelines. This method not only enhances the reliability and efficiency of Agent decisions, but also improves the transparency of the decision-making process, making Agent behavior more predictable and traceable. CoT helps Agents carefully consider each decision point by breaking down tasks into multiple small steps, reducing wrong decisions caused by information overload. CoT makes Agent's decision-making process more transparent and users can more easily understand Agent's decision basis. In interacting with the environment, CoT allows Agents to continuously learn new information and adjust their behavior strategies.
As an effective strategy, CoT not only improves the reasoning ability of large language models, but also plays an important role in building more intelligent and reliable AI Agents. By using CoT, researchers and developers can create intelligent systems that are more adaptable to complex environments and have a high degree of autonomy. CoT has demonstrated its unique advantages in practical applications, especially when dealing with complex tasks. By breaking down the task into a series of small steps, it not only improves the accuracy of task solving, but also enhances the interpretability and controllability of the model. This step-by-step problem-solving approach can greatly reduce the number of wrong decisions caused by too much or too complex information when facing complex tasks. At the same time, this approach also improves the traceability and verifiability of the entire solution.
The core function of CoT is to combine planning, action and observation to bridge the gap between reasoning and action. This thinking mode allows AI Agents to formulate effective countermeasures when predicting possible abnormal situations, as well as accumulate new information and verify pre-set predictions while interacting with the external environment, providing new reasoning basis. CoT is like a powerful accuracy and stability engine that helps AI Agents maintain high efficiency in complex environments.
Fourth, the correct pseudo-demand
What aspects of the AI technology stack should Crypto be combined with? In last year's article, I believed that the decentralization of computing power and data is a key step to help small businesses and individual developers save costs. In the Crypto x AI segmentation track compiled by Coinbase this year, we saw a more detailed division:
(1) Computing layer (referring to the network that focuses on providing graphics processing unit (GPU) resources for AI developers);
(2) Data layer (referring to the network that supports decentralized access, orchestration and verification of AI data pipelines);
(3) Middleware layer (referring to the platform or network that supports the development, deployment and hosting of AI models or intelligent agents);
(4) Application layer (referring to user-oriented products that use on-chain AI mechanisms, whether B2B or B2C).
Each of these four division layers has a grand vision, and its goal is to fight against the Silicon Valley giants dominating the next era of the Internet. As I said last year, do we really have to accept the exclusive control of computing power and data by Silicon Valley giants? The closed-source big model under their monopoly is a black box inside. Science is the most believed religion of mankind today. In the future, every sentence answered by the big model will be regarded as the truth by a large part of people, but how to verify this truth? According to the vision of Silicon Valley giants, the rights that intelligent entities will eventually have will be beyond imagination, such as the right to pay for your wallet and the right to use the terminal. How to ensure that people have no evil thoughts?
Decentralization is the only answer, but sometimes we need to reasonably consider comprehensively, how many people pay for these grand visions? In the past, we could use Token to make up for the errors caused by idealization without considering the closed loop of business. But the current situation is very serious. Crypto x AI needs to be designed in combination with the actual situation. For example, how to balance the supply of both ends of the computing power layer when the performance is lost and unstable? In order to achieve the competitiveness of matching centralized cloud. How many real users will there be in the data layer project? How to verify the authenticity and validity of the data provided, and what kind of customers need this data? The same is true for the other two layers. In this era, we don’t need so many seemingly correct pseudo-demands.
5. Meme has gone beyond SocialFi
As I said in the first paragraph, Meme has already gone out of the SocialFi form that conforms to Web3 in an ultra-fast way. Friend.tech is the Dapp that fired the first shot of this round of social applications, but unfortunately failed in the token design that was too eager for success. Pump.fun has verified the feasibility of pure platformization, without any tokens or rules. The demanders and suppliers of attention are unified. You can post memes, do live broadcasts, issue coins, leave messages, and trade on the platform. Everything is free. Pump.fun only charges service fees. This is basically the same as the attention economy model of social media such as YouTube and Ins today, except that the charging objects are different, and Pupm.fun is more Web3 in terms of gameplay.
Base's Clanker is the culmination of all. Thanks to the integrated ecosystem personally managed by the ecosystem, Base has its own social Dapp as an auxiliary to form a complete internal closed loop. Meme is the 2.0 form of Meme Coin. People always seek novelty, and Pump.fun is now at the forefront of the storm. From the trend point of view, it is only a matter of time before the fantasy of silicon-based organisms replaces the vulgar stalks of carbon-based organisms.
I have mentioned Base for the umpteenth time, but the content mentioned each time is different. From the timeline, Base has never been a first mover, but it is always a winner.
Sixth, what else can an intelligent agent be?
From a pragmatic point of view, it is impossible for intelligent agents to be decentralized for a long time in the future. From the perspective of the construction of intelligent agents in the traditional AI field, it is not a problem that can be solved by simply decentralizing the reasoning process and open source. It needs to access various APIs to access the content of Web2. Its operating cost is very expensive. The design of the thinking chain and the collaboration of multiple intelligent agents usually rely on a human as a medium. We will go through a very long transition period until a suitable fusion form appears, perhaps like UNI. But like the previous article, I still think that intelligent agents will have a great impact on our industry, just like the existence of Cex in our industry, which is incorrect but very important.
The article "AI Agent Overview" issued by Stanford & Microsoft last month described a lot of applications of intelligent agents in the medical industry, intelligent machines, and virtual worlds. In the appendix of this article, there are already many experimental cases of GPT-4V as an intelligent agent participating in the development of top 3A games.
There is no need to force the speed of its combination with decentralization. I hope that the first puzzle piece that the intelligent agent fills is the bottom-up ability and speed. We have so many narrative ruins and blank metaverses that need it to fill. At the right stage, we will consider how to make it the next UNI.