Richards Tu's Blog

Looking Ahead to 2025

2025-02-06T17:22:56Z

Although it's February now, I still think it would be quite nice to wrap up 2024 together with the beginning of 2025 and look forward into the new year.

The past thirteen months have been great, with a lot of good things happening. We've made huge progress across models' multimodal, reasoning, and agentic abilities, which are all important components on my own imaginary roadmap to capable AI systems that would have a huge impact on our species (or what people call "AGI").

Envisioning Our Future with AI

2024-10-12T10:33:36Z

This blog is inspired by Dario’s “Machines of Loving Grace”

My Few Thoughts on OpenAI's o1 family models

2024-09-15T06:30:01Z

Thinking Models Are Good Models

My Few Thoughts on LLM's Reasoning Ability

2024-08-19T08:00:00Z

People are having debate on this topic on X these days. Some ppl say that LLM can definitely reason because it can help us do math and code on some extent; but some other guys argue that LLM can't reason and they are not designed for it, what they do is just recite in training data.

My Few Thoughts on Agents and Model's Autonomous Behavior

2024-08-12T18:00:00Z

This is definitely one of the hottest topic these days lol. And I personally think that except the agent itself, its autonomous behavior is also really intriguing, since they're both related to how capable the base model is and also how dangerous the model can be.

My Few Thoughts on Compute Scaling

2024-08-08T12:00:00Z

To scale, or not to scale? It's a really interesting topic. Scaling Laws is a famous law in AI and ML, and the people's opinion's on it is also diverse. So I wanna share some thoughts on compute scaling, which is a part of the Scaling Laws.

My Few Thoughts on AI Ethics

2024-07-10T14:50:00Z

Tl;dr

AI development is rapidly outpacing societal adaptation
We should focus on adapting AI development to benefit human society
Concerns about job displacement can be mitigated by creating new AI-related roles
Combining AI with various scientific domains could accelerate breakthroughs
AI raises existential and philosophical questions about human uniqueness and purpose

Testing with new Claude-3.5 Sonnet

2024-06-20T15:24:05Z

Just now, Anthropic released their Claude-3.5 Sonnet (see the announcement here), and promised to release rest of the models in the model family later this year (i hate this 🫠).

Tl;dr

3.5 Sonnet is more capable than 3 Opus yet cheaper;
3.5 Sonnet is really good at reasoning tasks (it will ask you for "elaboration" frequently);
3.5 Sonnet is good at vision;
3.5 Sonnet has an updated knowledge cutoff date;
and more...

Current AI Development Path

Accelerate faster please.🥺

Solution Sharing and Some Thoughts about Alibaba Global Mathematical Competition for AI

2024-06-16T13:34:58Z

In Alibaba Global Mathematical Competition for AI, I created a system called "Self-Iterative Agent System for Complex Problem Solving"; however, I was surprised that it was so simple yet achieved such good results compared to others who used complex multi-agent systems. And for the details of my solution, I've uploaded to GitHub, you can find the URL in Read More. Feel free to share your opinion! 🤗

My Few Thoughts on AI Security and AGI

2024-06-06T10:38:00Z

Tl;dr

AI security crucial for future development; advocates for precautions
Seeks balance between AI capabilities and safety measures
AGI definition debatable; suggests exceeding average human ability in most domains
Emphasizes importance of AI understanding context-dependent language
Considers implications of advanced AI and hopes for beneficial outcomes

Richards Tu's Blog

Looking Ahead to 2025

Envisioning Our Future with AI

My Few Thoughts on OpenAI's o1 family models

My Few Thoughts on LLM's Reasoning Ability

My Few Thoughts on Agents and Model's Autonomous Behavior

My Few Thoughts on Compute Scaling

My Few Thoughts on AI Ethics

Tl;dr

Testing with new Claude-3.5 Sonnet

Tl;dr

Current AI Development Path

Solution Sharing and Some Thoughts about Alibaba Global Mathematical Competition for AI

My Few Thoughts on AI Security and AGI

Tl;dr

Read more »