arbisoft brand logo
arbisoft brand logo
Contact Us

The Small AI Models and Tools Making a Big Splash

Hijab's profile picture
Hijab e FatimaPosted on
5-6 Min Read Time
https://d1foa0aaimjyw4.cloudfront.net/Models_and_Tools_to_Look_Out_For_da26fdeb64.png

In the last few weeks, researchers dropped models so fast, so powerful, and so weirdly creative that they’re rewriting the rules of what AI can do. We’re talking about systems that think in 140 languages, outsmart GPT-4 on a shoestring budget, and even have ‘visual epiphanies’ mid-task. 

 

No hype, no fluff—just the stuff that’ll make your current toolkit look outdated. But here's the kicker - the real magic isn’t just in the models themselves—it’s in how they’re flipping entire industries on their heads. Ready for an amazing round-up? Let’s go.

 

Models - The Heavy Hitters

Let’s start with the brainiacs redefining what “small but mighty” really means.

 

1. Little Gemma 3

Google DeepMind’s Little Gemma 3 is proof that bigger isn’t always better. This lightweight model runs smoothly on a single GPU or TPU, yet outperforms heavyweights like Gemini. It works with 140 languages, processes texts as long as 128,000 tokens (think two copies of War and Peace back-to-back), and even combines text and visual reasoning. 

 

Plus, it teams up with ShieldGemma 2, a 4-billion-parameter safety checker, to automatically filter harmful content.

 

2. Command A from Cohere

Cohere’s Command A is here to save businesses money—without sacrificing power. It matches GPT-4o’s performance in coding, STEM, and business tasks but runs on just two GPUs instead of 32. Speed? It crushes 156 tokens per second (1.75x faster than GPT-4o) and remembers up to 256,000 tokens of context. 

 

3. OLMo 2 and Building Effective Teams for Training Language Models

The AI2 team’s OLMo 2 is shaking up the open-source world. Trained on 5 trillion tokens, this 13-billion-parameter model beats fan favorites like Llama 3.1 and Qwen 2.5 in coding and reasoning tasks. 

 

It's secret? A training hack called RLVR that boosts scores by over 4 points. If you want enterprise-grade AI without the enterprise price tag, this is your go-to.

 

4. Baidu Unveils ERNIE 4.5 and Reasoning Model ERNIE X1

Baidu just dropped ERNIE 4.5, which claims to outperform GPT-4.5 at 1% of the cost. That's a lot to take in. If we put it into numbers, the input/output is priced at RMB 0.004/0.016 per 1,000 tokens. 

On the other hand, ERNIE X1 is excelling in reasoning and tool use, and costs RMB. What’s even better is that Baidu made ERNIE Bot free for individuals, so you can test-drive it right now with enterprise access via Baidu AI Cloud’s Qianfan platform.

 

5. Open-sourced MM-Eureka

Shanghai AI Lab’s MM-Eureka tackles image-text tasks with a unique twist - rule-based reinforcement learning. Instead of brute-forcing through millions of samples, it learns smarter, training on just 54,000 examples (most models need 1 M+). The result is faster learning, sharper accuracy, and a quirky habit of “reflecting” mid-task to adjust its answers. If you work with visuals or multimodal AI, this model’s efficiency and adaptability are game-changing.

 

6. Sesame AI Labs Speech Model CSM 1B

Need AI that talks like a human? Sesame AI’s CSM 1B turns text into natural speech using a Llama-based backbone. It handles conversations, processes speaker turns, and avoids voice cloning or misinformation by design. With 1.55 billion parameters and open-source access, it’s perfect for developers building ethical voice assistants or dialogue systems.

 

The Tools That Are Redefining AI Workflows

Great models need great tools. And this batch? They’re changing the way we design, edit, automate, and even plan our careers.

1. Recraft AI

Recraft AI redefines design workflows with its vector-based image generator. Create crisp illustrations, icons, and 3D renders in seconds, tailored to your brand’s style. Customize colors, adjust line weights, and switch between art styles (flat, 3D, pixel art) effortlessly. Ideal for designers, marketers, and creators who need scalable, editable visuals without hours in Photoshop.

 

2. Wisecut

Wisecut automates the grunt work of video editing. It detects and cuts awkward silences, adds subtitles in 50+ languages, and generates lifelike voiceovers to polish raw footage into professional content. Perfect for YouTubers, educators, and businesses looking to turn interviews, tutorials, or vlogs into sleek videos—no editing skills required.

 

3. Riku AI

Riku AI democratizes AI development. Build, tweak, and deploy custom AI models (chatbots, classifiers, content generators) through a no-code interface. Train models on your data, test prompts in real time, and integrate them into apps or workflows—no engineering team required. It is built for entrepreneurs and creators who want AI tailored to their niche.

 

4. Convergence

Need an AI assistant to handle chores or research? Convergence’s AI agent can book reservations, shop for groceries, or compile data from the web. But fair warning is that it is still learning. Simpler tasks (like finding a gift on Amazon) work better than more complex tasks. Try it risk-free—five tasks per day are free, or pay $20/month for unlimited requests.

 

5. Scribe

ElevenLabs’ Scribe is a transcription superstar—and it’s free until April 9. Unlike most tools, it nails tricky details like website URLs and subtle speech quirks. Need flawless meeting notes or interview transcripts? This is your pick. Works in 99 languages with no hassle.

 

6. Google Career Dreamer

Not sure what’s next for your career? Google’s free Career Dreamer helps you brainstorm paths without logging in or sharing personal details. Describe your job and skills, and it generates a “career identity” and a visual map of related jobs. Click any role to see real openings or jump to Gemini to draft a cover letter. Bonus is that you can build free custom Gemini “Gems” using your own instructions and documents.

 

7. Adobe Enhance Speech

Turn messy audio into studio-quality recordings with Adobe’s upgraded tool. Upload any file with background noise, adjust the new clarity and noise sliders, and download a polished version. Edit the transcript like a text doc in Adobe Podcast, add royalty-free music, and export. It can even support French, German, Spanish, and more. The toll comes with a free trial for a month, or is included with Adobe subscriptions.

 

Parting Thoughts

AI isn’t just growing—it’s evolving into something faster, smarter, and more accessible. These models aren’t just smarter—they’re cheaper, faster, and built for real-world problems. Whether you’re coding, designing, or just geeking out, one of these tools will change how you work.

 

Your move? Stop spectating. Pick one. Test it. Break it. Repeat. Because in this race, the only wrong move is standing still.

 

P.S. Your GPU might complain. Ignore it.

...Loading

Explore More

Have Questions? Let's Talk.

We have got the answers to your questions.

Newsletter

Join us to stay connected with the global trends and technologies