WORLD.RICKRUBEN.COM
llm
The rhythm of algorithm needs to change
Announcing our official LangChain integration
LLM Agent’s Arsenal: A Beginner’s Guide to the Action Space
Aetheria: Reimagining Material Discovery with Autonomous AI Agents
Advanced mq Techniques: From Simple Queries to Complex Workflows
Revolutionizing LLM Interactions: Code2Prompt – Your Code’s New AI Assistant
Removing LLM Conversation Length Limits and Solving the Lost-in-Multi-Turn-Conversation Problem by Simulating Human Dialogue
A simple screening agent with CrewAI
MCP for Dummies
Upgrade your notes with AI research
🧠 The Illusion of Thinking: Apple’s Deep Dive into AI’s Reasoning Limits
💡 From Idea to Post: Meet the AI Agent That Writes Linkedin post for You💡
Revolutionizing AI with Retrieval-Augmented Generation (RAG): Architectures, Workflows, and Practical Applications
Anthropic analytics query
50 Days of Building a Small Language Model from Scratch
Important LLM Papers for the Week From 12/05 to 18/05
Try DeepWiki MCP Server with MCP Clients
MetaCene Launches World’s First GameFi On HyperEVM With LLM Trained By Unlocked GPU Infrastructure
Join Fiverr’s Ultimate Easter Egg Challenge & Win $70K+ in Bitcoin!
Synonymic Query Expansion for Smarter Search
Automated Web Vulnerability Exploitation with AI
Zero-Shot Prompting: The Cleanest Trick in Prompt Engineering
How to Get Started with AI Agents: A Beginner’s Guide
A little Rust proxy for Ollama
MCP Servers: Plugging AI into Your Developer Toolkit
Building an Automated Notes Publishing Pipeline at Zero Cost
How to Deploy a LLM Locally and Make It Accessible from the Internet
You’re Not Coding Alone Anymore: Coding in the Age of Agents
Chain of Draft (COD) Prompting
Handling rate limits of OpenAI models in Java using Guava, JTokkit
The Rise of AI and the Need for SVG to PNG Conversion
Bridging LLMs and External Data: An Introduction to the Model Context Protocol (MCP)— Part 1
Understanding Language Models: A Beginner-Friendly Introduction
LCM vs. LLM
How to Run Gemma 2: The Next-Generation LLM
Gem-Assist: Your Command-Line Personal Assistant
Semantic search alone won’t solve relational queries in your LLM retrieval pipeline.
Build Your Talking Voice AI Assistant Locally: Memory-Retaining Chatbot with Streamlit UI…
Create agentic systems by just describing what you want.
Why Advanced LLMs, Such as GPT-4 or Claude, Fail in Critical Use Cases Despite Large Training Data
How to Build RAG Agents in the OpenAI’s Swarm Framework
Are LLMs Still Lost in the Middle?
Meta LayerSkip Llama3.2 1B: Achieving Fast LLM Inference with Self-Speculative Decoding locally
Unleashing the Power of AI: Running Large Language Models on Your Own Cloud Server (Digital Ocean)
Claude 3.5 — The King of Document Intelligence
New Feature for caching LLM response with redis instance
A Comprehensive guide to Ollama (Tutorial)
How Ragie Outperformed the FinanceBench Test
Comparing 6 Frameworks for Rule-based PDF parsing
How do LLMs like GPT Generate Human-Like Text?
Anomaly Detection Using Machine Learning
Prove it is feasible with Google AI Studio
Scaling Enterprise Mobile Apps with LLMs: Automating Development, Enhancing User Experience and Driving Insights
How to use Llama3.2 to write daily logs in Notion based on your screen
LLMs are NOT the product
Few Prompt Engineering Tricks
Build Your Own Language Model: A Simple Guide with Python and NumPy
Parallel Chains in LangChain
OpenAI Swarm: A Lightweight Framework for Multi-Agent Orchestration
AgentKit, A Lightweight Multi-Agent Framework for Creating Complex Apps
🚀 Building a Smart Cisco Webex Bot: Harnessing LangGraph’s Stateful LLM Agents for AI-Powered Assistance 🤖✨
How to Build Smarter AI Apps and Reduce Hallucinations with RAG
Be Real: How to Use AI Writing Tools and Stay Authentic
Build an AI Discord Bot in Rust with Rig: A Step-by-Step Guide
Prompt Engineering
How to Implement Function Calling for the Tiny LLaMA 3.2 1B Model
A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT
New eng ep. on Building Jam! How we’re building AI features 🎧
Hexabot Setup & Visual Editor Tutorial: Build Your First AI Chatbot
Positional Encoding: Adding Sequence Awareness to Transformers
Llama 3.2 Vision Model Tutorial: Build Vision Apps, Multimodal Agents
Data factory for generative video models
Complete Generative AI Glossary with examples
Denoising technique: Remove Watermarks from Document Images: A Step-by-Step Guide Using OpenCV and…
Monitoring LLM Inference Endpoints with Wallaroo LLM Listeners
Data factory for LLM video models
Intro to Ollama: Insights and Reflections
Unleashing LLM Inference Power: A Comprehensive Guide to the Best NVIDIA GPUs
Llama-Omni: The AI That Talks Back “Speech to Speech LLM”
Using Instructor for Structured Data Output in AI Agents
Detailed Introduction to Word Embedding
Transformers: The Engine Powering ChatGPT and Beyond
Using LLMs to reduce sign-up & onboarding friction
Search And (LLM) Transform
DialectMorph – A CLI Tool To Transpile Code
Just now, OpenAI released the o1 large model!
Deploy Your LLM on AWS EC2
Tau LLM Series: Enhancements and Debugging | part 18, 19
OpenAI o1 Release is so Reminiscent of Apple Events – it’s an Incremental Update
Building an LLM-Powered Knowledge Curation System
Mastering Prompt Engineering for Generative AI: A Simple Guide
GitHub Copilot Security and Privacy Concerns: Understanding the Risks and Best Practices
Freeloading gpt-4o, llama, and many more llms on Github Model in AutoGen.Net
Building Your Own ChatGPT with Multimodal Data on a GPU Platform
Key differences in GPT3.5 VS GPT4.0
RAGEval: Scenario-specific RAG evaluation dataset generation framework
Use This Trick To Easily Integrate GenAI In Your Websites with RAG-as-a-Service