Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. It also requires less VRAM; for example, it works well on an M4 with 64GB at a speed of 10 tokens/s. https://ollama.com/library/llama3.3 Right…
Author: adrian
Chat from LangChain
This is significantly different from the RAG we used to think of: https://chat.langchain.com/. Take a look at the sample questions and observe how agents from LangChain and LangGraph use them to construct responses…
Genie 2, DeepThought-8B, snowflake-arctic-embed2
Google has recently released Genie 2, its most advanced large-scale foundation model, capable of generating consistent and playable worlds for up to a minute. This advancement could be particularly beneficial for game developers, but…
OpenWebUI + Ollama
Models: Models keep information about existing models and allow us to modify them by adding functions, tools, system prompts, etc., or even to create custom models on top of existing LLMs like Llama 3.x. It’s worth…
X source of AI/ML news
Here, we have a wealth of information about what transpired last week in the highly technical world of AI. This week, you can delve into topics such as: GPT-4o System Card, Can…
Apple Intelligence – EU region
Apple Intelligence is already available in EU regions: if your language is set to English (US), you can try the Apple Intelligence Beta. I have given it a shot, but it didn’t…
Stable Diffusion 3.5
Stability AI has released a new model, SD 3.5, in two versions for now: Large and Large Turbo. Medium will be released at the end of October. Stable Diffusion 3.5 Large: At 8 billion…
Llama-3.1-Nemotron-70B-Instruct-HF
A new model released by Nvidia is available here: https://huggingface.co/chat/ It is said to perform better than GPT-4o or Claude 3.5 Sonnet (https://github.com/lmarena/arena-hard-auto). It is also available in Ollama: And it counts the R letters in strawberry…
Pinokio, Ichigo
Interesting software for running and testing AI tools locally: https://pinokio.computer/ I have not installed it yet, but the number of systems it has gathered together is impressive. From ComfyUI to Voice Cloning, and probably…
Spring Framework 7.0 announcement, Gradio 5
I completely missed the announcement of Spring Framework 7.0 (https://spring.io/blog/2024/10/01/from-spring-framework-6-2-to-7-0), which will happen at the beginning of next year. The decision was made to stick with Java 17, though most comments…