A quite nice application for working with files and LLMs: Repo Prompt. I have run two tests so far, and I’m sold. The tests were, of course, super simple apps, but I found…
Author: adrian
BlenderGPT, MoneyPrinterV2
This is the next area where AI aids designers – 3D modeling. This page offers free access to create meshes from text or images. Additionally, there are models available on Hugging Face…
Llama 3.3, Ollama structured output
Llama 3.3 70B offers performance similar to the Llama 3.1 405B model. It also requires less VRAM, e.g. it works well on an M4 with 64GB at a speed of 10 tokens/s. https://ollama.com/library/llama3.3 Right…
Chat from LangChain
This is a significantly different RAG from the one we used to think of: https://chat.langchain.com/. Take a look at the sample questions and observe how agents from LangChain and LangGraph use them to construct responses….
Genie 2, DeepThought-8B, snowflake-arctic-embed2
Google has recently released Genie 2, their most advanced large-scale foundation model, capable of generating consistent and playable worlds lasting up to a minute. This advancement could be particularly beneficial for game developers, but…
OpenWebUI + Ollama
Models: this section keeps information about existing models and lets us modify them by adding functions, tools, system prompts, etc., or even create custom models based on existing LLMs like Llama 3.x. It’s worth…
X source of AI/ML news
Here, we have a wealth of information about what transpired last week in the highly technical world of AI. This week, you can delve into topics such as: GPT-4o System Card, Can…
Apple Intelligence – EU region
Apple Intelligence is already available in EU regions – if you have English (US) set as your language, you can try the Apple Intelligence Beta. I have given it a shot, but it didn’t…
Stable Diffusion 3.5
Stability AI released a new model, SD 3.5, in two versions for now: Large and Large Turbo. Medium will be released at the end of October. Stable Diffusion 3.5 Large: At 8 billion…
Llama-3.1-Nemotron-70B-Instruct-HF
A new model released by Nvidia is available at https://huggingface.co/chat/ and is said to perform better than GPT-4o or Claude 3.5 Sonnet (https://github.com/lmarena/arena-hard-auto). It is also available in Ollama. And it counts the letter R in strawberry…