Meta Movie Gen – it seems to be the biggest news for this weekend – there are already tons of sample creations. But also other things in movie gen world which happened…
Author: adrian
AI News
News that caught my attention: Meta published Llama Stack: From GitHub: The Llama Stack defines and standardizes the building blocks needed to bring generative AI applications to market. These blocks span the…
Small package of AI News
HuggingFace crossed now 1.000.000 public models. Mystic v2 is out – on X there are plenty examples right now of that upscaler (4k and 8k images). Molmo was released, this is direct…
WIP: LLMs
Ollama is my preferred choice, but here I want to gather the alternatives I’ve found. mlx-lm Repo: https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/README.md it’s part of MLX:MLX is an array framework for machine learning research on Apple…
Llama 3.2 released
New models are in versions 1B, 3B, 11B or 90B. The smallest ones are described as: Use our 1B or 3B models for on device applications such as summarizing a discussion from…
Moshi Foundation Model for Speech-Text Processing
If you’re looking for an alternative to the Whisper stack, one option worth considering is Moshi. I’ve originally found it mentioned on X: Summary: Key Features: Mimi Codec: Training and Evaluation:
WIP: Quarkus
I started to learn something different from Spring Boot, which I use daily in my work. My choice is Quarkus – here I will gather some useful things. Long time I was…
ColPali and Byaldi for reading PDFs with images, Reflection-70B
Multi-modal documents were always a problem, but what I can read now AI developers already made a huge progress, beating traditional PDF parsers. Now we have at least 3 solutions available as…
AI programming tools
Claude, Cursor or Replit agents? It seems like the market of application creation AI tools is growing fast. After a very good Claude Sonnet application creation chat, developers moved to Cursor and…
WIP: Hugging Face cheatsheet part 1
As prerequisite we need to have Python, pip, venv installed, for Git there is a need for lfs, because Hugging Face supports files larger than 5GB in repos. To work with with…