Latest release of kobold.cpp adds tts voice cloning support via OuteTTS, updates multimodal vision mmproj projectors for Qwen2.5 VL
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
SpatialLM, a 1B model capable of spatial identification, using 3d point cloud data. The video demo is amazing.