fosstodon.org is one of the many independent Mastodon servers you can use to participate in the fediverse.
Fosstodon is an invite only Mastodon instance that is open to those who are interested in technology; particularly free & open source software. If you wish to join, contact us for an invite.

Administered by:

Server stats:

9.9K
active users

#mamba

0 posts0 participants0 posts today

― Очень трепетно отношусь к своему времени и уважаю ваше. Не ответила — не судьба (может, я вам психику сохранила — ищите плюсы!).
― Что я ищу от сайта — без понятия, буду импровизировать.
― Интересны получающие удовольствие от жизни.

Вас много, холопы, а она одна, королева!

Сайты знакомств — ядрёный рассадник мужененавистничества, лишающий веры в человечество. Женщины могут открыто и безнаказанно не уважать мужчин, почувствовав себя Мисс мира, не выходя из хрущёвки.

― Мы пара МЖ. Мужчины-одиночки, [мы] отвечаем ТОЛЬКО на конкретные предложения об организации встречи! На «Привет. Встретимся? Познакомимся? Как дела?» не отвечаем.

И #продающеефото. И некая «поддержка» в интересах (с прочим досугом типа бани, ресторанов, шашлыков. нудизма 🤯). Заплати куколду за то, чтобы трахнуть (не) его тёлку? Маразм.

И хоть куколды должны страдать (раз не умеют создавать гетеросексуальные моногамные отношения), это нельзя популяризировать; но... 🙈

👋 We are #opensource developers, maintainers and consultants working on scientific and data-heavy software!

We are in part, or fully responsible for critical tools used by millions for research and tech across industries, like #Jupyter, #condaforge, #Mamba, or #ApacheArrow. We help navigate the complex data stack, use these tools efficiently, or develop and upstream contributions.

🚀 Follow us for light tech posts, opinions, links to deeper content as well as company updates.

Replied in thread

@python_discussions An absolutely great blog post about the pain of environment management in #python.

It summaries the experiences I made over the years: Breaking my system python environment with plain #pip, moving to #conda for easy #CUDA installation, frustrated by its slow dependency solution and moving to #mamba; starting to like #poetry for its dependency locking but also annoyed by its undocumented C/ C++ Extensions support.
I thought about giving #pixi a try, maybe next time ...

Continued thread

🛠️ Don't know what rattler-build is yet?

📦 Cross-platform relocatable binaries/packages
📝 Simple recipe format inspired by conda-build & boa
🚀 Packages ready for #mamba, #conda, or #pixi

🔗 Bonus: rattler-build is a standalone tool with no dependencies on conda-build or Python! 🙌

Oh, it is hard to keep up with Mistral. Beside NeMo 12B, they also released two other models:
* mistral.ai/news/mathstral/
* mistral.ai/news/codestral-mamb

Codestral Mamba is exciting. I was not aware that they were also experimenting with Mamba2 models.

My understanding is that the Mamba architecture (arxiv.org/abs/2312.00752) is especially promising for problem domains that require large context lengths, so using it for programming makes sense. A fundamental difference compared to transformers (as introduced 2017 in the famous "Attention Is All You Need" paper) is that it does not have attention. Thus, it avoids the quadratic complexity that typically comes it. Another difference is that it is again a recurrent model; so, it is more like the once popular LSTM that fell out of fashion after the success of transformers.

Here is a longer interview with Albert Gu, one of the researchers behind Mamba, where he shares insights on his work on Mamba and Mamba2 and his design philosophy:

youtube.com/watch?v=1zjMalKLHi

mistral.ai · MathΣtralAs a tribute to Archimedes, whose 2311th anniversary we're celebrating this year, we are proud to release our first Mathstral model, a specific 7B model designed for math reasoning and scientific discovery. The model has a 32k context window published under the Apache 2.0 license.
#mistral#mamba#llm
Continued thread

Finally the "#mamba for audio" papers are coming, appearing on arxiv. Surprisingly, they're based on spectrogram patches rather than raw audio. Why, when Mamba is for sequences? Probably because "spectrogram patch embeddings" are the best feature representation of the moment, and progressively they'll be replaced (?) as the early feature-extraction step.

🧠 Durante l'AI Festival ho parlato di #Mamba, una tecnologia rivoluzionaria in grado di superare i limiti di lunghezza delle sequenze e di efficienza computazionale. 
🚀 AI21labs ha creato #Jamba: il primo modello in produzione che combina Mamba con l'architettura dei Transformer tradizionali. 
💡 Risultato: un sistema che ha la capacità di selezionare e "ricordare" informazioni importanti da lunghe sequenze, con una maggior capacità di comprensione del contesto.