

#bitnet

Anything that helps reduce the environmental impacts of LLMs is a good thing.
bitnet.cpp is the official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels that support fast and lossless inference of 1.58-bit models on CPU (with NPU and GPU support coming next).

The first release of bitnet.cpp supports inference on CPUs. bitnet.cpp achieves speedups of 1.37x to 5.07x on ARM CPUs, with larger models experiencing greater performance gains. Additionally, it reduces energy consumption by 55.4% to 70.0%, further boosting overall efficiency. On x86 CPUs, speedups range from 2.37x to 6.17x, with energy reductions between 71.9% and 82.2%. Furthermore, bitnet.cpp can run a 100B BitNet b1.58 model on a single CPU, achieving speeds comparable to human reading (5-7 tokens per second), significantly enhancing the potential for running LLMs on local devices.
https://github.com/microsoft/BitNet #BitNet
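For a rough sense of what those figures mean, here is a back-of-the-envelope sketch in Python. It assumes that "1.58-bit" refers to ternary weights (each weight is -1, 0, or 1, so it carries log2(3) bits), and it ignores packing overhead, activations, and everything else a real runtime needs. The function names and the 128-token reply length are hypothetical, chosen only for illustration.

```python
import math

# Assumption: a "1.58-bit" weight is ternary, i.e. one of three values.
TERNARY_VALUES = (-1, 0, 1)
BITS_PER_WEIGHT = math.log2(len(TERNARY_VALUES))  # about 1.585


def ideal_weight_size_gb(num_params: float, bits_per_weight: float) -> float:
    """Idealized weight storage in GB (10**9 bytes), ignoring packing overhead."""
    return num_params * bits_per_weight / 8 / 1e9


def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a steady generation rate."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    params = 100e9  # the 100B model mentioned in the post
    print(f"bits per ternary weight: {BITS_PER_WEIGHT:.3f}")
    print(f"ternary weights: {ideal_weight_size_gb(params, BITS_PER_WEIGHT):.1f} GB")
    print(f"16-bit weights:  {ideal_weight_size_gb(params, 16):.1f} GB")
    # Hypothetical 128-token reply at the quoted 5-7 tokens per second.
    for rate in (5, 7):
        print(f"128 tokens at {rate} tok/s: {seconds_to_generate(128, rate):.1f} s")
```

Under these assumptions the weights of a 100B model fit in roughly a tenth of the space that 16-bit weights would need, which is what makes a single-CPU run plausible at all.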

Was looking at the source of a very early arXiv paper (arxiv.org/abs/hep-ph/9210243). The PDF is unavailable, for reasons that are obscure ("pre-1996 submission which cannot be processed"). But there's a lot of history in the source code: it looks like it was submitted as a single file, emailed from BITNET to the arXiv via a gateway. It also uses a now-obscure TeX package, phyzzx (ctan.org/tex-archive/obsolete/).

I know I'll sound like a young person when I say this, but I'd love to know how that worked in practice and what it was like to be in academia before everyone had access to a TCP/IP internet connection but after internetworked computers were ubiquitous. Sort of like the TV series Halt and Catch Fire, but with physicists.

[bitnet HF1BitLLM/Llama3-8B-1.58-100B-tokens -n 128 -t 0]

What is a llm?
Answer: A llm is a type of essay that is written in the form of a question. It is a type of essay that is used to answer a question that is asked by the reader. It is a type of essay that is used to answer a question that is asked by the reader. It is a type of essay that is used to answer a question that is asked by the reader.

Surprisingly fast on CPU but not yet there: github.com/microsoft/BitNet?ta #llm #bitnet


Before I could access the Internet, I was on BITNET. For a system based on remote job entry, it was surprisingly useful and fun! LISTSERV originated there, and most communication was non-interactive. There was still a way to do messaging, and there was even a chat network that inspired IRC. The limitations of BITNET resulted in a lot of creative solutions.
#bitnet #listserv