Fosstodon @fosstodon

Recent searches

Search options

Only available when logged in.

29 posts22 participants1 post today

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 8h

Curated Hacker News @CuratedHackerNews@mastodon.social

Inside ArXiv

https://www.wired.com/story/inside-arxiv-most-transformative-code-science/

WIRED · Mar 27Inside arXiv—the Most Transformative Platform in All of ScienceBy Sheon Han

#arxiv #science

**Gea-Suan Lin** @gslin@abpe.org · 17h

17h

Gea-Suan Lin @gslin@abpe.org

arXiv 要搬到 GCP 上

在「arXiv moving from Cornell servers to Google Cloud (arxiv.org)」這邊看到 arXiv 搬到 GCP 的消息，是出自他們的徵才頁面：「Careers at arXiv - arXiv info」。 We are already underway on the arXiv CE ("Cloud Edition") project. This is a project to re-home all arXiv services from VMs at Cornell to a cloud provider (Google Cloud). 不過看 Hacker News 上的 comment，似乎是受到 Trump 政府對大學資金政策的影響，這些職缺目…

https://blog.gslin.org/archives/2025/04/21/12359/arxiv-%e8%a6%81%e6%90%ac%e5%88%b0-gcp-%e4%b8%8a/

Gea-Suan Lin's BLOG · 17harXiv 要搬到 GCP 上在「arXiv moving from Cornell servers to Google Cloud (arxiv.org)」這邊看到 arXiv 搬到 GCP 的消息，是出自他們的徵才頁面：「Careers at arXiv - arXiv info」。 We are already underway on the arXiv CE (Cloud Edition) project.

#arxiv #cloud #cornell

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 18h

18h

Curated Hacker News @CuratedHackerNews@mastodon.social

Pushing the Limits of LLM Quantization via the Linearity Theorem

https://arxiv.org/abs/2411.17525

arXiv.orgPushing the Limits of Large Language Model Quantization via the Linearity TheoremQuantizing large language models has become a standard way to reduce their memory and computational costs. Typically, existing methods focus on breaking down the problem into individual layer-wise sub-problems, and minimizing per-layer error, measured via various metrics. Yet, this approach currently lacks theoretical justification and the metrics employed may be sub-optimal. In this paper, we present a "linearity theorem" establishing a direct relationship between the layer-wise $\ell_2$ reconstruction error and the model perplexity increase due to quantization. This insight enables two novel applications: (1) a simple data-free LLM quantization method using Hadamard rotations and MSE-optimal grids, dubbed HIGGS, which outperforms all prior data-free approaches such as the extremely popular NF4 quantized format, and (2) an optimal solution to the problem of finding non-uniform per-layer quantization levels which match a given compression constraint in the medium-bitwidth regime, obtained by reduction to dynamic programming. On the practical side, we demonstrate improved accuracy-compression trade-offs on Llama-3.1 and 3.2-family models, as well as on Qwen-family models. Further, we show that our method can be efficiently supported in terms of GPU kernels at various batch sizes, advancing both data-free and non-uniform quantization for LLMs.

#arxiv #llm

**MottG** @mottg@researchbuzz.masto.host · 22h

22h

MottG @mottg@researchbuzz.masto.host

Sci Scope is a service that notifies you about just published arxiv research papers in many areas of computer science. Some of the features are free and others have a nominal monthly fee.
The weekly newsletter notifying you about new arxiv computer science papers (broken out by subject cluster) is free.

#research #arxiv #computerScience
#science #SciScope

A list of some of the features of Sci Scope.

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 1d

Curated Hacker News @CuratedHackerNews@mastodon.social

Eccfrog512ck2: An Enhanced 512-Bit Weierstrass Elliptic Curve [pdf]

https://arxiv.org/abs/2504.09584

arXiv.orgEccfrog512ck2: An Enhanced 512-bit Weierstrass Elliptic CurveWhilst many key exchange and digital signature methods use the NIST P256 (secp256r1) and secp256k1 curves, there is often a demand for increased security. With these curves, we have a 128-bit security. These security levels can be increased to 256-bit security with NIST P-521 Curve 448 and Brainpool-P512. This paper outlines a new curve - Eccfrog512ck2 - and which provides 256-bit security and enhanced performance over NIST P-521. Along with this, it has side-channel resistance and is designed to avoid weaknesses such as related to the MOV attack. It shows that Eccfrog512ck2 can have a 61.5% speed-up on scalar multiplication and a 33.3% speed-up on point generation over the NIST P-521 curve.

#arxiv

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 1d

Curated Hacker News @CuratedHackerNews@mastodon.social

Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents

https://arxiv.org/abs/2502.15840

arXiv.orgVending-Bench: A Benchmark for Long-Term Coherence of Autonomous AgentsWhile Large Language Models (LLMs) can exhibit impressive proficiency in isolated, short-term tasks, they often fail to maintain coherent performance over longer time horizons. In this paper, we present Vending-Bench, a simulated environment designed to specifically test an LLM-based agent's ability to manage a straightforward, long-running business scenario: operating a vending machine. Agents must balance inventories, place orders, set prices, and handle daily fees - tasks that are each simple but collectively, over long horizons (>20M tokens per run) stress an LLM's capacity for sustained, coherent decision-making. Our experiments reveal high variance in performance across multiple LLMs: Claude 3.5 Sonnet and o3-mini manage the machine well in most runs and turn a profit, but all models have runs that derail, either through misinterpreting delivery schedules, forgetting orders, or descending into tangential "meltdown" loops from which they rarely recover. We find no clear correlation between failures and the point at which the model's context window becomes full, suggesting that these breakdowns do not stem from memory limits. Apart from highlighting the high variance in performance over long time horizons, Vending-Bench also tests models' ability to acquire capital, a necessity in many hypothetical dangerous AI scenarios. We hope the benchmark can help in preparing for the advent of stronger AI systems.

#arxiv

**Vladimir Savić** @firusvg@mastodon.social · 1d

Vladimir Savić @firusvg@mastodon.social

Inside #arXiv - the most transformative platform in all of #science https://www.wired.com/story/inside-arxiv-most-transformative-code-science/ #bigstory

WIRED · Mar 27Inside arXiv—the Most Transformative Platform in All of ScienceBy Sheon Han

**N-gated Hacker News** @ngate@mastodon.social · 1d

N-gated Hacker News @ngate@mastodon.social

ArXiv: The magical dumpster where scientists toss their #unreviewed #papers and hope for the best! Apparently, without it, #science would crumble into oblivion. Who knew?
https://www.wired.com/story/inside-arxiv-most-transformative-code-science/ #ArXiv #Research #Community #Humor #HackerNews #ngated

WIRED · Mar 27Inside arXiv—the Most Transformative Platform in All of ScienceBy Sheon Han

**Hacker News** @h4ckernews@mastodon.social · 1d

Hacker News @h4ckernews@mastodon.social

Inside ArXiv

https://www.wired.com/story/inside-arxiv-most-transformative-code-science/

WIRED · Mar 27Inside arXiv—the Most Transformative Platform in All of ScienceBy Sheon Han

#HackerNews #InsideArXiv #ArXiv

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 1d

Curated Hacker News @CuratedHackerNews@mastodon.social

Inside ArXiv

https://www.wired.com/story/inside-arxiv-most-transformative-code-science/

WIRED · Mar 27Inside arXiv—the Most Transformative Platform in All of ScienceBy Sheon Han

#arxiv #science

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 1d

Curated Hacker News @CuratedHackerNews@mastodon.social

Inferring the Phylogeny of Large Language Models

https://arxiv.org/abs/2404.04671

arXiv.orgPhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in BenchmarksThis paper introduces PhyloLM, a method adapting phylogenetic algorithms to Large Language Models (LLMs) to explore whether and how they relate to each other and to predict their performance characteristics. Our method calculates a phylogenetic distance metrics based on the similarity of LLMs' output. The resulting metric is then used to construct dendrograms, which satisfactorily capture known relationships across a set of 111 open-source and 45 closed models. Furthermore, our phylogenetic distance predicts performance in standard benchmarks, thus demonstrating its functional validity and paving the way for a time and cost-effective estimation of LLM capabilities. To sum up, by translating population genetic concepts to machine learning, we propose and validate a tool to evaluate LLM development, relationships and capabilities, even in the absence of transparent training information.

#arxiv

**Winbuzzer** @winbuzzer@mastodon.social · 2d

Winbuzzer @winbuzzer@mastodon.social

arXiv Is Swapping Cornell University Servers for Google Cloud in Modernization Push

#arXiv #GoogleCloud #GCP #Cornell #OpenAccess #Science #Research #CloudMigration #ScholarlyPublishing #Preprints

https://winbuzzer.com/2025/04/18/arxiv-is-swapping-cornell-university-servers-for-google-cloud-in-modernization-push-xcxwbn/

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 2d

Curated Hacker News @CuratedHackerNews@mastodon.social

SDFs from Unoriented Point Clouds Using Neural Variational Heat Distances

https://arxiv.org/abs/2504.11212

arXiv.orgSDFs from Unoriented Point Clouds using Neural Variational Heat DistancesWe propose a novel variational approach for computing neural Signed Distance Fields (SDF) from unoriented point clouds. To this end, we replace the commonly used eikonal equation with the heat method, carrying over to the neural domain what has long been standard practice for computing distances on discrete surfaces. This yields two convex optimization problems for whose solution we employ neural networks: We first compute a neural approximation of the gradients of the unsigned distance field through a small time step of heat flow with weighted point cloud densities as initial data. Then we use it to compute a neural approximation of the SDF. We prove that the underlying variational problems are well-posed. Through numerical experiments, we demonstrate that our method provides state-of-the-art surface reconstruction and consistent SDF gradients. Furthermore, we show in a proof-of-concept that it is accurate enough for solving a PDE on the zero-level set.

#arxiv

3d

David @npub1cfhh50298407nqc9pf2ahdn5dcxuxkzhpextg07rrv49tzsyzz5sq7khav@momostr.pink

arXiv moving from Cornell servers to Google Cloud

https://news.ycombinator.com/item?id=43726640

news.ycombinator.comarXiv moving from Cornell servers to Google Cloud | Hacker News

#arXiv #Academia #Research

**Curated Hacker News** @CuratedHackerNews@mastodon.social · 3d

Curated Hacker News @CuratedHackerNews@mastodon.social

arXiv moving from Cornell servers to Google Cloud

https://info.arxiv.org/hiring/index.html

info.arxiv.orgCareers at arXiv - arXiv info

#arxiv #google #hiring

**Wladimir Mufty** @wlaatje@social.edu.nl · 3d *

3d *

Wladimir Mufty @wlaatje@social.edu.nl

Sometimes you get worried by reading a vacant position…

#arXiv moving services from Cornell to Google…

https://info.arxiv.org/hiring/index.html

info.arxiv.orgCareers at arXiv - arXiv info

**N-gated Hacker News** @ngate@mastodon.social · 3d

N-gated Hacker News @ngate@mastodon.social

Ah, the esteemed #arXiv, the digital throne of academic pretense, now hitching a ride on Google Cloud! Because nothing says "independent research repository" like moving into Big Tech's basement. Who knew "open access" meant "open to Google's servers"?
https://info.arxiv.org/hiring/index.html #GoogleCloud #openaccess #BigTech #academicresearch #HackerNews #ngated