AutoXiv

Read what ships this week.

260421.0038
MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval
Alshammari · Wen · Zainal +5
MathNet is a large-scale, multilingual dataset of 30,676 Olympiad-level math problems from 47 countries spanning two decades, designed to benchmark both mathematical reasoning in generative models and mathematical retrieval in embedding systems. The benchmark reveals that even state-of-the-art models struggle with these problems, with top models achieving only 78.4% accuracy, and that retrieval quality significantly impacts retrieval-augmented generation performance.
Formal Sciences
260421.0039
Sessa: Selective State Space Attention
Horbatko
Sessa is a new sequence model that places attention inside a recurrent feedback path, enabling power-law memory decay rather than the exponential forgetting of pure recurrence or the 1/length dilution of pure attention. This architecture achieves superior long-context performance while remaining competitive on short sequences.
Formal Sciences
260421.0040
Bounded Ratio Reinforcement Learning
Ao · Chen · Lee +5
This paper introduces Bounded Ratio Reinforcement Learning (BRRL), a theoretical framework that bridges the gap between trust region methods and PPO's clipped objective, leading to a new algorithm called Bounded Policy Optimization (BPO) that provides monotonic improvement guarantees while matching or exceeding PPO's performance. The framework also extends to Group-relative BPO (GBPO) for large language model fine-tuning.
Formal Sciences
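For context, the clipped surrogate that BPO generalizes is standard PPO machinery. A minimal NumPy sketch of that baseline objective (the specific BRRL bound is not given in the summary, so only the PPO side is shown):

```python
import numpy as np

def ppo_clipped_objective(ratio, advantage, eps=0.2):
    """PPO's clipped surrogate: the policy ratio r = pi_new / pi_old is
    clipped to [1 - eps, 1 + eps] so a single update cannot move the
    policy too far from the one that collected the data."""
    unclipped = ratio * advantage
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps) * advantage
    # Pessimistic bound: take the smaller of the two, then average.
    return np.minimum(unclipped, clipped).mean()

ratio = np.array([0.5, 1.0, 1.5])
advantage = np.array([1.0, -1.0, 2.0])
objective = ppo_clipped_objective(ratio, advantage)
```

BRRL's contribution, per the summary, is a principled bound on this ratio that recovers trust-region-style monotonic improvement guarantees.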
260421.0041
When Can LLMs Learn to Reason with Weak Supervision?
Rahman · Shen · Mordvina +3
This paper investigates when reinforcement learning with verifiable rewards (RLVR) enables large language models to generalize under weak supervision (scarce data, noisy rewards, or self-supervised signals). The key finding is that models generalize when they exhibit prolonged pre-saturation training dynamics, which is predicted by reasoning faithfulness—the degree to which intermediate reasoning steps logically support final answers.
Formal Sciences
260421.0042
Back into Plato's Cave: Examining Cross-modal Representational Convergence at Scale
Koepke · Zverev · Ginosar +1
This paper challenges the Platonic Representation Hypothesis by showing that apparent alignment between vision and language models is an artifact of small-scale evaluation. When tested at scale with millions of samples and realistic many-to-many settings, cross-modal alignment degrades substantially, suggesting different modalities learn different representations of reality.
Formal Sciences
260421.0043
Revisiting Active Sequential Prediction-Powered Mean Estimation
Sfyraki · Wang
This paper analyzes active sequential prediction-powered mean estimation, where labels are selectively queried and ML predictions fill in the gaps. The authors find that contrary to intuition, using a nearly constant query probability (ignoring uncertainty) often produces tighter confidence intervals than adaptive uncertainty-based querying.
Formal Sciences
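For readers unfamiliar with the setup, the prediction-powered mean estimator with a constant query probability can be sketched on synthetic data (variable names and numbers are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def ppi_mean(preds, labels, queried):
    """Prediction-powered mean: average the model's predictions over all
    points, then correct the bias using the randomly queried labels."""
    rectifier = (labels[queried] - preds[queried]).mean()
    return preds.mean() + rectifier

n = 10_000
labels = rng.normal(loc=2.0, scale=1.0, size=n)            # true values
preds = labels + rng.normal(loc=0.3, scale=0.5, size=n)    # biased predictions
queried = rng.random(n) < 0.1   # constant (non-adaptive) query probability
estimate = ppi_mean(preds, labels, queried)
```

The paper's finding is about this querying rule: making the query probability depend on per-point uncertainty often does not tighten the resulting confidence intervals over the constant-probability baseline above.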
260421.0044
Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering
Gupta · Kumar
LPSR is an inference-time error correction method that monitors internal model activations to detect reasoning mistakes, then rolls back generation and steers the model using cached corrections—no training required. It improves an 8B model's math accuracy from 28.8% to 44.0%, outperforming prompted self-correction and even larger 70B models.
Formal Sciences
260421.0045
Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion
Leitch
This paper benchmarks cloud and local large language models on two System Dynamics tasks: extracting causal loop diagrams and providing interactive coaching. The best local models match mid-tier cloud performance on diagram extraction (77%) but struggle with long-context error-fixing tasks, with backend implementation choices mattering more than quantization levels.
Formal Sciences
260421.0046
GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling
Dadgarnia · Tabesh · Nikdan +3
GSQ is a new scalar quantization method for large language models that uses Gumbel-Softmax relaxation to jointly optimize grid assignments and scales, achieving accuracy comparable to complex vector quantization methods while remaining compatible with existing inference kernels. It successfully quantizes models to 2-3 bits per parameter and scales to trillion-parameter mixture-of-experts models.
Formal Sciences
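The core Gumbel-Softmax relaxation can be sketched as a soft assignment of weights to grid points (a toy sketch; GSQ's joint optimization of scales and its inference-time details are not in the summary):

```python
import numpy as np

def gumbel_softmax_assign(weights, grid, tau=0.5, rng=None):
    """Soft, differentiable assignment of each scalar weight to a point on
    a quantization grid: Gumbel noise plus a temperature-scaled softmax
    over negative distances relaxes the discrete argmin."""
    rng = rng or np.random.default_rng(0)
    logits = -np.abs(weights[:, None] - grid[None, :])
    gumbel = -np.log(-np.log(rng.random(logits.shape)))
    scores = (logits + gumbel) / tau
    soft = np.exp(scores - scores.max(axis=1, keepdims=True))
    soft /= soft.sum(axis=1, keepdims=True)
    # Soft quantized value = expectation over grid points; at inference
    # one would take the hard argmax instead.
    return soft @ grid

grid = np.array([-1.0, 0.0, 1.0])        # a 3-level scalar grid
weights = np.array([-0.9, 0.1, 0.8])
quantized = gumbel_softmax_assign(weights, grid)
```

Because assignments stay differentiable during training, grid placement and scales can be optimized jointly by gradient descent, while the final hard assignments remain plain scalar quantization compatible with existing kernels.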
260421.0047
A Note on TurboQuant and the Earlier DRIVE/EDEN Line of Work
Ben-Basat · Ben-Itzhak · Mendelson +3
This note demonstrates that TurboQuant, a recent quantization method, is a suboptimal special case of the earlier EDEN/DRIVE quantization schemes. EDEN consistently outperforms TurboQuant across all tested scenarios, often by more than one bit of precision.
Formal Sciences
260421.0048
Physics-Informed Neural Networks for Biological $2\mathrm{D}{+}t$ Reaction-Diffusion Systems
Lavery · Cochrane · Olesen +3
This paper extends biologically-informed neural networks (BINNs) from 1D to 2D spatial domains for learning reaction-diffusion equations from data, combining neural network training with symbolic regression to discover closed-form equations. The method is demonstrated on real lung cancer cell microscopy data, successfully recovering interpretable 2D+time reaction-diffusion models.
Formal Sciences
260421.0049
FUSE: Ensembling Verifiers with Zero Labeled Data
Lee · Ma · Zhao +4
FUSE is a method that combines multiple imperfect AI verifiers to better judge model outputs without needing any labeled training data. By using spectral algorithms to account for dependencies among the verifiers, it matches or beats semi-supervised methods across diverse benchmarks.
Formal Sciences
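A classical label-free ingredient behind such ensembles is spectral reliability estimation in the Dawid-Skene tradition. A sketch under a conditional-independence assumption (FUSE's actual treatment of dependent verifiers is more involved and not specified in the summary):

```python
import numpy as np

def spectral_verifier_weights(votes):
    """Label-free reliability estimation: if verifiers err independently
    given the truth, their agreement matrix is approximately rank-one off
    the diagonal, and the leading eigenvector is proportional to each
    verifier's accuracy advantage over chance."""
    v = 2.0 * votes - 1.0                  # map {0,1} votes to {-1,+1}
    agree = (v @ v.T) / votes.shape[1]     # verifier-by-verifier agreement
    np.fill_diagonal(agree, 0.0)           # drop trivial self-agreement
    eigvals, eigvecs = np.linalg.eigh(agree)
    return np.abs(eigvecs[:, -1])          # weights from the top eigenvector

rng = np.random.default_rng(0)
truth = rng.integers(0, 2, size=2000)
accuracy = np.array([0.9, 0.7, 0.55])      # three verifiers, varying quality
flips = rng.random((3, 2000)) > accuracy[:, None]
votes = np.where(flips, 1 - truth, truth).astype(float)
weights = spectral_verifier_weights(votes)
```

With no access to `truth`, the recovered weights still rank the strong verifiers above the near-chance one, which is the information an ensemble needs.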
260421.0050
Wasserstein Distributionally Robust Risk-Sensitive Estimation via Conditional Value-at-Risk
Taha · Bitar
This paper develops a distributionally robust estimation method that minimizes worst-case conditional value-at-risk (CVaR) of estimation error when the true distribution is uncertain but lies within a Wasserstein ball. The method can be computed via tractable semidefinite programming and outperforms existing approaches on electricity price forecasting.
Formal Sciences
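CVaR itself has a convenient variational form (Rockafellar-Uryasev) that underlies tractable formulations like the paper's semidefinite program. A quick empirical sketch:

```python
import numpy as np

def cvar(losses, alpha=0.1):
    """Empirical CVaR via the Rockafellar-Uryasev variational form:
    CVaR_alpha = min_t  t + E[(loss - t)_+] / alpha,
    i.e. the average of the worst alpha-fraction of losses."""
    t = np.quantile(losses, 1.0 - alpha)   # the value-at-risk attains the min
    return t + np.mean(np.maximum(losses - t, 0.0)) / alpha

losses = np.arange(10.0)                   # losses 0, 1, ..., 9
worst_tail_mean = cvar(losses, alpha=0.1)  # mean of the worst 10%
```

Taking the worst case of this quantity over a Wasserstein ball of distributions is what makes the estimator robust to distributional uncertainty.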
260421.0051
Duality for the Adversarial Total Variation
Bungert · Schmitt
This paper establishes a mathematical duality framework for adversarial total variation, showing that adversarial training of binary classifiers can be understood through nonlocal calculus of variations. The work provides rigorous characterizations of subdifferentials using dual representations and integration by parts formulas in both metric and Euclidean spaces.
Formal Sciences
260421.0052
IDOBE: Infectious Disease Outbreak forecasting Benchmark Ecosystem
Adiga · Chou · Chiranth +7
IDOBE is a standardized benchmark dataset containing over 10,000 infectious disease outbreak segments from a century of surveillance data across 13 diseases, designed to evaluate and compare epidemic forecasting methods. The authors test 11 baseline models and find MLP-based methods perform most robustly, with statistical methods excelling in pre-peak phases.
Formal Sciences
260421.0053
Learning the Riccati solution operator for time-varying LQR via Deep Operator Networks
Chen · Biccari · Wang
This paper uses Deep Operator Networks (DeepONets) to learn a surrogate for the Riccati differential equation solution operator in finite-horizon LQR problems, enabling fast approximate optimal control without repeated numerical integration. The approach includes theoretical guarantees on stability and performance, and demonstrates significant computational speedups while maintaining high accuracy.
Formal Sciences
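The DeepONet architecture — a branch net encoding the input function, a trunk net encoding the query coordinate, combined by an inner product — can be sketched minimally (illustrative one-layer networks with random weights, not the paper's trained model):

```python
import numpy as np

rng = np.random.default_rng(0)

def deeponet(u_samples, t, Wb, Wt):
    """Minimal DeepONet: the branch net encodes the input function from
    its samples at fixed sensor points, the trunk net encodes the query
    coordinate (here, time t), and their inner product is the prediction."""
    branch = np.tanh(u_samples @ Wb)           # latent basis coefficients
    trunk = np.tanh(np.array([t, 1.0]) @ Wt)   # latent basis functions at t
    return float(branch @ trunk)

Wb = rng.normal(size=(16, 8))    # 16 sensor points -> 8 latent dims
Wt = rng.normal(size=(2, 8))
u = np.sin(np.linspace(0.0, 1.0, 16))   # a sampled input function
prediction = deeponet(u, 0.5, Wb, Wt)
```

For the LQR setting, the branch input would encode the time-varying problem data and the trunk the query time, so one forward pass replaces a numerical integration of the Riccati equation.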
260421.0054
Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data
Liang · Zhou · Lu +3
This paper addresses a critical problem in reinforcement learning for large language models: when base models are already very accurate on training benchmarks, standard RL methods fail because there aren't enough errors to learn from, causing models to collapse into repetitive solutions. The authors propose CUTS, a novel sampling strategy that maintains solution diversity even when models are highly accurate, improving generalization on challenging out-of-domain math problems by up to 15.1%.
Formal Sciences
260421.0055
Barrier-enforced multi-objective optimization for direct point and sharp interval forecasting
Amnuaypongsa · Suparanonrat · Wanitchollakit +1
This paper presents a neural network framework that simultaneously generates point forecasts and prediction intervals for multi-step time series forecasting, using multi-objective optimization to automatically balance forecast accuracy and interval sharpness while guaranteeing non-crossing intervals and target coverage. The method eliminates manual hyperparameter tuning and demonstrates superior performance on solar irradiance forecasting compared to existing approaches.
Formal Sciences
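Two generic ingredients of such interval forecasters — a quantile (pinball) loss for the bounds and a parameterization that cannot produce crossing — can be sketched as follows (the paper's specific barrier-based multi-objective formulation is not detailed in the summary):

```python
import numpy as np

def pinball_loss(y, q_pred, tau):
    """Pinball (quantile) loss: minimized in expectation when q_pred is
    the tau-quantile of y."""
    diff = y - q_pred
    return np.mean(np.maximum(tau * diff, (tau - 1.0) * diff))

def noncrossing_interval(lower, raw_width):
    """Parameterize the interval as (lower, lower + nonnegative width) so
    the upper bound can never cross below the lower bound."""
    return lower, lower + np.maximum(raw_width, 0.0)

lo, hi = noncrossing_interval(np.array([0.5, 1.5, 2.5]),
                              np.array([1.0, -0.2, 1.0]))
```

Balancing the point-forecast loss against interval sharpness and coverage is the multi-objective part that the paper automates.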
260421.0056
Faster by Design: Interactive Aerodynamics via Neural Surrogates Trained on Expert-Validated CFD
Thumiger · Bartezzaghi · Rigotti +5
This paper introduces GIST, a neural network surrogate that predicts race-car aerodynamics 10,000× faster than traditional CFD simulations while maintaining accuracy suitable for early-stage design. The work includes a new high-fidelity dataset of LMP2 race-car aerodynamics validated by industry experts at Dallara, enabling interactive design exploration in motorsport.
Formal Sciences
260421.0057
Safe Control using Learned Safety Filters and Adaptive Conformal Inference
Huriot · Tabbara · Sibai
This paper introduces Adaptive Conformal Filtering (ACoFi), which combines learned safety filters with adaptive conformal inference to provide soft safety guarantees for control systems. The method dynamically adjusts switching criteria between nominal and safe policies based on prediction uncertainty, achieving better safety performance than fixed-threshold approaches.
Formal Sciences
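The adaptive conformal inference update that ACoFi builds on (Gibbs and Candès) is a one-line online correction of the miscoverage level. A sketch with a toy coverage sequence:

```python
def aci_update(alpha_t, miscovered, target=0.1, gamma=0.05):
    """One step of adaptive conformal inference: after a miscoverage,
    shrink alpha (making future prediction sets more conservative);
    after a covered step, grow it back toward the target rate."""
    return alpha_t + gamma * (target - float(miscovered))

alpha = 0.1
for miss in [0, 0, 1, 0, 1]:     # a toy coverage/miss sequence
    alpha = aci_update(alpha, miss)
```

In ACoFi, per the summary, the adapted level drives the switching criterion between the nominal policy and the learned safety filter rather than a fixed threshold.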