← Back to Homepage

AI for Marketing

人工智能在市场营销领域的应用研究

📊 50 Papers 📅 Updated: 2026-05-14
1
KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving
Zedong Liu, Xinyang Ma, Dejun Luo et al. (12 authors)
📅 2026-05-13
LLMs are widely adopted in production, pushing inference systems to their limits. Disaggregated LLM serving (e.g., PD separation and KV state disaggregation) improves scalability and cost efficiency, but it also turns KV into an explicit payload crossing network and storage boundaries, making KV a dominant end-to-end bottleneck. Existing KV compression are typically static runtime configurations,...
2
A Hierarchical Language Model with Predictable Scaling Laws and Provable Benefits of Reasoning
Jason Gaitonde, Frederic Koehler, Elchanan Mossel et al. (5 authors)
📅 2026-05-13
We introduce a family of synthetic languages with hierarchical structure -- generated by a broadcast process on trees -- for which the role of context length and reasoning in autoregressive generation can be analyzed precisely. At the heart of our analytic approach is an \emph{exact $k$-gram ansatz} in place of transformers with context length $k$, a substitution we then validate empirically....
3
OpenAaaS: An Open Agent-as-a-Service Framework for Distributed Materials-Informatics Research
Peng Kang, Bixuan Li, Xiaoya Huang et al. (8 authors)
📅 2026-05-13
The Materials Genome Initiative catalyzed the proliferation of centralized platforms--SaaS, PaaS, and IaaS--that aggregate computational and experimental resources for accelerated materials discovery. In parallel, breakthroughs in large language models (LLMs) and autonomous agents have created powerful new reasoning capabilities for scientific research. Yet a critical "last mile"...
4
It's not the Language Model, it's the Tool: Deterministic Mediation for Scientific Workflows
Marios Adamidis, Danae Katrisioti, Yannis Tzitzikas et al. (4 authors)
📅 2026-05-13
Language models can produce convincing scientific analyses, but repeated generations on the same data do not guarantee the same result. A researcher may regenerate an identical query and receive a different fit, a different peak position or a different analysis procedure, without an obvious way to decide which output to trust. We propose typed mediation, a pattern in which the model orchestrates...
5
ViDR: Grounding Multimodal Deep Research Reports in Source Visual Evidence
Zhuofan Shi, Peilun Jia, Baoqin Sun et al. (7 authors)
📅 2026-05-13
Recent deep research systems have improved the ability of large language models to produce long, grounded reports through iterative retrieval and reasoning. However, most text-centered systems rely mainly on textual evidence, while multimodal systems often retrieve images only weakly or generate charts themselves, leaving source figures underused as evidence. We present ViDR, a multimodal deep...
6
Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels
Fairoz Nower Khan, Nabuat Zaman Nahim, Md Sajid Ahmed et al. (5 authors)
📅 2026-05-12
MeanFlow enables one-step generation in continuous spaces by learning an average velocity over a time interval rather than the instantaneous velocity field of flow matching. However, discrete state spaces do not have smooth trajectories or spatial derivatives, so the continuous formulation does not directly apply. We introduce Discrete MeanFlow, which replaces the motion of a point with the...
7
Grid-Orch: An LLM-Powered Orchestrator for Distribution Grid Simulation and Analytics
Boming Liu, Jin Dong, Jamie Lian
📅 2026-05-12
The power distribution engineering workforce faces a projected shortage of up to 1.5 million engineers by 2030, creating urgent demand for more accessible analysis tools. This paper introduces Grid-Orch, a framework that bridges Large Language Models (LLMs) and power system simulation through the Model Context Protocol (MCP), enabling engineers to perform complex distribution analyses via natural...
8
The critical slowing down in diffusion models
Luca Maria Del Bono, Giulio Biroli, Patrick Charbonneau et al. (4 authors)
📅 2026-05-12
Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theoretical control over when and why they succeed. Here we provide such insight for diffusion models-a class of generative schemes highly effective in practice-by analyzing their...
9
Design Your Ad: Personalized Advertising Image and Text Generation with Unified Autoregressive Models
Yexing Xu, Wei Feng, Shen Zhang et al. (18 authors)
📅 2026-05-12
Generating realistic and user-preferred advertisements is a key challenge in e-commerce. Existing approaches utilize multiple independent models driven by click-through-rate (CTR) to controllably create attractive image or text advertisements. However, their pipelines lack cross-modal perception and rely on CTR that only reflects average preferences. Therefore, we explore jointly generating...
10
EvoNav: Evolutionary Reward Function Design for Robot Navigation with Large Language Models
Zhikai Zhao, Chuanbo Hua, Federico Berto et al. (7 authors)
📅 2026-05-12
Robot navigation is a crucial task with applications to social robots in dynamic human environments. While Reinforcement Learning (RL) has shown great promise for this problem, the policy quality is highly sensitive to the specification of reward functions. Hand-crafted rewards require substantial domain expertise and embed inductive biases that are difficult to audit or adapt, limiting their...
11
FedMM: Federated Collaborative Signal Quantization for Multi-Market CTR Prediction
Jun Zhang, Dugang Liu, Xing Tang et al. (5 authors)
📅 2026-05-12
Online platforms such as Amazon and Netflix serve users across multiple countries and regions, underscoring the importance of multi-market recommendation (MMR). Most MMR methods adopt a pre-training and fine-tuning paradigm, in which a unified model is first trained on centralized, global data and subsequently adapted to specific markets. However, this approach ignores the privacy of market data....
12
A Mechanistic Investigation of Supervised Fine Tuning
Ruhaan Chopra
📅 2026-05-12
The cosine similarity between a large language model's hidden activations before and after Supervised Fine-Tuning (SFT) remains very high. This, at first glance, suggests that SFT leaves the model's activation geometry largely undisturbed. However, projecting both sets of activations through a Sparse Autoencoder (SAE) pretrained on the base model reveals that the underlying sparse...
13
Discovery of Interpretable Surrogates via Agentic AI: Application to Gravitational Waves
Tousif Islam, Digvijay Wadekar, Tejaswi Venumadhav et al. (7 authors)
📅 2026-05-11
Fast surrogate models for expensive simulations are now essential across the sciences, yet they typically operate as black boxes. We present \texttt{GWAgent}, a large language model (LLM)-based workflow that constructs interpretable analytic surrogates directly from simulation data. Surrogate modeling is well suited to agentic workflows because candidate models can be quantitatively validated...
14
Localization Boosting for Growth Markets: Mitigating Cross-Locale Behavioral Bias in Learning-to-Rank
Suryaa Veerabathiran Seran, Ashwin Naresh Kumar, Tracy Holloway King et al. (4 authors)
📅 2026-05-11
Adobe Express is expanding internationally, but the US has a disproportionately large content supply and interaction volume. Learning-to-rank (LTR) models trained primarily on behavioral feedback inherit this imbalance: templates popular in US are over-served in non-US locales. This cross-locale exposure bias suppresses local content discoverability and degrades ranking quality in growth locales....
15
gwBenchmarks: Stress-Testing LLM Agents on High-Precision Gravitational Wave Astronomy
Tousif Islam, Digvijay Wadekar, Zihan Zhou
📅 2026-05-11
Modern gravitational wave astronomy relies on modeling tasks that often require months of graduate-level effort, including building fast waveform surrogates from expensive numerical relativity simulations, modeling orbital dynamics of black holes, fitting merger remnant properties and constructing template banks. These problems demand extreme precision to support detection and parameter...
16
Template-as-Ontology: Configurable Synthetic Data Infrastructure for Cross-Domain Manufacturing AI Validation
Grama Chethan
📅 2026-05-11
LLarge language model (LLM)-based AI agents deployed in manufacturing environments require populated, schema-correct data for validation, yet production MES data is proprietary, privacy-encumbered, and vendor-specific. This paper introduces the Template-as-Ontology principle: a single Python configuration module (700-770 lines, 45 validated exports) serves simultaneously as the specification for...
17
The Semantic Training Gap: Ontology-Grounded Tool Architectures for Industrial AI Agent Systems
Grama Chethan
📅 2026-05-11
Large language model (LLM)-based AI agents are increasingly deployed in manufacturing environments for analytics, quality management, and decision support. These agents demonstrate statistical fluency with domain terminology but lack grounded understanding of operational semantics -- the relational structure that connects equipment identifiers, process parameters, failure codes, and regulatory...
18
The Agent Use of Agent Beings: Agent Cybernetics Is the Missing Science of Foundation Agents
Xinrun Wang, Chang Yang, He Zhao et al. (5 authors)
📅 2026-05-11
LLM-based foundation agents that perceive, reason, and act across thousands of reasoning steps are rapidly becoming the dominant paradigm for deploying artificial intelligence in open-ended, long-horizon complex tasks. Despite this significance, the field remains overwhelmingly engineering-driven. Engineering practice has converged on useful primitives (tool loops, memory banks, harnesses,...
19
iPay: Integrated Payment Action Recognition via Multimodal Networks and Adaptive Spatial Prior Learning
Kaicong Huang, Weiheng Oh, Thomas Guggisberg et al. (4 authors)
📅 2026-05-11
Automated transit payment analysis is vital for scalable fare auditing and passenger analytics, yet practice still relies on limited manual inspection. Prior vision- and skeleton-based methods remain brittle under noisy onboard surveillance and often depend on poorly generalizable handcrafted features. Building on the success of graph convolutional networks in human action recognition, we observe...
20
Measuring Embedding Sensitivity to Authorial Style in French: Comparing Literary Texts with Language Model Rewritings
Benjamin Icard, Lila Sainero, Alice Breton et al. (5 authors)
📅 2026-05-11
Large language models (LLMs) can convincingly imitate human writing styles, yet it remains unclear how much stylistic information is encoded in embeddings from any language model and retained after LLM rewriting. We investigate these questions in French, using a controlled literary dataset to quantify the effect of stylistic variation via changes in embedding dispersion. We observe that...
21
Can Language Models Analyze Data? Evaluating Large Language Models for Question Answering over Datasets
Andreas Xenofontos, Pavlos Fafalios
📅 2026-05-11
This paper investigates the effectiveness of large language models (LLMs) in answering questions over datasets. We examine their performance in two scenarios: (a) directly answering questions given a dataset file as input, and (b) generating SQL queries to answer questions given the schema of a relational database. We also evaluate the impact of different prompting strategies on model...
22
Strategic Exploitation in LLM Agent Markets: A Simulation Framework for E-Commerce Trust
Shijun Lei, Quang Nguyen, Swapneel S Mehta et al. (10 authors)
📅 2026-05-11
Agent-based modeling (ABM) has long been used in economics to study human behavior, and large language model (LLM) agents now enable new forms of social and economic simulation. While prior work has discovered strategic deception by LLM agents in financial trading and auction markets, e-commerce remains underexplored despite its distinctive information asymmetry: sellers privately observe product...
23
NaiAD: Initiate Data-Driven Research for LLM Advertising
Yihang Zhang, Zimeng Huang, Ren Zhai et al. (5 authors)
📅 2026-05-11
Reconciling platform revenue with user experience in LLM advertising motivates a data-centric foundation. We introduce NaiAD, the first comprehensive dataset for LLM-native advertising comprising 58,999 carefully constructed ad-embedded responses paired with user queries. NaiAD is organized around theoretically grounded evaluation metrics that separately and comprehensively capture user and...
24
Adaptive Data Harvesting for Efficient Neural Network Learning with Universal Constraints
Siteng Kang, Xinhua Zhang
📅 2026-05-10
Training neural networks to satisfy universal constraints over continuous domains poses unique challenges. Common examples include Lyapunov Neural Networks (Lyapunov NNs) and Physics-Informed Neural Networks (PINNs), where analytical solutions are generally either unavailable or overly restrictive. Sample-based methods are therefore commonly used to enforce these constraints, and the choice of...
25
RAwR: Role-Aware Rewiring via Approximate Equitable Partition
Riccardo Porcedda, Giuseppe Squillace, Bastian Epping et al. (7 authors)
📅 2026-05-10
While Graph Neural Networks (GNNs) have demonstrated significant efficacy in node classification tasks, where predictions rely on local neighborhood information, the performance of GNNs often drops when prediction tasks depend on long-range interactions. These limitations are attributed to phenomena such as oversquashing, where structural bottlenecks restrict signal propagation across the network...
26
Perceptual Asymmetry Between Hue Categories: Evidence from Human Color Categorization
Elnara Kadyrgali, Nuray Toganas, Muragul Muratbekova et al. (4 authors)
📅 2026-05-10
Human color categories are not uniformly distributed in perceptual space, yet most computational color models still assume fixed and evenly structured representations. In this paper, we present a focused analytical extension of the COLIBRI fuzzy color model by investigating perceptual asymmetry between hue categories. Using previously collected large-scale human color categorization data, we...
27
MC$^2$: Monte Carlo Correction for Fast Elliptic PDE Solving
Ethan Hsu, Hong Meng Yam, Ivan Ge
📅 2026-05-10
Partial differential equation (PDE) solvers underpin scientific computing, but real-world deployment is bounded by compute. Classical Monte Carlo solvers such as Walk-on-Spheres (WoS) are unbiased and geometry-agnostic but are slow. Learned solvers are fast but biased and brittle under distribution shift. We present \textbf{MC$^2$}, a hybrid WoS-Neural Network (WoS-NN) PDE solver that treats a...
28
How Much is Brain Data Worth for Machine Learning?
Lane Lewis, Zhixin Wang, David Schwab et al. (4 authors)
📅 2026-05-10
If a person can solve a task, can measuring their brain make it easier to train a model to solve that task too? Recent NeuroAI work suggests that supplementing task training with neural recordings can modestly improve model performance and robustness. However, it is unclear when there should be a benefit from using neural data and how much benefit to expect. We formulate this question...
29
Detect, Localize, and Explain: Interactive Hierarchical Log Anomaly Analytics with LLM Augmentation
Lei Ma, Suhani Chaudhary, Ethan Shanbaum et al. (8 authors)
📅 2026-05-09
Logs are ubiquitous in modern systems. Unfortunately, their unstructured nature in flat sequences limits understanding of execution behaviors, hindering effective anomaly diagnosis. To address this, Krone introduces a novel hierarchical log abstraction that transforms flat log sequences into semantically coherent units across entity, action, and status levels. Building on this abstraction, Krone...
30
Fast Rates for Offline Contextual Bandits with Forward-KL Regularization under Single-Policy Concentrability
Qingyue Zhao, Kaixuan Ji, Heyang Zhao et al. (4 authors)
📅 2026-05-09
\emph{Kullback-Leibler} (KL) regularization is ubiquitous in reinforcement learning algorithms in the form of \emph{reverse} or \emph{forward} KL. Recent studies have demonstrated $ε^{-1}$-type fast rates for decision making under reverse KL regularization, in contrast to the standard $ε^{-2}$-type sample complexity. However, for forward-KL-regularized objectives, existing statistical analyses...
31
A Communication-Theoretic Framework for LLM Agents: Cost-Aware Adaptive Reliability
Hamed Omidvar, Vahideh Akhlaghi
📅 2026-05-09
Agents built on large language models (LLMs) rely on a range of reliability techniques, including retry, majority voting, and self-consistency, that have been developed in parallel rather than within a common analytical framework. We observe that an LLM sampled at temperature $T$ is a discrete stochastic channel $p(y \mid x)$ in the sense of Shannon's coding theory, and use this identity as...
32
Why Do Aligned LLMs Remain Jailbreakable: Refusal-Escape Directions, Operator-Level Sources, and Safety-Utility Trade-off
Yu Chen, Yuanhao Liu, Qi Cao
📅 2026-05-09
Aligned large language models (LLMs) remain vulnerable to jailbreak attacks. Recent mechanistic studies have identified latent features and representation shifts associated with jailbreak success, but they leave a more fundamental question open: why do aligned LLMs remain jailbreakable, and what structural vulnerabilities in the model make this possible? We study this question through a...
33
CROP: Expert-Aligned Image Cropping via Compositional Reasoning and Optimizing Preference
Zhitong Dong, Chao Li, Jie Yu et al. (4 authors)
📅 2026-05-09
Aesthetic image cropping aims to enhance the aesthetic quality of an image by improving its composition through spatial cropping. Previous methods often rely on saliency prediction or retrieval augmentation, ignoring the task's core requirement: a deep understanding of composition and aesthetics. Consequently, saliency-based methods struggle to make compositional trade-offs in complex...
34
FRACTAL: SSM with Fractional Recurrent Architecture for Computational Temporal Analysis of Long Sequences
Mengqi Li, Wensheng Lin, Jinshuai Yang et al. (4 authors)
📅 2026-05-09
Effective sequence modeling fundamentally requires balancing the retention of unbounded history with the high-resolution detection of abrupt short-term variations common in real-world phenomena. However, existing state space models (SSMs) relying on high-order polynomial projection operators (HiPPO) face a critical trade-off where uniform measures dilute recent information to maintain timescale...
35
LLM Advertisement based on Neuron Auctions
Peiran Yun, Wenxin Xu, Jiayuan Liu et al. (7 authors)
📅 2026-05-08
As Large Language Models (LLMs) transition into conversational agents, generative advertising emerges as a crucial monetization strategy. However, embedding advertisements within unstructured LLM outputs introduces a critical trilemma: balancing advertiser payoffs, platform revenue, and user experience. Existing methods, such as prompt injection or rigid position slots, disrupt semantic coherence...
36
The Reciprocity Gradient
Yue Lin, Pascal Poupart, Shuhui Zhu et al. (8 authors)
📅 2026-05-08
Communication is fundamental to sustaining reciprocity and cooperation in strategic interactions. We identify and formulate the influence attribution problem as the central optimization difficulty inherent in such dynamics for a learning agent: any action or signal the agent emits reshapes the reputations of many third parties along combinatorially branching paths before feeding back into its own...
37
Neural Operators as Efficient Function Interpolators
Vasilis Niarchos, Angelos Sirbu, Sokratis Trifinopoulos
📅 2026-05-08
Neural operators (NOs) are designed to learn maps between infinite-dimensional function spaces. We propose a novel reframing of their use. By introducing an auxiliary base-space, any finite-dimensional function can be viewed as an operator acting by composition on functions of the base-space. Through a range of benchmarks on analytic functions of increasing complexity and dimensionality, we...
38
Three-in-One World Model: Energy-Based Consistency, Prediction, and Counterfactual Inference for Marketing Intervention
Junichiro Niimi
📅 2026-05-08
Marketing decisions reflect the interaction of latent consumer heterogeneity, time-varying internal states, and explicit interventions, a structure that current prediction- and language-oriented models do not capture in a unified manner. We propose a Three-in-One world-model architecture in which a Deep Boltzmann Machine (DBM) learns a frozen belief representation from demographics, time, and...
39
Path-Coupled Bellman Flows for Distributional Reinforcement Learning
Boyang Xu, Qing Zou, Siqin Yang et al. (4 authors)
📅 2026-05-07
Distributional reinforcement learning (DRL) models the full return distribution, but existing finite-support or quantile-based methods rely on projections, while recent flow-based approaches can suffer from \emph{boundary mismatch} at the flow source or from \emph{high-variance} bootstrapping when current and successor noises are independent. We propose Path-Coupled Bellman Flows (PCBF), a...
40
LLM-Guided Open Hypothesis Learning from Autonomous Scanning Probe Microscopy Experiments
Boris Slautin, Utkarsh Pratiush, Yu Liu et al. (5 authors)
📅 2026-05-07
Autonomous experimentation has transformed microscopy and materials discovery by enabling closed-loop optimization including imaging and spectroscopy tuning, strucutre property relationship discovery, and exploration of combinatorial libraries. However, most current workflows remain limited to selecting measurements within fixed objective or hypothesis spaces, rather than generating new physical...
41
Market-Alignment Risk in Pricing Agents: Trace Diagnostics and Trace-Prior RL under Hidden Competitor State
Peiying Zhu, Sidi Chang
📅 2026-05-07
Outcome metrics can certify the wrong behavior. We study this failure in a two-hotel revenue-management simulator where Hotel A trains an agent against a fixed rule-based revenue-management competitor, Hotel B. A standard learning agent can obtain near-reference revenue per available room (RevPAR) while failing to learn market-like yield management: it sells too aggressively, undercuts, or...
42
Litespark Inference on Consumer CPUs: Custom SIMD Kernels for Ternary Neural Networks
Nii Osae Osae Dade, Tony Morri, Moinul Hossain Rahat et al. (4 authors)
📅 2026-05-07
Large language models (LLMs) have transformed artificial intelligence, but their computational requirements remain prohibitive for most users. Standard inference demands expensive datacenter GPUs or cloud API access, leaving over one billion personal computers underutilized for AI workloads. Ternary models offer a path forward: their weights are constrained to {-1, 0, +1}, theoretically...
43
Render, Don't Decode: Weight-Space World Models with Latent Structural Disentanglement
Roussel Desmond Nzoyem, Mauro Comi
📅 2026-05-07
Training world models on vast quantities of unlabelled videos is a critical step toward fully autonomous intelligence. However, the prevailing paradigm of encoding raw pixels into opaque latent spaces and relying on heavy decoders for reconstruction leaves these models computationally expensive and uninterpretable. We address this problem by introducing NOVA, a world modelling framework that...
44
OmicsLM: A Multimodal Large Language Model for Multi-Sample Omics Reasoning
Maciej Sypetkowski, Joanna Krawczyk, Łukasz Smoliński et al. (7 authors)
📅 2026-05-07
Interpreting transcriptomic data is one of the most common analytical tasks in modern biology. Yet most current models either consume expression profiles without producing natural-language biological explanations, or reason in language without direct access to quantitative omics measurements. We introduce OmicsLM, a multimodal LLM that connects quantitative omics profiles with natural-language...
45
Unified Value Alignment for Generative Recommendation in Industrial Advertising
Xinxun Zhang, Yuling Xiong, Jiale Zhou et al. (16 authors)
📅 2026-05-07
Generative Recommendation (GR) reformulates recommendation as a next-token generation problem and has shown promise in industrial applications. However, extending GR to industrial advertising is non-trivial because the system must optimize not only user interest but also commercial value. Existing GR pipelines remain largely semantics-centric, making it difficult to align value signals across...
46
EGA: Adapting Frozen Encoders for Vector Search with Bounded Out-of-Distribution Degradation
Dongfang Zhao
📅 2026-05-07
Vector search systems built on frozen vision encoders face queries from unseen classes at deployment, yet existing adapter training collapses under this shift: high-capacity adapters with global contrastive losses silently reassign unseen-class samples to wrong seen-class clusters, dropping worst-case Label Precision by over 40 points below the frozen baseline in our tests. We propose Euclidean...
47
Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections
Vinit Katariya, Seungjin Kim, Curtis Craig et al. (5 authors)
📅 2026-05-06
Artificial intelligence (AI) and computer vision are transforming transportation data collection. This study introduces an AI-enabled analytics framework leveraging existing CCTV infrastructure to evaluate the impact of soft interventions, such as temporary pedestrian refuges and curb extensions, on vehicle speed and safety. Using deep learning and perspective-based speed estimation, we evaluated...
48
Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization
Xin Yu, Liuchen Liao, Yiwen Zhang et al. (6 authors)
📅 2026-05-06
On-policy distillation is an efficient alternative to reinforcement learning, offering dense token-level training signals. However, its reliance on a stronger external teacher has driven recent work on on-policy self-distillation, where the same model serves as both teacher and student under different prompt contexts. Yet, existing self-distillation methods largely reduce learning to KL matching...
49
Federated Learning for Early Prediction of EV Charging Demand
Vasilis Perifanis, Foteini Nikolaidou, Nikolaos Pavlidis et al. (5 authors)
📅 2026-05-06
Accurate forecasting of electric vehicle (EV) charging demand is critical for grid stability, infrastructure planning, and real-time charging optimization. In this work, we study the problem of early prediction of charging demand, where the total energy of a session is estimated using only information available at plug-in time and during the first minutes of charging. This enables actionable...
50
AICoFe: Implementation and Deployment of an AI-Based Collaborative Feedback System for Higher Education
Alvaro Becerra, Alejandra Palma, Ruth Cobos
📅 2026-05-06
Effective peer feedback is essential for developing critical reflection in higher education, yet its impact is often limited by the inconsistent quality of student-generated comments. This paper presents the implementation and deployment of AICoFe (AI-based Collaborative Feedback), a system designed to bridge this gap through a human-centered AI approach. We describe a modular architecture that...