Passport name: CHENBOYU · publications & ORCID under "Baiyu Chen" 证件英文名:CHENBOYU · 学术成果与 ORCID 统一署名 "Baiyu Chen"
Undergraduate Researcher · School of Economics and Management, Minnan University of Science and Technology 本科生科研人员 · 闽南理工学院 经济管理学院
Financial Engineering · 2022 – Present | Exchange: TAR UMT, Kuala Lumpur · Jan – May 2026 金融工程 · 2022 年至今 | 交换:东姑阿都拉曼管理及工艺大学,吉隆坡 · 2026.01–05
Baiyu Chen's research centres on LLM hallucination — spanning evaluation framework construction, technical mitigation methods, and domain-specific governance — with high-stakes financial regulation serving as the primary empirical validation domain. 陈柏聿的核心研究方向为大语言模型幻觉,涵盖评估框架构建、技术缓解方法与领域特化治理,以金融监管等高风险场景作为主要实证验证域。
Shuangyang Zheng, Yangchen Zhong, Xiaobao Que, Jingming Shen, Baiyu Chen, Weiqin Shen, Shibao Zheng, Bo Liu, Jin Gao
ICCSMT '25 · ACM · pp. 1747–1755 (9 pages) · Conf. dates:会议时间: 26–28 Dec 2025, Xiamen, China · Published:发布日期: 01 April 2026
Shuangyang Zheng, Yangchen Zhong, Xiaobao Que, Jingming Shen, Baiyu Chen, Weiqin Shen, Shibao Zheng, Hengxin Deng, Jin Gao
BDEIM '25 · ACM · pp. 1477–1487 (11 pages) · Conf. dates:会议时间: 19–21 Dec 2025, Shanghai, China · Published:发布日期: 15 May 2026
| Title报告题目 | Conference会议 | Location地点 | Date时间 | Role角色 |
|---|---|---|---|---|
| RAG-Enhanced LLM and RL Scheduling: Optimizing a Multi-Agent Framework for Abnormal Futures Price Monitoring | ICCSMT '25 | Xiamen, China厦门,中国 | 26–28 Dec 2025 | Presenter (Oral)报告人(口头报告) |
| Patent No.专利号 | ZL 2025 1 1403494.2 |
| Announcement No.公告号 | CN 120873192 B |
| Filing Date申请日 | 2025.09.29 |
| Grant Date授权公告日 | 2026.01.13 |
| Inventors发明人 | Shuangyang Zheng, Baiyu Chen, Yi Lin, Jingming Shen, Wenhui Huang, Yangchen Zhong, Yuhong Huang郑双阳、陈柏聿、林一、沈景铭、黄文辉、钟阳晨、黄宇宏 |
| Patent Holder专利权人 | Minnan University of Science and Technology闽南理工学院 |
| Patent No.专利号 | ZL 2026 1 0018069.X |
| Announcement No.公告号 | CN 121480738 B |
| Filing Date申请日 | 2026.01.08 |
| Grant Date授权公告日 | 2026.03.17 |
| Inventors发明人 | Shuangyang Zheng, Baiyu Chen, Yi Lin, Jingming Shen, Wenhui Huang, Yuwen Wang, Xi Liang, Jin Gao, Weiqin Shen, Quanzhou Chen郑双阳、陈柏聿、林一、沈景铭、黄文辉、王煜文、梁曦、高谨、沈伟钦、陈权洲 |
| Patent Holder专利权人 | Minnan University of Science and Technology闽南理工学院 |
| Status状态 | Patent Pending — Institution approved: 2026.05.19申请中 — 校内审批:2026.05.19 |
| Lead Inventor第一发明人 | Baiyu Chen陈柏聿 |
| Co-Inventors共同发明人 | Jingming Shen · Kaixi Zhang (Tsinghua Univ., PhD) · Ruolong Ma (CAICT AI Institute) · Tingqing Ye (China Univ. of Geosciences Beijing) · Wentao Chen (CAICT, Deputy Director) · Wei Dai (CUFE) · Li Luo · Qisi Wang · Muchen Chen · Shuangyang Zheng 沈景铭 · 张开西(清华大学数学系博士)· 马若龙(中国信通院AI所)· 叶廷青(中国地质大学北京)· 陈文弢(中国信通院AI所副所长)· 戴韡(中央财经大学)· 罗立 · 汪启思 · 陈牧晨 · 郑双阳 |
| Patent Holder专利权人 | Minnan University of Science and Technology闽南理工学院 |
| # | Standard Title标准名称 | Standard No.标准编号 | Issue Date发布时间 | Role参与身份 |
|---|---|---|---|---|
| 1 | Hallucination Evaluation Framework for Generative AI Model Applications 《生成式人工智能模型应用幻觉评估框架》 | AIIA/T 0294-2026 | 2026.04 | Principal Drafter (9th of drafters · inst. ranked 3rd) 主要起草人 (起草人排名第 9 · 单位排名第 3) |
| 2 | Security of Large Model Dev & Ops Platform, Part 3: Security Operations Management Capability Requirements 《大模型开发运营平台安全 第3部分:大模型平台安全运营管理能力要求》 | AIIA/T 0296-2026 | 2026.05 | Principal Drafter (10th of drafters · inst. ranked 11th) 主要起草人 (起草人排名第 10 · 单位排名第 11) |
| 3 | Security Classification Standard for AI Agent Applications 《AI智能体应用安全分级规范》 | Pending待定 | In Development制定中 | Principal Drafter主要起草人 |
| Drafter Rank起草人排名 | Baiyu Chen — 9th of principal drafters陈柏聿 — 主要起草人第 9 位 |
| Institution Rank起草单位排名 | Minnan University of Science and Technology — 3rd of drafting organisations闽南理工学院 — 起草单位第 3 位 |
| Lead Organisation主要起草单位 | China Academy of Information and Communications Technology (CAICT/CICT) · 40+ co-drafting organisations中国信息通信研究院(牵头)· 联合 40+ 家单位 |
Defines hallucination evaluation requirements for generative AI models (LLMs, multimodal models, AI agents) across 6 core dimensions: factual accuracy · source fidelity · internal consistency · instruction following · logical reasoning · honest uncertainty expression. 规定生成式 AI 模型(大语言模型、多模态模型、AI 智能体)幻觉评估的 6 大核心维度:事实准确性·来源忠实性·内部一致性·指令遵循能力·逻辑推理能力·不确定性诚实表达。
Key contribution: First proposed the concept of Execution Hallucination (执行幻觉) — a new hallucination type specific to tool-calling AI agents, where the model fabricates execution states, falsely reports task completion, or claims unauthorised capabilities. This fills a critical gap in AI safety evaluation for agentic systems. 核心贡献:首次提出执行幻觉(Execution Hallucination)概念——专门针对具备工具调用能力的 AI 智能体,指模型虚构执行状态、虚报任务完成状态或声称具备未被授权执行能力的幻觉现象,填补了 AI 智能体安全评估领域的重要空白。
| Drafter Rank起草人排名 | Baiyu Chen — 10th of principal drafters陈柏聿 — 主要起草人第 10 位 |
| Institution Rank起草单位排名 | Minnan University of Science and Technology — 11th of drafting organisations闽南理工学院 — 起草单位第 11 位 |
| Lead Organisation主要起草单位 | China Academy of Information and Communications Technology (CAICT/CICT) · 10 co-drafting organisations中国信息通信研究院(牵头)· 联合 10 家单位 |
Project Researcher / Principal Researcher · Contributed to RAG-enhanced LLM, multi-agent systems, RL applied to financial regulation, futures anomaly monitoring, arbitrage strategy, and A-share manipulation detection research. 项目研究员 / 首席研究员 · 参与 RAG 增强 LLM、多智能体系统、强化学习在金融监管中的应用,以及期货异常监测、跨市场套利策略、A 股市场操纵识别等研究。
| Research Focus研究方向 | Role角色 |
|---|---|
| Generative AI Hallucination Governance & LLM Reliability Evaluation生成式 AI 幻觉治理与大语言模型可靠性评估 | Principal Researcher首席研究员 |
| AI Safety & Large Model Platform GovernanceAI 安全与大模型平台治理 | Principal Researcher首席研究员 |
| Financial Regulatory Technology (RegTech)金融监管科技(RegTech) | Project Researcher项目研究员 |
| Quantitative Investment & Financial Data Analytics量化投资与金融数据分析 | Project Researcher项目研究员 |
| Multi-Source Heterogeneous Data Fusion & Intelligent Decision-Making多源异构数据融合与智能决策 | Project Researcher项目研究员 |
| Organiser主办单位 | Artificial Intelligence Industry Alliance of China (AIIA)中国人工智能产业发展联盟(AIIA) |
| Venue地点 | 7/F, Shouxiang Technology Building, Haidian District, Beijing北京市海淀区首享科技大厦 7 层 |
| Role参与身份 | Invited Expert / Principal Researcher, Data Intelligence Collaborative Innovation Centre, Minnan University of Science and Technology专家 / 闽南理工学院数据智能协同创新中心首席研究员 |
| Presentation分享主题 | Generative AI Hallucination Control Based on Quadruple Verification Mechanism and Mixture of Experts Model基于四重验证机制及混合专家模型的生成式 AI 幻觉控制原理及方案 |
| Organiser主办单位 | Artificial Intelligence Industry Alliance of China (AIIA)中国人工智能产业发展联盟(AIIA) |
| Venue地点 | 7/F, Shouxiang Technology Building, Haidian District, Beijing北京市海淀区首享科技大厦 7 层 |
| Focus研讨方向 | LLM Platform Security · AI Safety Governance · Security Operations Management Requirements大模型平台安全 · AI 安全治理 · 平台安全运营管理能力要求 |
Independently led the end-to-end development of two core AI systems. Assessed as "Excellent" by on-site supervisor. 个人主导完成两大核心系统的全流程研发,获实习单位指导教师评定"实习成绩优秀"。
Sole developer. Playwright browser automation + parallel multi-model AI core. ASR accuracy >95%, classification >90%, data production cycle reduced >90%. 独立研发者。Playwright 浏览器自动化 + 多模型并行 AI 核心。ASR 准确率 >95%,分类准确率 >90%,数据生产周期缩短 >90%。
Lead developer (search engine module). Dual-layer expert agent pipeline. Human-review-friendly output with confidence scores and uncertainty flags. 主要开发者(搜索引擎模块)。双层专家智能体架构。系统输出包含可信度评分与疑问点提示,构建人机协同标注新范式。
Independently refactored a semi-automatic order-execution strategy into a fully automated system with volatility filtering and dynamic long/short signal logic. Conducted multi-contract backtesting and live simulated trading with systematic post-trade analysis. Gained deep insight into the gap between live-trading constraints and backtest results, and into the practical logic of human–AI collaboration within a quantitative team. 独立将半自动下单策略改造为融合波动率筛选与动态多空信号逻辑的全自动系统;进行多品种回测与模拟实盘交易,建立系统化复盘机制。深入理解实盘约束与策略回测结果之间的差异,以及量化团队中人机协同的实践逻辑。
| Certification证书名称 | Issuing Body颁发机构 | Date时间 | Status状态 |
|---|---|---|---|
| Fund Practitioner Qualification – Securities Investment Fund Fundamentals基金从业人员资格——证券投资基金基础知识 | AMAC | 2025.05.24 | Passed合格 |
| Fund Practitioner Qualification – Fund Laws, Regulations, Ethics & Business Standards基金从业人员资格——基金法律法规、职业道德与业务规范 | AMAC | 2025.05.24 | Passed合格 |
| # | Award奖项 | Competition / Event赛事 | Level级别 | Date时间 | Team成员 |
|---|---|---|---|---|---|
| 1 | Second Prize二等奖 | 1st National AI Application Innovation Competition (Fujian Regional)首届全国人工智能应用创新大赛(福建省赛区) | Provincial省级 | 2025.06 | Baiyu Chen, Yi Lin, Jingming Shen陈柏聿、林一、沈景铭 |
| 2 | First Prize一等奖 | 2025 National College Student High-Tech Competition – Python Data Analysis Track2025 年全国大学生高新技术竞赛——Python 数据分析赛项 | National全国 | 2025 | Baiyu Chen陈柏聿 |
| 3 | B-Level AwardB 级 | 6th China AI Competition – Large Model Hallucination Challenge第六届中国人工智能大赛——大模型幻觉挑战赛 | National全国 | 2025.12 | MUST Team闽南理工学院代表队 |
| 4 | First Prize (Undergrad)本科组一等奖 | 2025 "Innovation Cup" College Student Big Data Challenge2025 年"创新杯"大学生大数据挑战赛 | National全国 | 2025.08 | Baiyu Chen, Yuxin Zhong, Xinye Huang陈柏聿、钟雨欣、黄新页 |
| 5 | First Prize (Undergrad)本科组一等奖 | 2025 APMCM Asia-Pacific Mathematical Modelling Competition2025 年 APMCM 亚太地区大学生数学建模竞赛 | International国际 | 2025.06 | Jingming Shen, Baiyu Chen, Xinye Huang沈景铭、陈柏聿、黄新页 |
| 6 | First Prize一等奖 | 12th Cross-Strait & HK-Macao Vocational Skills Competition – Futures Investment Simulation第十二届海峡两岸暨港澳大学生职业技能大赛——期货投资模拟赛 | Cross-Strait两岸四地 | 2025 | Baiyu Chen, Wenhui Huang, Minhong Wu陈柏聿、黄文辉、吴敏虹 |
Designed and deployed a full-stack personalised stock-investment advisory Agent combining Computer Science × Finance × Psychology. Quantifies behavioural biases (overconfidence, loss aversion) as real-time emotional signals. Data sources: Sina Finance API · Eastmoney · Weibo sentiment · Baidu AI Search MCP · CSMAR database. Deployed on Baidu Qianfan AppBuilder as a production AI-native app. 设计并上线全栈个性化股票投资策略智能体,融合计算机科学 × 金融工程 × 心理学三学科视角,将过度自信、损失厌恶等行为偏差量化为实时情绪信号。数据源:新浪财经 API · 东方财富 · 微博舆情 · 百度 AI 搜索 MCP · 国泰安数据库。已在百度智能云千帆 AppBuilder 上线投入使用。
4-Dimension Analysis Framework → MoE四维分析框架 → 混合专家模型(MoE)
All four streams fed into a unified feature pipeline → MoE model → dynamic-weight recommendation.四路数据统一进入特征工程管道 → MoE 模型 → 动态权重综合推荐。
6-Dimension User Profile → Personalised Strategy6 维度用户画像 → 个性化策略匹配
Conservative → low-volatility blue chips; Aggressive → high-growth sector stocks. Macro → industry → stock 3-tier analysis.保守型 → 低波动蓝筹;激进型 → 高成长赛道。宏观 → 行业 → 个股三层分析链路。
Constructed a comprehensive quantitative analysis framework on B2B software-sales data, covering sales funnel efficiency, opportunity prioritisation, customer lifetime value, and win-rate prediction. 基于 B2B 软件服务公司销售数据,构建涵盖销售漏斗效率、商机优先级评估、客户生命周期价值与赢单概率预测的综合定量分析框架。
Designed and implemented a competitive LLM hallucination detection & mitigation system through 7 major architectural iterations. Core innovation: replaced 5 domain experts with 12 task-type experts (translation · code generation · knowledge QA · reasoning · problem solving, etc.) plus a meta-expert router — enabling finer-grained, task-aware hallucination control. Dual API failover: DeepSeek (primary) + Qwen3-480B (backup). 经过 7 次主要架构迭代,设计并实现竞赛级 LLM 幻觉检测与缓解系统。核心创新:将 5 个领域专家替换为 12 个任务型专家(翻译·代码生成·知识问答·逻辑推理·问题求解等)+ 元专家路由器,实现更细粒度的任务感知幻觉控制。双 API 容错:DeepSeek(主)+ Qwen3-480B(备)。
Tiered Hallucination Detection (per task)按任务类型的分级幻觉检测策略
Lightweight tasks (translation, extraction) use FORMAT-only; high-uncertainty tasks (knowledge QA) activate FULL pipeline with second-model verification — reducing unnecessary compute.轻量任务(翻译、信息抽取)仅走 FORMAT;高不确定性任务(知识问答)激活 FULL 流程并启用第二模型验证,显著节省算力。
Task-Adaptive DeepConf Multi-Round Budgets任务自适应 DeepConf 多轮预算
| Translation / Extraction翻译 / 信息抽取 | 1 round |
| Classification / Computation分类 / 计算 | 2 rounds |
| Reasoning / Problem Solving逻辑推理 / 问题求解 | 3–4 rounds |
| Knowledge QA (highest uncertainty)知识问答(最高不确定性) | 5 rounds |
Built a complete "prediction → feature fusion → optimisation → Agent" pipeline on ~100,000 road segment records. Stage 1 Prediction: TabNet (sequential attention, 99.99%) + custom DRGANDRNELM / LTCNBLS / TCAETCNResNet hybrid (99.85%) + DNN+SHAP. Stage 2 Feature Fusion: RLHF-based dual-weight fusion reconciles conflicting TabNet and SHAP importance rankings by simulating human preference — producing a more robust feature priority list. Stage 3 Optimisation: quantum annealing → HCAOA (budget-aware initialisation + adaptive threshold + multi-strategy local search). Stage 4 Agent: RAG knowledge base (road history data) + full-stack LLM chatbot for real-time maintenance advisory. 在约 10 万条路段记录上构建"预测 → 特征融合 → 优化 → 智能体"完整流水线。预测层:TabNet(顺序注意力机制,99.99%)+ 定制 DRGANDRNELM/LTCNBLS/TCAETCNResNet 混合架构(99.85%)+ DNN+SHAP。特征融合层:RLHF 双权重融合机制——模拟人类偏好,调和 TabNet 与 SHAP 特征重要性冲突,输出更鲁棒的特征优先级排名。优化层:量子退火 → HCAOA(预算感知初始化 + 自适应阈值 + 多策略局部搜索)。智能体层:RAG 路面历史知识库 + 全栈 LLM 实时维护顾问 Agent。
Constructed a multi-layer ensemble prediction framework on physical-examination biochemical data. Identified key urinary biomarkers (specific gravity, pH, osmolality, etc.) via modified Z-score outlier detection, Spearman rank correlation, and SHAP analysis. Compared four algorithms — automated deep forest, Bayesian network, Gaussian process classifier, and TabNet — with cross-validation optimisation. 基于体检尿液生化指标数据,构建多层次集成预测框架。通过修正 Z-Score 异常值处理、Spearman 秩相关与 SHAP 值分析,识别与肾结石高度相关的关键生理指标(尿比重、pH、渗透压等)。比较自动化深度森林、贝叶斯网络、高斯过程分类器与 TabNet 四种算法,交叉验证优化模型参数。
Designed and led implementation of a DeepSeek LLM-driven futures trading strategy. Market state is formatted as structured prompts integrating multi-dimensional technical indicators (RSI, MACD, Bollinger Bands + novel academic indicators GVR, GDMA, DoMA) across three timeframes (30 min / 1 h / 4 h); DeepSeek generates trade direction, confidence, target price, and stop-loss. Strict 1:2 risk-reward ratio with multi-layer risk controls (max position 25%, min holding time, frequency limits). 主导设计并实现 DeepSeek 大语言模型驱动的期货交易策略。将多维度技术指标(RSI、MACD、布林带 + 前沿学术指标 GVR、GDMA、DoMA)整合为结构化提示词,跨三个时间周期(30 分钟 / 1 小时 / 4 小时)验证后输入 DeepSeek,由模型生成含方向、置信度、目标价与止损价的决策信号。严格执行 1:2 风险收益比,配备多层次风控机制(最大仓位 25%、最小持仓时间、交易频率限制)。