Chen Baiyu  陈柏聿 陈柏聿  Chen Baiyu

Passport name: CHENBOYU  ·  publications & ORCID under "Baiyu Chen" 证件英文名:CHENBOYU  ·  学术成果与 ORCID 统一署名 "Baiyu Chen"

Undergraduate Researcher · School of Economics and Management, Minnan University of Science and Technology 本科生科研人员 · 闽南理工学院 经济管理学院

Financial Engineering · 2022 – Present  |  Exchange: TAR UMT, Kuala Lumpur · Jan – May 2026 金融工程 · 2022 年至今  |  交换:东姑阿都拉曼管理及工艺大学,吉隆坡 · 2026.01–05

2295070968@qq.com
+86 13779960135
ORCID: 0009-0003-7841-3112
2
ACM PapersACM 论文
2
Patents Granted已授权专利
1
Patent Pending (1st Inv.)申请中专利(第一发明人)
3
AIIA StandardsAIIA 标准参与
2
Invited / Oral Talks受邀报告 / 口头报告
6
Competition Awards竞赛获奖
LLM Hallucination Research大语言模型幻觉研究 Hallucination Evaluation幻觉评估 Hallucination Mitigation幻觉缓解 Domain-Specific LLM Governance领域 LLM 治理 RAG · Multi-Agent · RL FinTech / RegTech AI Safety StandardsAI 安全标准

Research Profile学术定位

Baiyu Chen's research centres on LLM hallucination — spanning evaluation framework construction, technical mitigation methods, and domain-specific governance — with high-stakes financial regulation serving as the primary empirical validation domain. 陈柏聿的核心研究方向为大语言模型幻觉,涵盖评估框架构建、技术缓解方法与领域特化治理,以金融监管等高风险场景作为主要实证验证域。

Primary LLM Hallucination: Evaluation · Mitigation · Domain Governance 主轴 LLM 幻觉研究:评估 · 缓解 · 领域治理
From standardised evaluation frameworks (AIIA/T 0294-2026) and general mitigation methods (granted patent on triple-confidence structured reasoning) to domain-specific hallucination elimination in legal contexts (pending patent, 1st inventor). Research trajectory: evaluation standards → engineering solutions → domain-specific patents → hallucination mechanism and training-stage intervention. 从标准化评估框架(AIIA/T 0294-2026)和通用幻觉缓解方法(三重置信度结构化推理专利),到法律领域特化幻觉消除(申请中,第一发明人)。研究路径:评估标准 → 工程解决方案 → 领域特化专利 → 幻觉机制与训练阶段介入。
Application Domain AI-Driven Financial Regulation & RegTech 应用域 AI 驱动金融监管与监管科技
Financial regulation serves as the empirical testing ground: RAG-enhanced LLM and multi-agent systems for futures anomaly monitoring (ACM); PPO reinforcement learning for cross-market arbitrage (ACM). High-stakes financial decisions demand the highest reliability — making this an ideal domain to validate hallucination solutions and build domain-specific datasets. 金融监管作为实证验证域:RAG 增强 LLM 与多智能体期货异常监测(ACM 论文);PPO 强化学习跨市场套利(ACM 论文)。金融决策的高风险性使其成为验证幻觉解决方案、积累领域数据集的理想场景。
Research arc: AIIA evaluation standard (defining hallucination) → mitigation patents (solving hallucination) → domain-specific patents in legal / supply-chain finance / accounting · audit → empirical papers → long-term: hallucination mechanism research & training-stage intervention. 研究路径:AIIA 评估标准(定义幻觉)→ 缓解专利(解决幻觉)→ 法律 / 供应链金融 / 财会审计领域特化专利 → 实证论文 → 长期:幻觉机制研究与训练阶段介入。

Key Highlights核心成果亮点

2 ACM Conference Publications — FinTech & AI Governance 2 篇 ACM 国际会议论文 · 金融科技与 AI 治理
[1] ICCSMT '25 · RAG+DRL multi-agent futures monitoring — 92% detection accuracy, hallucination rate 35%→7%.  [2] BDEIM '25 · PPO copper futures cross-market arbitrage — 95.05% cumulative return. [1] ICCSMT '25 · RAG+DRL 多智能体期货异常监测 — 检测准确率 92%,幻觉率 35%→7%。 [2] BDEIM '25 · PPO 铜期货跨市场套利算法 — 累积收益率 95.05%。
3 National Invention Patents — 2 Granted · 1 Pending (1st Inventor) 3 项国家发明专利 · 2 项已授权 · 1 项申请中(第一发明人)
Granted: intelligent data annotation (ZL 2025 1 1403494.2) · LLM hallucination reduction (ZL 2026 1 0018069.X). Pending as 1st inventor: legal-domain LLM hallucination elimination via uncertainty theory & formal logic — cross-institutional with Tsinghua · CAICT · CUFE. 已授权:高质量智能数据标注(ZL 2025 1 1403494.2)· 降低大语言模型幻觉(ZL 2026 1 0018069.X)。申请中(第一发明人):基于不确定理论与形式逻辑验证的法律领域 LLM 幻觉消除——清华·CAICT·中央财经大学 跨校合作。
Principal Drafter — 3 AIIA National Technical Standards AIIA 国家技术标准主要起草人 · 3 项
AIIA/T 0294-2026 — Hallucination Evaluation Framework: first proposed the concept of Execution Hallucination (执行幻觉) for tool-calling AI agents.  AIIA/T 0296-2026 — LLM Platform Security Ops Management.  AI Agent Security Classification Standard — in development. AIIA/T 0294-2026——幻觉评估框架:首次提出面向工具调用智能体的执行幻觉概念。 AIIA/T 0296-2026——大模型平台安全运营管理能力要求。 AI 智能体应用安全分级规范——在制定中。
National & International Awards — First Prize ×2 · International First Prize ×1 国家级一等奖 ×2 · 国际一等奖 ×1(共 6 项获奖)
Python Data Analysis (National) · "Innovation Cup" Big Data Challenge (National) · APMCM Asia-Pacific Mathematical Modelling (International) · 6 competition awards in total. 全国高校 Python 数据分析赛一等奖 · 创新杯全国大数据挑战赛一等奖 · APMCM 亚太数学建模国际一等奖,共 6 项获奖。
Invited Expert — AIIA National AI Standardisation Workshop, Beijing 受邀专家 · AIIA 全国人工智能标准化专家研讨会,北京
Presented on "Generative AI Hallucination Control Based on Quadruple Verification & MoE Model" · Jan 29, 2026 · Shouxiang Technology Building, Haidian District. 主题报告:基于四重验证机制及混合专家模型的生成式 AI 幻觉控制原理及方案 · 2026 年 1 月 29 日 · 北京海淀首享科技大厦。
Conference Oral Presentation Certificate — ICCSMT 2025, Xiamen 国际会议口头报告证书 · ICCSMT 2025,厦门
Presented "RAG-Enhanced LLM and RL Scheduling" at the 2025 6th Int'l Conference on Computer Science and Management Technology · Dec 26–28, 2025. 在第六届计算机科学与管理技术国际会议发表口头报告 · 2025 年 12 月 26–28 日,厦门。

Featured Publications代表性论文

[1] RAG-Enhanced LLM and RL Scheduling: Optimizing a Multi-Agent Framework for Abnormal Futures Price Monitoring ICCSMT '25
92%
Detection Acc.检测准确率
7%
Hallucination Rate (↓35%)幻觉率(↓35%)
DOI 10.1145/3795154.3795432
[2] Research on an Intelligent Algorithm for Cross-Market Arbitrage of Copper Futures Based on Dynamic Slope Calculation – EMA Filtering – PPO Reinforcement Learning BDEIM '25
95.05%
Cumulative Return累积收益率
9.89%
Max Drawdown最大回撤
DOI 10.1145/3800000.3800226

Education教育背景

B.Eng. Financial Engineering (in progress) 金融工程学士 (在读) 2022 – Present至今
Minnan University of Science and Technology, School of Economics and Management · Shishi, Fujian, China 闽南理工学院 经济管理学院 · 福建石狮
Student Exchange — Mobility Programme 学生交流——流动项目 Jan – May 2026
Tunku Abdul Rahman University of Management and Technology (TAR UMT) · Kuala Lumpur, Malaysia 东姑阿都拉曼管理及工艺大学(TAR UMT)· 马来西亚吉隆坡

Conference Papers国际会议论文

[1] RAG-Enhanced LLM and RL Scheduling: Optimizing a Multi-Agent Framework for Abnormal Futures Price Monitoring ACM · ICCSMT 2025

Shuangyang Zheng, Yangchen Zhong, Xiaobao Que, Jingming Shen, Baiyu Chen, Weiqin Shen, Shibao Zheng, Bo Liu, Jin Gao

ICCSMT '25 · ACM · pp. 1747–1755 (9 pages) · Conf. dates:会议时间: 26–28 Dec 2025, Xiamen, China · Published:发布日期: 01 April 2026

DOI: 10.1145/3795154.3795432 Open Access
92%
Detection Accuracy检测准确率
7%
Hallucination Rate (↓ from 35%)幻觉率(↓ 从 35%)
Oral
Presentation at ICCSMT 2025ICCSMT 2025 口头报告
RAGLLMDRL Multi-AgentFutures Monitoring期货监测 FinTech RegTechHallucination Mitigation幻觉缓解
[2] Research on an Intelligent Algorithm for Cross-Market Arbitrage of Copper Futures Based on Dynamic Slope Calculation – EMA Filtering – PPO Reinforcement Learning ACM · BDEIM 2025

Shuangyang Zheng, Yangchen Zhong, Xiaobao Que, Jingming Shen, Baiyu Chen, Weiqin Shen, Shibao Zheng, Hengxin Deng, Jin Gao

BDEIM '25 · ACM · pp. 1477–1487 (11 pages) · Conf. dates:会议时间: 19–21 Dec 2025, Shanghai, China · Published:发布日期: 15 May 2026

DOI: 10.1145/3800000.3800226 Open Access
95.05%
Cumulative Return累积收益率
16.76%
Annualised Return年化收益率
9.89%
Max Drawdown最大回撤
PPO · RL Copper Futures铜期货 Cross-Market Arbitrage跨市场套利 EMA Filtering Dynamic Slope Calc.动态斜率计算 Quantitative Finance量化金融

Conference Oral Presentation国际会议口头报告

Title报告题目 Conference会议 Location地点 Date时间 Role角色
RAG-Enhanced LLM and RL Scheduling: Optimizing a Multi-Agent Framework for Abnormal Futures Price Monitoring ICCSMT '25 Xiamen, China厦门,中国 26–28 Dec 2025 Presenter (Oral)报告人(口头报告)
Certificate of Oral Presentation issued by the Committee of ICCSMT 2025 (December 2025). Affiliation: School of Economics and Management, Minnan University of Science and Technology. 口头报告证书由 ICCSMT 2025 会议委员会颁发(2025 年 12 月)。证书署名单位:闽南理工学院经济管理学院。

Invention Patents发明专利

[1] A High-Quality Intelligent Data Annotation Method一种高质量智能数据标注方法 Granted已授权
Patent No.专利号ZL 2025 1 1403494.2
Announcement No.公告号CN 120873192 B
Filing Date申请日2025.09.29
Grant Date授权公告日2026.01.13
Inventors发明人Shuangyang Zheng, Baiyu Chen, Yi Lin, Jingming Shen, Wenhui Huang, Yangchen Zhong, Yuhong Huang郑双阳、陈柏聿、林一、沈景铭、黄文辉、钟阳晨、黄宇宏
Patent Holder专利权人Minnan University of Science and Technology闽南理工学院
Core Method:核心方法: Input text is mapped to a d-dimensional semantic vector space. LLM-driven rule feature extraction constructs a rule feature vector. A mixed feature space (semantic + rule) is built; a similarity graph is constructed for label propagation. Uncertainty-based confidence scoring (vector evidence · rule evidence · LLM direct-judgment evidence) is fused to determine final labels — improving annotation accuracy from multiple representation angles. 输入文本映射为 d 维语义向量空间;LLM 驱动规则特征提取构建规则特征向量;语义+规则混合特征空间 → 构建相似图进行标签传播;不确定数学信度检验(向量证据·规则证据·LLM 直判证据三重融合)→ 最终标签决策。多角度表征有效提升标注准确性。
[2] A Method for Reducing Large Language Model Hallucinations一种降低大语言模型幻觉的方法 Granted已授权
Patent No.专利号ZL 2026 1 0018069.X
Announcement No.公告号CN 121480738 B
Filing Date申请日2026.01.08
Grant Date授权公告日2026.03.17
Inventors发明人Shuangyang Zheng, Baiyu Chen, Yi Lin, Jingming Shen, Wenhui Huang, Yuwen Wang, Xi Liang, Jin Gao, Weiqin Shen, Quanzhou Chen郑双阳、陈柏聿、林一、沈景铭、黄文辉、王煜文、梁曦、高谨、沈伟钦、陈权洲
Patent Holder专利权人Minnan University of Science and Technology闽南理工学院
Core Method:核心方法: LLM generates N structured reasoning candidates, each with a chain of reasoning steps carrying self-assessed confidence. Triple confidence evaluation: opposition check confidence · entailment check confidence · minimum step confidence → aggregated composite confidence. The candidate with highest composite confidence is selected as optimal; the weakest reasoning step undergoes global diagnostic analysis → targeted correction strategy → suffix reconstruction or rewrite → convergence judgement. Significantly enhances interpretability and auditability of LLM outputs. LLM 生成 N 个结构化候选推理链(每步携带前提陈述与自评置信度);三重置信度评估:对立检查置信度·蕴含检查置信度·最小步骤置信度 → 综合置信度聚合;选最优候选 → 全局推理链弱点诊断(根本原因·上游依赖·修正策略)→ 步骤修正与后缀重建 → 收敛判定。显著提升 LLM 输出的可解释性与可审计性。
[3] A Method for Eliminating Hallucinations in Legal Domain LLMs Based on Uncertainty Theory and Formal Logic Verification一种基于不确定理论与形式逻辑验证的法律领域大语言模型幻觉消除方法 Pending · 1st Inventor申请中 · 第一发明人
Status状态Patent Pending — Institution approved: 2026.05.19申请中 — 校内审批:2026.05.19
Lead Inventor第一发明人Baiyu Chen陈柏聿
Co-Inventors共同发明人 Jingming Shen · Kaixi Zhang (Tsinghua Univ., PhD) · Ruolong Ma (CAICT AI Institute) · Tingqing Ye (China Univ. of Geosciences Beijing) · Wentao Chen (CAICT, Deputy Director) · Wei Dai (CUFE) · Li Luo · Qisi Wang · Muchen Chen · Shuangyang Zheng 沈景铭 · 张开西(清华大学数学系博士)· 马若龙(中国信通院AI所)· 叶廷青(中国地质大学北京)· 陈文弢(中国信通院AI所副所长)· 戴韡(中央财经大学)· 罗立 · 汪启思 · 陈牧晨 · 郑双阳
Patent Holder专利权人Minnan University of Science and Technology闽南理工学院
Cross-Institutional Collaboration:跨机构合作: Co-inventors span Tsinghua University, CAICT/AIIA, Central University of Finance and Economics (CUFE), China University of Geosciences Beijing, and Fujian Normal University. 联合发明团队跨越清华大学、中国信通院(AIIA 主管机构)、中央财经大学、中国地质大学(北京)、福建师范大学。

Technical Standards Participation — AIIA, China 标准参与 — 中国人工智能产业发展联盟(AIIA)

# Standard Title标准名称 Standard No.标准编号 Issue Date发布时间 Role参与身份
1 Hallucination Evaluation Framework for Generative AI Model Applications 《生成式人工智能模型应用幻觉评估框架》 AIIA/T 0294-20262026.04 Principal Drafter (9th of drafters · inst. ranked 3rd) 主要起草人 (起草人排名第 9 · 单位排名第 3)
2 Security of Large Model Dev & Ops Platform, Part 3: Security Operations Management Capability Requirements 《大模型开发运营平台安全 第3部分:大模型平台安全运营管理能力要求》 AIIA/T 0296-20262026.05 Principal Drafter (10th of drafters · inst. ranked 11th) 主要起草人 (起草人排名第 10 · 单位排名第 11)
3 Security Classification Standard for AI Agent Applications 《AI智能体应用安全分级规范》 Pending待定 In Development制定中 Principal Drafter主要起草人
AIIA/T 0294-2026 · Hallucination Evaluation Framework for Generative AI Model Applications生成式人工智能模型应用幻觉评估框架 Published Apr 20262026.04 发布
Drafter Rank起草人排名 Baiyu Chen — 9th of principal drafters陈柏聿 — 主要起草人第 9 位
Institution Rank起草单位排名 Minnan University of Science and Technology — 3rd of drafting organisations闽南理工学院 — 起草单位第 3 位
Lead Organisation主要起草单位 China Academy of Information and Communications Technology (CAICT/CICT) · 40+ co-drafting organisations中国信息通信研究院(牵头)· 联合 40+ 家单位

Defines hallucination evaluation requirements for generative AI models (LLMs, multimodal models, AI agents) across 6 core dimensions: factual accuracy · source fidelity · internal consistency · instruction following · logical reasoning · honest uncertainty expression. 规定生成式 AI 模型(大语言模型、多模态模型、AI 智能体)幻觉评估的 6 大核心维度:事实准确性·来源忠实性·内部一致性·指令遵循能力·逻辑推理能力·不确定性诚实表达。

Key contribution: First proposed the concept of Execution Hallucination (执行幻觉) — a new hallucination type specific to tool-calling AI agents, where the model fabricates execution states, falsely reports task completion, or claims unauthorised capabilities. This fills a critical gap in AI safety evaluation for agentic systems. 核心贡献:首次提出执行幻觉(Execution Hallucination)概念——专门针对具备工具调用能力的 AI 智能体,指模型虚构执行状态、虚报任务完成状态或声称具备未被授权执行能力的幻觉现象,填补了 AI 智能体安全评估领域的重要空白。

AIIA/T 0296-2026 · LLM Dev & Ops Platform Security, Part 3: Security Operations Management Capability Requirements大模型开发运营平台安全 第3部分:大模型平台安全运营管理能力要求 Published May 20262026.05 发布
Drafter Rank起草人排名 Baiyu Chen — 10th of principal drafters陈柏聿 — 主要起草人第 10 位
Institution Rank起草单位排名 Minnan University of Science and Technology — 11th of drafting organisations闽南理工学院 — 起草单位第 11 位
Lead Organisation主要起草单位 China Academy of Information and Communications Technology (CAICT/CICT) · 10 co-drafting organisations中国信息通信研究院(牵头)· 联合 10 家单位
Specifies the core security operations management capabilities for LLM platforms across 3 modules: (1) Full lifecycle security management — deployment, upgrade & maintenance, operations, decommissioning; (2) Personnel & permission governance — role division, least-privilege access, operation audit & traceability; (3) Security monitoring — threat & situational awareness, vulnerability management, security analysis, emergency response. Part of the multi-series Security of Large Model Development and Operations Platform standards. 规定大模型平台安全运营管理的 3 大核心能力模块:①全生命周期安全管理——部署、升级维护、运维操作、平台下线各阶段规范;②人员与权限管理体系——角色划分、权限最小化、操作审计与追溯;③安全监测——安全态势感知、漏洞管理、安全研判分析、安全应急处置。属《大模型开发运营平台安全》系列标准第3部分。

Research Experience科研经历

Data Intelligence Collaborative Innovation Centre · Minnan University of Science and Technology
formerly Digital-Intelligent Economy Industrial Innovation Research Centre
数据智能协同创新中心 · 闽南理工学院
曾用名:数智经济产业创新研究中心
2024 – Present至今

Project Researcher / Principal Researcher · Contributed to RAG-enhanced LLM, multi-agent systems, RL applied to financial regulation, futures anomaly monitoring, arbitrage strategy, and A-share manipulation detection research. 项目研究员 / 首席研究员 · 参与 RAG 增强 LLM、多智能体系统、强化学习在金融监管中的应用,以及期货异常监测、跨市场套利策略、A 股市场操纵识别等研究。

Research Focus研究方向 Role角色
Generative AI Hallucination Governance & LLM Reliability Evaluation生成式 AI 幻觉治理与大语言模型可靠性评估Principal Researcher首席研究员
AI Safety & Large Model Platform GovernanceAI 安全与大模型平台治理Principal Researcher首席研究员
Financial Regulatory Technology (RegTech)金融监管科技(RegTech)Project Researcher项目研究员
Quantitative Investment & Financial Data Analytics量化投资与金融数据分析Project Researcher项目研究员
Multi-Source Heterogeneous Data Fusion & Intelligent Decision-Making多源异构数据融合与智能决策Project Researcher项目研究员

Academic Activities & Invited Presentations学术活动与受邀报告

AIIA Expert Workshop — Generative AI Hallucination & Reliability Evaluation Framework AIIA 专家研讨会——生成式 AI 幻觉与可靠性评估框架 2026.01.29
Organiser主办单位Artificial Intelligence Industry Alliance of China (AIIA)中国人工智能产业发展联盟(AIIA)
Venue地点7/F, Shouxiang Technology Building, Haidian District, Beijing北京市海淀区首享科技大厦 7 层
Role参与身份Invited Expert / Principal Researcher, Data Intelligence Collaborative Innovation Centre, Minnan University of Science and Technology专家 / 闽南理工学院数据智能协同创新中心首席研究员
Presentation分享主题Generative AI Hallucination Control Based on Quadruple Verification Mechanism and Mixture of Experts Model基于四重验证机制及混合专家模型的生成式 AI 幻觉控制原理及方案
AIIA Expert Workshop — Large Model Dev & Ops Platform Security Standards AIIA 专家研讨会——大模型开发运营平台安全系列标准 2026.03.18
Organiser主办单位Artificial Intelligence Industry Alliance of China (AIIA)中国人工智能产业发展联盟(AIIA)
Venue地点7/F, Shouxiang Technology Building, Haidian District, Beijing北京市海淀区首享科技大厦 7 层
Focus研讨方向LLM Platform Security · AI Safety Governance · Security Operations Management Requirements大模型平台安全 · AI 安全治理 · 平台安全运营管理能力要求

Work Experience & Applied Projects工作经历与应用项目

Xiamen Zhongxing Carbon Smart Technology Co., Ltd. · AI Engineering Intern 厦门中星碳智慧科技有限公司 · AI 工程实习生 Sep – Dec 2025 · Xiamen 2025.09–12 · 厦门

Independently led the end-to-end development of two core AI systems. Assessed as "Excellent" by on-site supervisor. 个人主导完成两大核心系统的全流程研发,获实习单位指导教师评定"实习成绩优秀"

Project 1: Automated Multi-Model Audio Annotation System项目一:自动化多模型音频标注系统

Sole developer. Playwright browser automation + parallel multi-model AI core. ASR accuracy >95%, classification >90%, data production cycle reduced >90%. 独立研发者。Playwright 浏览器自动化 + 多模型并行 AI 核心。ASR 准确率 >95%,分类准确率 >90%,数据生产周期缩短 >90%。

Project 2: Novel Information AI Auto-Annotation System项目二:小说信息 AI 自动标注系统

Lead developer (search engine module). Dual-layer expert agent pipeline. Human-review-friendly output with confidence scores and uncertainty flags. 主要开发者(搜索引擎模块)。双层专家智能体架构。系统输出包含可信度评分与疑问点提示,构建人机协同标注新范式。

Xiamen Qihuo Network Technology Co., Ltd. · Quantitative Trading Intern 厦门奇获网络科技有限公司 · 量化交易实习生 Dec 2025 · Xiamen 2025.12 · 厦门

Independently refactored a semi-automatic order-execution strategy into a fully automated system with volatility filtering and dynamic long/short signal logic. Conducted multi-contract backtesting and live simulated trading with systematic post-trade analysis. Gained deep insight into the gap between live-trading constraints and backtest results, and into the practical logic of human–AI collaboration within a quantitative team. 独立将半自动下单策略改造为融合波动率筛选与动态多空信号逻辑的全自动系统;进行多品种回测与模拟实盘交易,建立系统化复盘机制。深入理解实盘约束与策略回测结果之间的差异,以及量化团队中人机协同的实践逻辑。


Professional Certifications专业资格证书

Certification证书名称 Issuing Body颁发机构 Date时间 Status状态
Fund Practitioner Qualification – Securities Investment Fund Fundamentals基金从业人员资格——证券投资基金基础知识 AMAC2025.05.24 Passed合格
Fund Practitioner Qualification – Fund Laws, Regulations, Ethics & Business Standards基金从业人员资格——基金法律法规、职业道德与业务规范 AMAC2025.05.24 Passed合格

Awards & Honours竞赛获奖

# Award奖项 Competition / Event赛事 Level级别 Date时间 Team成员
1 Second Prize二等奖 1st National AI Application Innovation Competition (Fujian Regional)首届全国人工智能应用创新大赛(福建省赛区) Provincial省级 2025.06 Baiyu Chen, Yi Lin, Jingming Shen陈柏聿、林一、沈景铭
2 First Prize一等奖 2025 National College Student High-Tech Competition – Python Data Analysis Track2025 年全国大学生高新技术竞赛——Python 数据分析赛项 National全国 2025 Baiyu Chen陈柏聿
3 B-Level AwardB 级 6th China AI Competition – Large Model Hallucination Challenge第六届中国人工智能大赛——大模型幻觉挑战赛 National全国 2025.12 MUST Team闽南理工学院代表队
4 First Prize (Undergrad)本科组一等奖 2025 "Innovation Cup" College Student Big Data Challenge2025 年"创新杯"大学生大数据挑战赛 National全国 2025.08 Baiyu Chen, Yuxin Zhong, Xinye Huang陈柏聿、钟雨欣、黄新页
5 First Prize (Undergrad)本科组一等奖 2025 APMCM Asia-Pacific Mathematical Modelling Competition2025 年 APMCM 亚太地区大学生数学建模竞赛 International国际 2025.06 Jingming Shen, Baiyu Chen, Xinye Huang沈景铭、陈柏聿、黄新页
6 First Prize一等奖 12th Cross-Strait & HK-Macao Vocational Skills Competition – Futures Investment Simulation第十二届海峡两岸暨港澳大学生职业技能大赛——期货投资模拟赛 Cross-Strait两岸四地 2025 Baiyu Chen, Wenhui Huang, Minhong Wu陈柏聿、黄文辉、吴敏虹
Award #5 (APMCM) is an international competition covering Asia-Pacific universities. Award #6 covers universities across Mainland China, Taiwan, Hong Kong, and Macao. 第 5 项(APMCM)为覆盖亚太地区高校的国际级竞赛。第 6 项覆盖中国大陆、台湾、香港及澳门地区高校。

Selected Technical Highlights代表性竞赛技术亮点

Provincial · Second Prize省级二等奖 #1 · 1st National AI Application Innovation Competition — Multi-Source Stock Investment Strategy Agent #1 · 首届全国人工智能应用创新大赛——面向多源异构数据的上市公司股票投资策略分析 Deployed已上线

Designed and deployed a full-stack personalised stock-investment advisory Agent combining Computer Science × Finance × Psychology. Quantifies behavioural biases (overconfidence, loss aversion) as real-time emotional signals. Data sources: Sina Finance API · Eastmoney · Weibo sentiment · Baidu AI Search MCP · CSMAR database. Deployed on Baidu Qianfan AppBuilder as a production AI-native app. 设计并上线全栈个性化股票投资策略智能体,融合计算机科学 × 金融工程 × 心理学三学科视角,将过度自信、损失厌恶等行为偏差量化为实时情绪信号。数据源:新浪财经 API · 东方财富 · 微博舆情 · 百度 AI 搜索 MCP · 国泰安数据库。已在百度智能云千帆 AppBuilder 上线投入使用。

4-Dimension Analysis Framework → MoE四维分析框架 → 混合专家模型(MoE)

Fundamentals基本面 Sentiment情绪分析 News / Macro新闻宏观 Technical技术指标

All four streams fed into a unified feature pipeline → MoE model → dynamic-weight recommendation.四路数据统一进入特征工程管道 → MoE 模型 → 动态权重综合推荐。

6-Dimension User Profile → Personalised Strategy6 维度用户画像 → 个性化策略匹配

Risk Tolerance风险承受力 Experience投资经验 Goals · Horizon目标 · 期限 Style · Capital风格 · 资金

Conservative → low-volatility blue chips; Aggressive → high-growth sector stocks. Macro → industry → stock 3-tier analysis.保守型 → 低波动蓝筹;激进型 → 高成长赛道。宏观 → 行业 → 个股三层分析链路。

MoE · RAG User Profiling用户画像 Sentiment Analysis情绪分析 Behavioural Finance行为金融 multilingual-embedding Baidu AppBuilder千帆 AppBuilder CS × Finance × Psychology计算机×金融×心理学
National · First Prize全国一等奖 #2 · Python Data Analysis — B2B Sales Multi-Dimensional Analysis & Optimisation Model #2 · Python 数据分析赛项——基于 B2B 销售数据的多维度分析与优化模型研究 Solo个人参赛

Constructed a comprehensive quantitative analysis framework on B2B software-sales data, covering sales funnel efficiency, opportunity prioritisation, customer lifetime value, and win-rate prediction. 基于 B2B 软件服务公司销售数据,构建涵盖销售漏斗效率、商机优先级评估、客户生命周期价值与赢单概率预测的综合定量分析框架。

58.1%
Key funnel bottleneck (Intent→Commit)关键漏斗瓶颈转化率(意向→投入)
0.1506
Decision-maker support — top win-rate driver决策者支持度 — 赢单率首要影响因子
7.03%
High-value customers → 13.84% of total value高价值客户占比 → 贡献 13.84% 总价值
Sales Funnel Analysis销售漏斗分析 Customer Lifetime Value客户生命周期价值 3D Scoring Priority Model三维评分优先级模型 SHAP · Feature Importance
National · B-Level全国 B 级 #3 · 6th China AI Competition — Large Model Hallucination Challenge #3 · 第六届中国人工智能大赛——大模型幻觉挑战赛 Phase 7

Designed and implemented a competitive LLM hallucination detection & mitigation system through 7 major architectural iterations. Core innovation: replaced 5 domain experts with 12 task-type experts (translation · code generation · knowledge QA · reasoning · problem solving, etc.) plus a meta-expert router — enabling finer-grained, task-aware hallucination control. Dual API failover: DeepSeek (primary) + Qwen3-480B (backup). 经过 7 次主要架构迭代,设计并实现竞赛级 LLM 幻觉检测与缓解系统。核心创新:将 5 个领域专家替换为 12 个任务型专家(翻译·代码生成·知识问答·逻辑推理·问题求解等)+ 元专家路由器,实现更细粒度的任务感知幻觉控制。双 API 容错:DeepSeek(主)+ Qwen3-480B(备)。

Tiered Hallucination Detection (per task)按任务类型的分级幻觉检测策略

FORMAT SYNTAX FAITHFULNESS LOGICAL FACTUAL FULL

Lightweight tasks (translation, extraction) use FORMAT-only; high-uncertainty tasks (knowledge QA) activate FULL pipeline with second-model verification — reducing unnecessary compute.轻量任务(翻译、信息抽取)仅走 FORMAT;高不确定性任务(知识问答)激活 FULL 流程并启用第二模型验证,显著节省算力。

Task-Adaptive DeepConf Multi-Round Budgets任务自适应 DeepConf 多轮预算

Translation / Extraction翻译 / 信息抽取1 round
Classification / Computation分类 / 计算2 rounds
Reasoning / Problem Solving逻辑推理 / 问题求解3–4 rounds
Knowledge QA (highest uncertainty)知识问答(最高不确定性)5 rounds
12 Task-Type Experts12 任务型专家 Meta-Expert Router元专家路由 Tiered Hallucination Detection分级幻觉检测 Multi-Trace Consensus多轨迹共识分析 DeepSeek · Qwen3-480B Dual API Failover双 API 容错
National · First Prize全国一等奖(本科组) #4 · "Innovation Cup" Big Data — Road Maintenance Prediction & Intelligent Decision System #4 · "创新杯"大数据挑战赛——道路路面维护预测与智能决策系统 Team Leader队长

Built a complete "prediction → feature fusion → optimisation → Agent" pipeline on ~100,000 road segment records. Stage 1 Prediction: TabNet (sequential attention, 99.99%) + custom DRGANDRNELM / LTCNBLS / TCAETCNResNet hybrid (99.85%) + DNN+SHAP. Stage 2 Feature Fusion: RLHF-based dual-weight fusion reconciles conflicting TabNet and SHAP importance rankings by simulating human preference — producing a more robust feature priority list. Stage 3 Optimisation: quantum annealing → HCAOA (budget-aware initialisation + adaptive threshold + multi-strategy local search). Stage 4 Agent: RAG knowledge base (road history data) + full-stack LLM chatbot for real-time maintenance advisory. 在约 10 万条路段记录上构建"预测 → 特征融合 → 优化 → 智能体"完整流水线。预测层:TabNet(顺序注意力机制,99.99%)+ 定制 DRGANDRNELM/LTCNBLS/TCAETCNResNet 混合架构(99.85%)+ DNN+SHAP。特征融合层:RLHF 双权重融合机制——模拟人类偏好,调和 TabNet 与 SHAP 特征重要性冲突,输出更鲁棒的特征优先级排名。优化层:量子退火 → HCAOA(预算感知初始化 + 自适应阈值 + 多策略局部搜索)。智能体层:RAG 路面历史知识库 + 全栈 LLM 实时维护顾问 Agent。

99.99%
TabNet accuracyTabNet 准确率
99.85%
Hybrid arch. accuracy混合架构准确率
3.23×
HCAOA vs quantum annealing (1.28×)HCAOA 成本效益比(量子退火 1.28×)
100%
Budget utilisation预算利用率
TabNet · SHAP RLHF Dual-Weight Fusion双权重融合 Quantum Annealing量子退火 HCAOA RAG · Agent NP-hard Combinatorial Opt.组合优化
International · First Prize国际一等奖(本科组) #5 · APMCM Asia-Pacific Mathematical Modelling — Kidney Stone Risk Prediction #5 · APMCM 亚太数学建模竞赛——基于集成学习的肾结石风险预测

Constructed a multi-layer ensemble prediction framework on physical-examination biochemical data. Identified key urinary biomarkers (specific gravity, pH, osmolality, etc.) via modified Z-score outlier detection, Spearman rank correlation, and SHAP analysis. Compared four algorithms — automated deep forest, Bayesian network, Gaussian process classifier, and TabNet — with cross-validation optimisation. 基于体检尿液生化指标数据,构建多层次集成预测框架。通过修正 Z-Score 异常值处理、Spearman 秩相关与 SHAP 值分析,识别与肾结石高度相关的关键生理指标(尿比重、pH、渗透压等)。比较自动化深度森林、贝叶斯网络、高斯过程分类器与 TabNet 四种算法,交叉验证优化模型参数。

≥0.70
Mean F1 score across models各模型平均 F1 值
15
Key predictive features identified (TabNet)TabNet 自动识别的关键预测特征数
TabNet Deep Forest深度森林 Bayesian Network贝叶斯网络 SHAP · Spearman Medical AI医疗 AI
Cross-Strait · First Prize两岸四地一等奖 #6 · Cross-Strait Futures Investment Simulation — DeepSeek Agent-Driven Trading Strategy #6 · 海峡两岸期货投资模拟赛——DeepSeek 智能体驱动的量化期货交易策略 Team Leader队长

Designed and led implementation of a DeepSeek LLM-driven futures trading strategy. Market state is formatted as structured prompts integrating multi-dimensional technical indicators (RSI, MACD, Bollinger Bands + novel academic indicators GVR, GDMA, DoMA) across three timeframes (30 min / 1 h / 4 h); DeepSeek generates trade direction, confidence, target price, and stop-loss. Strict 1:2 risk-reward ratio with multi-layer risk controls (max position 25%, min holding time, frequency limits). 主导设计并实现 DeepSeek 大语言模型驱动的期货交易策略。将多维度技术指标(RSI、MACD、布林带 + 前沿学术指标 GVR、GDMA、DoMA)整合为结构化提示词,跨三个时间周期(30 分钟 / 1 小时 / 4 小时)验证后输入 DeepSeek,由模型生成含方向、置信度、目标价与止损价的决策信号。严格执行 1:2 风险收益比,配备多层次风控机制(最大仓位 25%、最小持仓时间、交易频率限制)。

229.41%
Backtest cumulative return回测累计收益率
1402%
Annualised return年化收益率
23
Sharpe Ratio夏普比率
DeepSeek LLM Prompt Engineering提示词工程 Multi-Timeframe Analysis多周期技术分析 RSI · MACD · GVR · GDMA Risk Management风险控制 Quantitative Trading量化交易
Technical Skills 技术能力

LLM Hallucination Research & AI Governance大语言模型幻觉研究与 AI 治理

Hallucination Evaluation幻觉评估 Hallucination Mitigation幻觉缓解 Tiered Hallucination Detection分级幻觉检测 Multi-Trace Consensus Analysis多轨迹共识分析 Structured Confidence Reasoning结构化置信度推理 RAG Multi-Agent Systems多智能体系统 Prompt Engineering LLM EvaluationLLM 评估 MoE LoRA Fine-Tuning微调 AI Safety Standards (AIIA)AI 安全标准(AIIA)

Machine Learning & Data Science机器学习与数据科学

PyTorch TabNet XGBoost SHAP Ensemble Learning集成学习 RLHF DRL · PPO Feature Engineering特征工程 Time-Series Analysis时间序列分析 Semi-Supervised Learning半监督学习

Engineering & Implementation工程实现

Python (Async) Playwright Automation自动化 LLM Agent DevelopmentLLM Agent 开发 Multi-Model AI Pipeline多模型 AI 流水线 Quantitative Strategy Dev量化策略开发 RegTech · FinTech