Skip to content
View TinderZ's full-sized avatar
🛸
at sea
🛸
at sea
  • 22:14 (UTC +08:00)

Highlights

  • Pro

Block or report TinderZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
TinderZ/README.md

TINDERZ-Lab Logo

👋 Hi, I’m TinderZ      wakatime Python Visitors



✨ Research Focus ✨

🔍 Research Area 📖 Description
🧠 LLM Reasoning Exploring the mechanisms and boundaries of LLMs in complex logical reasoning, covering areas such as Test-time Scaling and RL4LLM.
⚙️ Auxiliary Policy Model Integrating lightweight auxiliary policy models, trained via Reinforcement Learning, into LLM architectures to enhance model capabilities.
🤖 MARL Focusing on multi-agent system, including both LLM-based agents and traditional agents, to solve coordination and game-theoretic equilibrium.

Let's Connect! 👀

  • I'm always open to collaboration and discussion on these topics.
  • Email:b23042510@njupt.edu.cn
  • Feel free to reach out if you share similar interests or have exciting projects in mind!

Pinned Loading

  1. LLM-Tracker LLM-Tracker Public

    A tracker for the latest updates on Large Language Models from various companies. (Keep updating~)

    Python 4

  2. MathModelingTraining MathModelingTraining Public

    这是一个数学建模竞赛训练题目的集合仓库,主要包含各种数学建模竞赛的训练或实战题目、代码和解决方案。持续更新中...🚀

    Jupyter Notebook 6

  3. AwesomePrompt AwesomePrompt Public

    a repository of prompts4llms (code is cheap, show me your prompt 👀)

    4