Skip to content
View erenup's full-sized avatar

Highlights

  • Pro

Block or report erenup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
erenup/README.md

Hi, I'm @erenup πŸ‘‹

πŸš€ LLM & AGENT & NLP & Deep Learning Researcher

Scale agents (model+harness) and go beyond the imagination.

πŸ† Notable Contributions

πŸ€– AI Agents & Agentic Search

  • verl-tool (980+ ⭐) - Core design contributor | Unified RL framework for tool-calling agents
  • OpenResearcher (750+ ⭐) - Core contributor | Fully open pipeline for long-horizon deep research
  • ClawBench (320+ ⭐) - Project Lead/Advisor | Browser AI agent benchmark across 144 live websites
  • BrowseComp-Plus (270+ ⭐) - Core contributor | Fair evaluation for deep-research agents | ACL 2026
  • DCI-Agent-Lite (270+ ⭐) - Project Lead/Advisor | Agentic search via direct corpus interaction
  • BrowserAgent (33 ⭐) - Core contributor | Web agents with human-inspired browsing | TMLR 2025
  • SWE-Next - Core contributor | Scalable real-world software engineering tasks for agents

πŸ’» Code LLMs & Reasoning

  • AceCoder (101 ⭐) - Core contributor | Coder RL via automated test-case synthesis | ACL 2025
  • ScholarCopilot (250+ ⭐) - Core contributor | Academic writing with accurate citations | COLM 2025
  • One-Shot-CFT (34 ⭐) - Core contributor | Unleashing reasoning via critique fine-tuning | EMNLP 2025
  • StructEval (23 ⭐) - Core contributor | Benchmarking LLMs' structured output | TMLR 2025
  • SWE-QA-Pro - Core contributor | Repository-level code understanding | ACL 2026
  • EvolveCoder - Core contributor | Adversarial test-case evolution for code RL

🎬 Multimodal & Video

  • EditReward (146 ⭐) - Core contributor | Reward model for image editing | ICLR 2026
  • Context-Forcing (85 ⭐) - Core contributor | Consistent autoregressive video generation | ICML 2026
  • VideoScore2 (46 ⭐) - Core contributor | Think before you score in video evaluation
  • VisualWebInstruct (40 ⭐) - Core contributor | Scaling multimodal instruction data | EMNLP 2025
  • ImagenWorld (33 ⭐) - Core contributor | Stress-testing image generation | ICLR 2026
  • VisCoder (20 ⭐) - Core contributor | LLMs for Python visualization code | EMNLP 2025

πŸ“š Education & Open Source

πŸ“Š Stats

49 publications | 1,040+ citations | h-index 16 | Google Scholar

NeurIPS (Spotlight) Β· ACL Β· EMNLP Β· SIGIR Β· ICLR Β· ICML Β· COLM Β· TMLR

Research supported by OpenAI, Google Cloud, HuggingFace, Lambda AI


Making state-of-the-art NLP and LLM accessible

Pinned Loading

  1. datawhalechina/learn-nlp-with-transformers datawhalechina/learn-nlp-with-transformers Public

    we want to create a repo to illustrate usage of transformers in chinese

    Shell 3.2k 511

  2. huggingface/transformers huggingface/transformers Public

    πŸ€— Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python 161k 33.3k

  3. TIGER-AI-Lab/verl-tool TIGER-AI-Lab/verl-tool Public

    A version of verl to support diverse tool use

    Python 982 81

  4. texttron/BrowseComp-Plus texttron/BrowseComp-Plus Public

    BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)

    Python 277 46

  5. TIGER-AI-Lab/OpenResearcher TIGER-AI-Lab/OpenResearcher Public

    OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

    Python 758 74

  6. NVIDIA/TensorRT-LLM NVIDIA/TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

    Python 13.7k 2.4k