renee-jia/README.md

Hey there, I'm Renee 👋

AI Research Engineer @ Meta

I work on AI systems, from ranking and recommendation to LLMs and agents. I like thinking about how models behave in messy, real-world environments. Previously at Google and Amazon Alexa AI. Visiting Scholar at Harvard and the University of Waterloo.


📨 Connect with me


✏️ Writing

I write about things I'm learning and researching. Here's what I've been covering on my blog:

πŸ›‘οΈ AI Safety & Alignment β€” How reward hacking evolved from classical RL specification gaming to jailbreaks and deceptive alignment in LLMs. What it means for RLHF and building systems we can trust.

🧠 LLM Reasoning β€” What "reasoning" actually means in the context of large language models, grounded in research from chain-of-thought prompting to inference-time compute scaling.

🌐 Browser Agents & Goal Fidelity β€” Why the web is an adversarial environment for agents, and why being capable is not the same as being hard to manipulate.

🎯 Ranking & Recommendation Systems β€” A deep-dive series covering the full evolution: from foundational collaborative filtering, through the deep learning era, to modern sequential learning and long user history modeling in ads systems.


🌄 Beyond the code

When I'm not thinking about AI:

πŸ‚ Snowboarding β€” PSIA-AASI Level 1 certified instructor

♠️ Tournament Poker β€” Part-time player who loves the intersection of game theory, math, and psychology. Check out my results on Hendon Mob.


📊 GitHub Stats

Pinned

  1. latent-feed (Public)

    Automated AI news aggregation that feeds directly into your Obsidian vault. Stay current on the latest LLM research, trending repos, and breakthroughs, curated daily.

    Python · 109 stars · 18 forks

  2. alpha-agent (Public)

    An AI-driven multi-agent trading platform for options trading and stock-trend analysis. The project combines machine learning, real-time market data, and a modular multi-agent framework.

    Python · 19 stars · 2 forks