• Ideas

    Ian and the Limits of Rationality - Issue 107: The Edge - Nautilus

    Setting: Chesterfield High, an unusual school in the suburbs of Ohio.The teacher writes on the board:2, 3, 5, 7, ...How, he asks,…

    File not found

    Professors are struggling to teach Gen Z

    Implementing Code Review Feedback: To Squash or Not to Squash?

    Should code review feedback be addressed by appending commits or rebasing changes? Combine both approaches with fixup commits!

    Mathematician Answers Chess Problem About Attacking Queens | Quanta Magazine

    The n-queens problem is about finding how many different ways queens can be placed on a chessboard so that none attack each other. A mathematician has now all but solved it.

    How truthful is GPT-3? A benchmark for language models - AI Alignment Forum

    This is an edited excerpt of a new ML paper (pdf, code) by Stephanie Lin (FHI Oxford), Jacob Hilton (OpenAI) and Owain Evans (FHI Oxford). The paper is under review at NeurIPS. TITLE: TRUTHFULQA: MEASURING HOW MODELS MIMIC HUMAN FALSEHOODS ABSTRACT We propose a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics (see Figure 1). We crafted questio...

    Code Review

    One of the most important practices every engineering team should have

    • Publications

    Unmatched: Repairing the U.S. Medical Residency Pipeline

    The U.S. medical residency pipeline is broken and in dire need of reform.