AR
Andrew Richardson 10 months ago
Relatable.
EL
Evan Liu 11 months ago
Me too
EG
Eric Gu last year
👍
AF
Abe Fetterman last year (edited)
The seventh planet is Uranus, interesting if it got this wrong because having ca
AR
Andrew Richardson last year
I think they’re using a relatively small model that’s just not very knowledgeabl
MR
Michael Rosenthal last year
did anything they suggested seem like it’d make a truly substantial impact?
EL
Evan Liu last year
Wow, last paper doc was 4 onths ago?
AR
Andrew Richardson last year
We had a bunch of outside speakers come in and we didn’t use the doc. I’m glad w
KQ
Kanjun Qiu 2 years ago (edited)
I’m also curious, if we’re in this regime, what would be causing the model to be
EL
Evan Liu 2 years ago
+1
BF
Bryden Fogelman 2 years ago
🙂
EK
Ellie Kitanidis 2 years ago
at a certain level of abstraction, no, not hard at all lol
BO
Bas van Opheusden 2 years ago
Yes
- Imbue Paper Party Questions
- Norms:
- 2024/10/04
- 2024/09/27
- 2024/09/13
- 2024/09/06
- Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
- Questions
- (Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
- Questions
- 8/30/2024
- 8/23/2024
- Stream of Search (SoS): Learning to Search in Language
- 8/16/2024
- 8/9/2024
- 8/2/2024
- 07/26/2024
- The Llama3 Herd of Models
- 07/19/2024
- Prover-verifier games improved legibility LLM outputs
- 07/12/2024
- V-STaR: Training Verifiers for Self-Taught Reasoners
- 06/28/2024
- 06/14/2024
- 05/31/2024
- 05/17/2024
- STaR: https://arxiv.org/pdf/2203.14465
- Quiet-STaR: https://arxiv.org/pdf/2403.09629
- 05/03/2024
- April 19, 2024
- April 12, 2024
- March 29, 2024
- March 22, 2024
- March 15, 2024
- Mar 1, 2024
- Feb 23, 2024
- Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
- Feb 16, 2024
- Nov 10, 2023
- Nov 03, 2023
- Questions
- Oct 27, 2023
- Questions
- Oct 20, 2023
- Questions
- Oct 13, 2023
- Oct 6, 2023
- Questions
- Subsequent discussion:
- Sep 29, 2023
- Questions
- Sep 23, 2023
- Faith and Fate: Limits of Transformers on Compositionality
- Questions
- Notes
- Sep 15, 2023
- Questions
- Sep 8, 2023
- Questions
- Sep 1, 2023
- Questions
- Aug 25, 2023
- Questions
- Aug 11, 2023
- Questions:
- Questions:
- August 4, 2023
- July 28, 2023
- Questions:
- July 21, 2023
- Questions:
- July 14, 2023
- July 7, 2023
- June 30, 2023
- Questions Block-Recurrent:
- Questions RWKV:
- June 23, 2023
- Questions:
- May 19, 2023
- May 12, 2023
- Questions:
- May 5, 2023
- Questions:
- Ideas
- April 14, 2023
- Questions:
- Mar 31, 2023
- Questions:
- Mar 24, 2023
- Questions:
- Mar 17, 2023
- Questions:
- Mar 10, 2023
- Mar 3, 2023
- Summary:
- Questions:
- Feb 24, 2023
- Questions:
- Feb 17, 2023
- Toolformer
- Questions
- Feb 10, 2023
- Constitutional AI: Harmlessness from AI Feedback
- Jan 6, 2023
- RETRO: Improving language models by retrieving from trillions of tokens
- Nov 18, 2022
- Temporally Consistent Video Transformer for Long-Term Video Prediction
- Oct 14, 2022
- Toy Models of Superposition
- July 21, 2022
- BYOL-Explore
- July 8, 2022
- DayDreamer
- Deep Hierarchical Planning from Pixels (Director)
- July 1, 2022
- BIG-bench
- March 4, 2022
- Questions
- World model
- Explorer
- Achiever
- January 14, 2022
- Questions
- December 3, 2021
- Questions
- November 20, 2021
- Questions
- October 28, 2021
- Questions
- October 14, 2021
- Questions
- September 30, 2021
- Summary
- Questions
- September 16, 2021
- Summary
- Questions
- June 10, 2021
- Summary
- Questions
- May 27, 2021
- HiPPO: Recurrent Memory with Optimal Polynomial Projections
- Summary
- Questions
- April 1, 2021
- Neuroevolution of Self-Interpretable Agents
- Questions
- Mar 25, 2021
- Evaluating representations by the complexity of learning low-loss predictors
- Questions
- Mar 11, 2021
- Space-Time Correspondence as a Contrastive Random Walk
- Questions
- Mar 4, 2021
- Questions
- Feb 25, 2021
- Network-to-Network Translation with Conditional Invertible Neural Networks
- Questions:
- Feb 18, 2021
- Probabilistic Future Prediction for Video Scene Understanding
- Questions:
- Notes:
- Space-Time Correspondence as a Contrastive Random Walk
- Questions:
2024/10/04
2024/09/27
2024/09/13
2024/09/06
Questions
Questions
8/30/2024
8/23/2024
8/16/2024
8/9/2024
8/2/2024
07/26/2024
evals(actually this is for generating finetuning data). Am I reading this right?FP8?jk that’s just for inference07/19/2024
07/12/2024
06/28/2024
its answerthe proposed prompt, rather than the entireanswerproposed prompt at once?06/14/2024
EvanVincent means specifically the code eval sets we’re working on right now?If I’m understanding you properly — it seems like the agent should be able to use the command line — it feels like using the command line is a special case of“runcode”, and the agent should be able to implement but also run code.05/31/2024
05/17/2024
05/03/2024
(didthey train it?)yes they didApril 19, 2024
April 12, 2024
March 29, 2024
March 22, 2024
March 15, 2024
obviousunsurprising to me honestly. It just upweights the value of all “honest” semantics regardless of reason or w/e. Maybe a combined PCA could get more nuanced behavior.Mar 1, 2024
Feb 23, 2024
Feb 16, 2024
Nov 10, 2023
Nov 03, 2023
Oct 27, 2023
Oct 20, 2023
Oct 13, 2023
Oct 6, 2023
Sep 29, 2023
Sep 23, 2023
Sep 15, 2023
Terrible, most likelySep 8, 2023
Sep 1, 2023
Aug 25, 2023
Aug 11, 2023
August 4, 2023
July 28, 2023
July 21, 2023
July 14, 2023
July 7, 2023
June 30, 2023
June 23, 2023
May 19, 2023
May 12, 2023
May 5, 2023
April 14, 2023
Mar 31, 2023
Mar 24, 2023
Mar 17, 2023
Mar 10, 2023
Mar 3, 2023
Feb 24, 2023
Feb 17, 2023
Toolformer
Feb 10, 2023
Constitutional AI: Harmlessness from AI Feedback
Jan 6, 2023
RETRO: Improving language models by retrieving from trillions of tokens
Nov 18, 2022
Temporally Consistent Video Transformer for Long-Term Video Prediction
Oct 14, 2022
Toy Models of Superposition
July 21, 2022
BYOL-Explore
July 8, 2022
DayDreamer
Deep Hierarchical Planning from Pixels (Director)
July 1, 2022
BIG-bench
March 4, 2022
Questions
January 14, 2022
December 3, 2021
November 20, 2021
October 28, 2021
October 14, 2021
September 30, 2021
September 16, 2021
June 10, 2021
May 27, 2021
HiPPO: Recurrent Memory with Optimal Polynomial Projections
April 1, 2021
Neuroevolution of Self-Interpretable Agents
What did they end up feeding into their network after selecting patches? Just the raw pixels or was it just the position of the patches?Mar 25, 2021
Evaluating representations by the complexity of learning low-loss predictors
Mar 11, 2021
Space-Time Correspondence as a Contrastive Random Walk
Would this work if the video had a blank frame?Mar 4, 2021
Feb 25, 2021
Network-to-Network Translation with Conditional Invertible Neural Networks
Feb 18, 2021
Probabilistic Future Prediction for Video Scene Understanding
Space-Time Correspondence as a Contrastive Random Walk