Unsorted 22 - Burny

Robotics: [Mobile ALOHA - A Smart Home Robot - Compilation of Autonomous Skills - YouTube](<https://www.youtube.com/watch?v=zMNumQ45pJ8>), [Eureka! Extreme Robot Dexterity with LLMs | NVIDIA Research Paper - YouTube](<https://youtu.be/sDFAWnrCqKc?si=LEhIqEIeHCuQ0W2p>), [Shaping the future of advanced robotics - Google DeepMind](<https://deepmind.google/discover/blog/shaping-the-future-of-advanced-robotics/>), [Optimus - Gen 2 - YouTube](<https://www.youtube.com/watch?v=cpraXaw7dyc>), [Atlas Struts - YouTube](<https://www.youtube.com/shorts/SFKM-Rxiqzg>), [Figure Status Update - AI Trained Coffee Demo - YouTube](<https://www.youtube.com/watch?v=Q5MKo7Idsok>), [Curiosity-Driven Learning of Joint Locomotion and Manipulation Tasks - YouTube](<https://www.youtube.com/watch?v=Qob2k_ldLuw>)

Agency: [[2305.16291] Voyager: An Open-Ended Embodied Agent with Large Language Models](<https://arxiv.org/abs/2305.16291>), [[2309.07864] The Rise and Potential of Large Language Model Based Agents: A Survey](<https://arxiv.org/abs/2309.07864>), [Agents | Langchain](<https://python.langchain.com/docs/modules/agents/>), [GitHub - THUDM/AgentBench: A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)](<https://github.com/THUDM/AgentBench>), [[2401.12917] Active Inference as a Model of Agency](<https://arxiv.org/abs/2401.12917>), [CAN AI THINK ON ITS OWN? - YouTube](<https://www.youtube.com/watch?v=zMDSMqtjays>), [Artificial Curiosity Since 1990](<https://people.idsia.ch/~juergen/artificial-curiosity-since-1990.html>)

Generalizing: [[2402.10891] Instruction Diversity Drives Generalization To Unseen Tasks](<https://arxiv.org/abs/2402.10891>), [Automated discovery of algorithms from data | Nature Computational Science](<https://www.nature.com/articles/s43588-024-00593-9>), [[2402.09371] Transformers Can Achieve Length Generalization But Not Robustly](<https://arxiv.org/abs/2402.09371>), [[2310.16028] What Algorithms can Transformers Learn? A Study in Length Generalization](<https://arxiv.org/abs/2310.16028>), [[2307.04721] Large Language Models as General Pattern Machines](<https://arxiv.org/abs/2307.04721>), [A Tutorial on Domain Generalization | Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining](<https://dl.acm.org/doi/10.1145/3539597.3572722>), [[2311.06545] Understanding Generalization via Set Theory](<https://arxiv.org/abs/2311.06545>), [[2310.08661] Counting and Algorithmic Generalization with Transformers](<https://arxiv.org/abs/2310.08661>), [Neural Networks on the Brink of Universal Prediction with DeepMind’s Cutting-Edge Approach | Synced](<https://syncedreview.com/2024/01/31/neural-networks-on-the-brink-of-universal-prediction-with-deepminds-cutting-edge-approach/>), [[2401.14953] Learning Universal Predictors](<https://arxiv.org/abs/2401.14953>), [Spectral bias and task-model alignment explain generalization in kernel regression and infinitely wide neural networks | Nature Communications](<https://www.nature.com/articles/s41467-021-23103-1>)

AGI definitions: [[2311.02462] Levels of AGI: Operationalizing Progress on the Path to AGI](https://arxiv.org/abs/2311.02462) https://twitter.com/IntuitMachine/status/1721845203030470956
[[0712.3329] Universal Intelligence: A Definition of Machine Intelligence](https://arxiv.org/abs/0712.3329) https://twitter.com/xiao_ted/status/1761865996716114412 long token context windows The attention schema theory (AST) of consciousness (or subjective awareness) is a neuroscientific and evolutionary theory of consciousness. It proposes that brains construct subjective awareness as a schematic model of the process of attention. Attention schema is like the body schema. Just like the brain constructs a simplified model of the body to help monitor and control movements of the body, so the brain constructs a simplified model of attention to help monitor and control attention. [ALL OF PHYSICS explained in 14 minutes - YouTube](https://www.youtube.com/watch?v=ZAqIoDhornk) Evidence has been found that generative image models have representations of these scene characteristics: surface normals, depth, albedo, and shading https://twitter.com/anand_bhattad/status/1730230190159135175 [[2311.17137] Generative Models: What do they know? Do they know things? Let's find out!](https://arxiv.org/abs/2311.17137) [Andrew Jardine on LinkedIn: #llms | 15 comments](https://www.linkedin.com/posts/andrew-iain-jardine_llms-activity-7165730371215589376-ltdU?utm_source=share&utm_medium=member_android) i see one more in the reddit thread https://onlinelibrary.wiley.com/doi/full/10.1002/ajpa.24216 [Reddit - Dive into anything](https://www.reddit.com/r/AskSocialScience/s/xwW57E0Nip) 15.2. 
AI progress [Imgur: The magic of the Internet](https://imgur.com/2RIZJLm) [Sora](https://openai.com/sora) [Reddit - Dive into anything](https://www.reddit.com/r/singularity/s/PihQuJGjok) [Demystifying Intelligence: From Brains to Machines](https://g.co/gemini/share/18e305d43b61) [Quanta Magazine](https://www.quantamagazine.org/wormhole-experiment-called-into-question-20230323/) [Reddit - Dive into anything](https://www.reddit.com/r/blueprint_/s/FXNPQbVLH1) [[2402.08268] World Model on Million-Length Video And Language With RingAttention](https://arxiv.org/abs/2402.08268) [Wanja Wiese, Artificial consciousness: A perspective from the free energy principle - PhilPapers](https://philpapers.org/rec/WIECLL) [[2402.08871] Position Paper: Challenges and Opportunities in Topological Deep Learning](https://arxiv.org/abs/2402.08871) [China Says It Plans to Mass-Produce Humanoid Robots Within 2 Years](https://www.businessinsider.com/china-plans-mass-production-humanoid-robots-within-two-years-2023-11)

Sources of AGI capabilities [Imgur: The magic of the Internet](https://imgur.com/ehrz0q8) [DeepMind’s New AI Beats Billion Dollar Systems - For Free! 
- YouTube](https://www.youtube.com/watch?v=BufUW7h9TB8) [Reddit - Dive into anything](https://www.reddit.com/r/singularity/s/PihQuJGjok) [Quanta Magazine](https://www.quantamagazine.org/physicists-discover-exotic-patterns-of-synchronization-20190404/) [Quanta Magazine](https://www.quantamagazine.org/what-your-brain-is-doing-when-youre-not-doing-anything-20240205/) [[2402.03268] Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation](https://arxiv.org/abs/2402.03268) [Nvidia Uses AI to Produce Its AI Chips Faster](https://www.businessinsider.com/nvidia-uses-ai-to-produce-its-ai-chips-faster-2024-2?utm_source=reddit.com)

I think the Active Inference architecture has a lot of potential and includes very useful components that current AI systems lack. It's interesting to see people trying to build planning, collective, curious agents out of current LLMs through all sorts of hacks, with varying degrees of success [2024 is the Year of the AI AGENT - YouTube](https://www.youtube.com/watch?v=hmt5MnStKUI), but having these capabilities hardcoded into the architecture would most likely be much more efficient, in the sense discussed here: [CAN AI THINK ON ITS OWN? - YouTube](https://www.youtube.com/watch?v=zMDSMqtjays) Active inference adds to the mix the planning of policies based on beliefs about the world [CAN AI THINK ON ITS OWN? 
- YouTube](https://youtu.be/zMDSMqtjays?si=9DBkfo6O2zc4kCyj) [[2402.00746] Health-LLM: Personalized Retrieval-Augmented Disease Prediction Model](https://arxiv.org/abs/2402.00746) Kurzgesagt actually has a popsci video on this: [Emergence – How Stupid Things Become Smart Together - YouTube](https://youtu.be/16W7c0mb-rE?si=THj3qDZaLbdp6J95) [The Most Useful Curve in Mathematics - YouTube](https://www.youtube.com/watch?v=OjIwCOevUew&list=WL&index=3&pp=gAQBiAQB) [Just Because You Don't Like Zuckerberg Doesn't Mean He's Wrong - YouTube](https://www.youtube.com/watch?v=hlZTv5vGJIo&list=WL&index=4&pp=gAQBiAQB) [[2402.01825] Fractal Patterns May Unravel the Intelligence in Next-Token Prediction](https://arxiv.org/abs/2402.01825) Goal-oriented prompting [[2401.14043] Towards Goal-oriented Large Language Model Prompting: A Survey](https://arxiv.org/abs/2401.14043) [A Deep Conceptual Guide to Mutual Information | by Sean McClure | The Startup | Medium](https://medium.com/swlh/a-deep-conceptual-guide-to-mutual-information-a5021031fad0) DRµGS: [Reddit - Dive into anything](https://www.reddit.com/r/LocalLLaMA/comments/18toidc/stop_messing_with_sampling_parameters_and_just/) [MobileDiffusion: Rapid text-to-image generation on-device – Google Research Blog](https://blog.research.google/2024/01/mobilediffusion-rapid-text-to-image.html) [The 
Math behind Adam Optimizer | Towards Data Science](https://towardsdatascience.com/the-math-behind-adam-optimizer-c41407efe59b) https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5003634/ Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs - Outperforms DALL-E 3 and SDXL, particularly in multi-category object composition and text-image semantic alignment with chain-of-thought https://twitter.com/burny_tech/status/1751648334698123758 [[2401.11708v1] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs](https://arxiv.org/abs/2401.11708v1) [GitHub - YangLing0818/RPG-DiffusionMaster: Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)](https://github.com/YangLing0818/RPG-DiffusionMaster) [Merge Large Language Models. Combine Mistral, WizardMath and… | by Sergei Savvov | Jan, 2024 | Medium](https://slgero.medium.com/merge-large-language-models-29897aeb1d1a) [The $2M Longevity Protocol: Bryan Johnson’s Biohacking Blueprint | Rich Roll Podcast - YouTube](https://youtu.be/roHeUk7ApUo?si=A5OJqOh5fTN6I0sm) - Recursive retrieval, aka the small-to-big retrieval technique, embeds smaller chunks for retrieval while returning larger parent context for the language model's synthesis. Smaller text chunks contribute to more accurate retrieval, while larger chunks provide richer contextual information for the language model. [Recursive Retriever + Query Engine Demo - LlamaIndex 🦙 v0.10.18.post1](https://docs.llamaindex.ai/en/stable/examples/query_engine/pdf_tables/recursive_retriever.html) - Sentence window retrieval process fetches a single sentence and returns a window of text around that particular sentence. 
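
The two retrieval ideas above (small-to-big and sentence window) can be sketched roughly as follows. This is a minimal illustration in plain Python, not LlamaIndex's API: the helper names `embed`, `cosine`, `split_sentences`, `small_to_big` and `sentence_window` are hypothetical, and a bag-of-words count vector stands in for a real embedding model.

```python
import re
from collections import Counter
from math import sqrt


def embed(text):
    """Toy stand-in for an embedding model (assumption): bag-of-words counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def split_sentences(text):
    """Naive sentence splitter: break on whitespace after ., ! or ?."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def small_to_big(query, parents):
    """Match the query against small chunks (sentences), return the larger parent chunk."""
    q = embed(query)
    best_score, best_parent = -1.0, None
    for parent in parents:
        for sentence in split_sentences(parent):
            score = cosine(q, embed(sentence))
            if score > best_score:
                best_score, best_parent = score, parent
    return best_parent


def sentence_window(query, document, window=1):
    """Find the single best-matching sentence, return it with `window` neighbours per side."""
    sentences = split_sentences(document)
    q = embed(query)
    best = max(range(len(sentences)), key=lambda i: cosine(q, embed(sentences[i])))
    lo, hi = max(0, best - window), min(len(sentences), best + window + 1)
    return " ".join(sentences[lo:hi])
```

In both cases the retrieval score is computed on the small unit, and only the returned context is widened; in a real pipeline the toy `embed` would be replaced by an actual embedding model and a vector index.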
[Metadata Replacement + Node Sentence Window - LlamaIndex 🦙 v0.10.18.post1](https://docs.llamaindex.ai/en/latest/examples/node_postprocessor/MetadataReplacementDemo.html) - [Real-time Observability for GenAI Apps and Models | Galileo](https://docs.rungalileo.io/galileo/llm-studio/llm-monitor) Finetuning resources [Curated the largest collection of fine-tuning notebooks for Language Model Models (LLMs) | Sunil Ghimire posted on the topic | LinkedIn](https://www.linkedin.com/posts/ghimiresunil_naturallanguageprocessing-transformer-genai-ugcPost-7158485069735571459-6Ttc?utm_source=share&utm_medium=member_android) [Quanta Magazine](https://www.quantamagazine.org/plants-find-light-using-gaps-between-their-cells-20240131/) [[2401.14295] Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts](https://arxiv.org/abs/2401.14295) If ideologies were inside a game [Imgur: The magic of the Internet](https://imgur.com/w66RwCb) “If you seek the respect from people now, you're married to the past and you can't push the boundaries of what's possible.” - Bryan Johnson https://twitter.com/liron/status/1750258305216508059 [Must Watch: Bryan Johnson’s Most Revealing Interview About His Mission To Live Forever - YouTube](https://www.youtube.com/watch?v=Qu1Xjj798zI) "In other words, the looming specter of the heat death of the universe can indeed be vanquished! 
😄🎉" https://twitter.com/MikePFrank/status/1750337497463169426 https://twitter.com/PropheticAI/status/1750534355242418300 [White House science chief signals US-China co-operation on AI safety](https://www.ft.com/content/94b9878b-9412-4dbc-83ba-aac2baadafd9) [Top 22 Humanoid Robots in Use Right Now | Built In](https://builtin.com/robotics/humanoid-robots) https://twitter.com/8teAPi/status/1750574300049084793 https://twitter.com/Andercot/status/1676786085106683904 [From addition to quantum physics - YouTube](https://www.youtube.com/watch?v=IShHchFyAWE) channels and pick the relevant sections and compile them in as connected an order as possible. https://twitter.com/burny_tech/status/1750701550131949769 According to AdS/CFT + ER = EPR theory, the fabric of spacetime itself is made up of this wormhole mycelium network, which is a geometric embodiment of multipartite quantum entanglement https://twitter.com/BasedBeffJezos/status/1750766792874840334?t=7KDLk_vpcXUeqlBpSpZ1Ig&s=19 https://twitter.com/danfaggella/status/1750866620321296732?t=bGGAl5qfxReMbyTzZcg41w&s=19 7 phases of transhumanism https://twitter.com/AISafetyMemes/status/1750823796779409582?t=aEvUBWk33D6DMbpbID8vcg&s=19 China and USA cooperate on AI safety https://twitter.com/burny_tech/status/1750864901612949685?t=JXW1Jo-asRYouAVOv-5u5A&s=19 https://twitter.com/svpino/status/1750887295400509893?t=nLwpK3Zya2I3rmtQ9HvjeQ&s=19 LoRA from scratch https://twitter.com/rasbt/status/1749475825664029102?t=VZGK67oVwIU6JxU1QsVo2Q&s=19 https://twitter.com/Outdoctrination/status/1750903148678459426 https://twitter.com/eshear/status/1750995830948135206?t=cTO7OPovGfduI2ZCnCEa8A&s=19 Gender political polarization accelerating https://twitter.com/jburnmurdoch/status/1750849189834022932?t=2VZJMqpL_3Rl83qn_pqxTQ&s=19 Fuck nuance in sociology https://twitter.com/kareem_carr/status/1750905007803605192?t=LnpfOAmGBErUB3dlkUbp6g&s=19 AbacusAI building agents 
https://twitter.com/bindureddy/status/1751086988613280026?t=etZp92abc9Ygk9OeNnE3Iw&s=19 Concurrency add to RAG https://twitter.com/virattt/status/1751019033531437382?t=SqYwSGBEXZYksmgaf86IhA&s=19 "Some governments will go 'woah, AI is dangerous, we better not build it'. And some governments will go 'woah, AI is powerful, we better be the ones to build it'. And this time, there's a good chance it'll be net harm, because most governments have in fact a lot more power to do bad than good, here." https://twitter.com/RokoMijic/status/1751296767004422203 https://twitter.com/emollick/status/1751353615334146460?t=iZYFohshgHMle4pIc1g7ug&s=19 AI biorisk [Matthew E. Walsh on LinkedIn: Towards Risk Analysis of the Impact of AI on the Deliberate Biological…](https://www.linkedin.com/posts/matthewwalsh3_towards-risk-analysis-of-the-impact-of-ai-activity-7155915709393354752-kLZS?utm_source=share&utm_medium=member_android) Homotopy theory is a branch of algebraic topology, a sphere spectrum is a concept from stable homotopy theory, and "stable ∞-categories" are related to higher category theory [Imgur: The magic of the Internet](https://imgur.com/vL4cX1b) [Imgur: The magic of the Internet](https://imgur.com/a/EGw8Ppb) [Imgur: The magic of the Internet](https://imgur.com/a/KvFitmH) https://twitter.com/llama_index/status/1751411893687001168?t=vNuygCZix-Q1MIrMkSV5xw&s=19 Enterprise RAG https://twitter.com/llama_index/status/1751291798843212111?t=S4U14-jGnn907XJIkYkePQ&s=19 [Unbelievable! 
Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique](https://huggingface.co/blog/lyogavin/airllm) https://twitter.com/burny_tech/status/1749803996418920789 Non-determinism in GPT-4 is caused by Sparse MoE [Non-determinism in GPT-4 is caused by Sparse MoE - 152334H](https://152334h.github.io/blog/non-determinism-in-gpt-4/) https://twitter.com/maksym_andr/status/1749546209755463953 [FNet: Mixing Tokens with Fourier Transforms – Paper Explained - YouTube](https://www.youtube.com/watch?v=j7pWPdGEfMA) https://twitter.com/jerryjliu0/status/1749830961590882714?t=W-dfAJPrY_QnjN0WMEkrvA&s=19 4 Levels of Agents for RAG https://twitter.com/intrstllrninja/status/1744630539896651918 Mixtral routing analysis shows that experts did not specialize in specific domains State space models (ML) https://twitter.com/LeopolisDream/status/1749852694091555265?t=dZssympiVzEqbsO6p6a0Uw&s=19 [[2401.08392] DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models](https://arxiv.org/abs/2401.08392) MemGPT https://twitter.com/jerryjliu0/status/1749959840959774922 https://twitter.com/NeelNanda5/status/1749886478673682677 https://twitter.com/wesg52/status/1749829624933322886 LLM Ops https://twitter.com/AndrewYNg/status/1750200384600309872?t=s48okvcU-BJWlTc0T0Z2hQ&s=19 Merging LLMs https://twitter.com/rasbt/status/1750180383398744106 [[2401.07103] Leveraging Large Language Models for NLG Evaluation: A Survey](https://arxiv.org/abs/2401.07103) https://twitter.com/burny_tech/status/1750298853952131570 Heat death of the universe might be prevented https://twitter.com/BasedBeffJezos/status/1750388754517492180?t=AwQt599JKryJnx-2phQOgw&s=19 https://twitter.com/fchollet/status/1748780260295164114?t=ushdliEKfXa42Tb2pqdk5g&s=19 Battling climate change with a mechanical tree from Arizona State University that does the work of 1000 trees! 
[Mechanical Tree™ – Carbon Collect](https://carboncollect.com/mechanical-tree/) Mechinterp backdoors https://twitter.com/StephenLCasper/status/1748872347699081682?t=d2Hb1jKdRMny-tzY5lXiIg&s=19 [Practices for Governing Agentic AI Systems](https://openai.com/research/practices-for-governing-agentic-ai-systems) Visualizing RAG https://twitter.com/hwchase17/status/1748730564130283914?t=8qLWfkdkLBi99MMHOsB2Ug&s=19 ML podcasts https://twitter.com/chrisalbon/status/1749101112710565956 https://twitter.com/ESYudkowsky/status/1749157823785931220 How would we perceive if we upgraded our hardware https://twitter.com/BasedBeffJezos/status/1749316697033691406?t=vU2A65ZcDDDehRPmWjSQIw&s=19 AI: Unpredictable, Unexplainable, Uncontrollable - Roman Yampolskiy https://twitter.com/romanyam/status/1748790916586889385?t=WXJqBe2af1BCk6iRmzTqrw&s=19 GPT-4 in a Harvard course https://twitter.com/emollick/status/1749239679788974247?t=SH_0llDKaOnt8o4sX5EFzA&s=19 AI agent finding news and tweeting https://twitter.com/DivGarg9/status/1749149133317996796?t=2kJOCBAHdTWg8ULUk5jrBQ&s=19 Longevity protocols https://twitter.com/powerfultakes/status/1749090751693099188?t=hiasn5ux9hy6VwVsh_duAg&s=19 BCI/acc https://twitter.com/trentmc0/status/1749095289909072063?t=sO5i5FzgxdHnhT-3Aif4Rw&s=19 Amazon robotics acceleration https://twitter.com/LinusEkenstam/status/1749216813416636791?t=C4FgnhO1S9L9N7yNywlwKA&s=19