sistemas.ai

Artificial Intelligence
(4)

16/08/2024

𝐓𝐫𝐚𝐧𝐬𝐟𝐨𝐫𝐦𝐞𝐫 𝐄𝐱𝐩𝐥𝐚𝐢𝐧𝐞𝐫
Beautiful visualization of the inner workings of a Transformer.

An interactive visualization tool showing you how transformer models work in large language models (LLM) like GPT.

15/08/2024

“𝗔𝗹𝗴𝗼𝗿𝗶𝘁𝗵𝗺𝘀 𝗳𝗼𝗿 𝗗𝗲𝗰𝗶𝘀𝗶𝗼𝗻 𝗠𝗮𝗸𝗶𝗻𝗴”, a book published by MIT Press, is freely available online.
Book: https://algorithmsbook.com/

13/08/2024

🔥𝗧𝗵𝗲 𝗔𝗜 𝗦𝗰𝗶𝗲𝗻𝘁𝗶𝘀𝘁
Paper: https://arxiv.org/abs/2408.06292
GitHub: https://github.com/SakanaAI/AI-Scientist

08/08/2024

𝗖𝗼𝗱𝗶𝗻𝗴 𝗮 𝗠𝘂𝗹𝘁𝗶𝗺𝗼𝗱𝗮𝗹 (𝗩𝗶𝘀𝗶𝗼𝗻) 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗠𝗼𝗱𝗲𝗹 𝗳𝗿𝗼𝗺 𝘀𝗰𝗿𝗮𝘁𝗰𝗵 𝗶𝗻 𝗣𝘆𝗧𝗼𝗿𝗰𝗵.
Topics:
Transformer model (Embeddings, Positional Encoding, Multi-Head Attention, Feed Forward Layer, Logits, Softmax)
Vision Transformer model
Contrastive learning (CLIP, SigLip)
Numerical stability of the Softmax and the Cross Entropy Loss
Rotary Positional Embedding
Multi-Head Attention
Grouped Query Attention
Normalization layers (Batch, Layer and RMS)
KV-Cache (prefilling and token generation)
Attention masks (causal and non-causal)
Weight tying
Top-P Sampling and Temperature
and much more!

https://youtu.be/vAmKB7iPkWw?si=Lp4xV3xJlMgQfdtL

Full coding of a Multimodal (Vision) Language Model from scratch using only Python and PyTorch. We will be coding the PaliGemma Vision Language Model from sc...
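
Two of the topics listed above, the numerical stability of the softmax and top-p sampling with temperature, can be illustrated in a few lines of plain Python. This is a minimal sketch and is not taken from the video: the function names (`stable_softmax`, `top_p_filter`, `sample`) are hypothetical, and a real model would do this on PyTorch tensors.

```python
import math
import random

def stable_softmax(logits, temperature=1.0):
    """Softmax with temperature; subtracting the max avoids overflow in exp()."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]  # largest exponent is exp(0) = 1
    total = sum(exps)
    return [e / total for e in exps]

def top_p_filter(probs, p=0.9):
    """Keep the smallest set of most likely tokens whose cumulative
    probability reaches p, then renormalise over that set."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}

def sample(logits, temperature=1.0, p=0.9, rng=random):
    """Draw one token index using temperature scaling followed by top-p."""
    dist = top_p_filter(stable_softmax(logits, temperature), p)
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]
```

A naive `math.exp(1000.0)` would overflow, whereas `stable_softmax([1000.0, 1000.0])` works because only differences from the maximum are exponentiated. A lower temperature sharpens the distribution before the top-p cut is applied.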

30/07/2024

🔥Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images & videos.

SAM 2 is available today under the Apache 2.0 license, so anyone can use it to build their own experiences.

Details ➡️
https://ai.meta.com/blog/segment-anything-2

Blog Post: https://ai.meta.com/blog/segment-anything-2/
Paper: https://ai.meta.com/research/publications/sam-2-segment-anything-in-images-and-videos/
Demo: htt...

07/07/2024

🔥https://artificialanalysis.ai/ is a great site that benchmarks quality/speed/price of different LLM API providers to help developers pick which models to use.

06/07/2024

🔥An excellent new book to learn the concepts of Deep Learning.
Topics include fundamental building blocks, Transformers, GNNs, RL, diffusion models, and more.
FREE PDF: https://udlbook.github.io/udlbook/

04/07/2024

GraphRAG, a graph-based approach to retrieval-augmented generation (RAG) that significantly improves question-answering over private or previously unseen datasets, is now available on GitHub.
https://www.microsoft.com/en-us/research/blog/graphrag-new-tool-for-complex-data-discovery-now-on-github/

27/06/2024

🔥𝐓𝐎𝐏 𝐂𝐕𝐏𝐑 𝟐𝟎𝟐𝟒 𝐩𝐚𝐩𝐞𝐫𝐬
This repository is a curated collection of the most exciting and influential CVPR 2024 papers!
https://github.com/SkalskiP/top-cvpr-2024-papers

26/06/2024

Together AI and Morph Labs collaborated to create an excellent blog post on tuning models for RAG (Retrieval Augmented Generation). RAG fine-tuning combines code retrieval with model training, addressing the limitations of outdated knowledge and hallucinations in LLMs.

Large Language Models (LLMs) have shown promising capabilities on multiple applications such as code generation, task planning, and document understanding. Despite the impressive performance, these models often fall short due to two main reasons: hallucinations and outdated knowledge in the models.....

10/06/2024

🔥𝐋𝐞𝐭’𝐬 𝐫𝐞𝐩𝐫𝐨𝐝𝐮𝐜𝐞 𝐆𝐏𝐓-𝟐 (𝟏𝟐𝟒𝐌) by Andrej Karpathy (former OpenAI scientist and Tesla's former head of AI)
📽️ New 4-hour video lecture on YouTube:
https://youtu.be/l8pRSuU81PU?si=M1kznmR5XSiYW-Qz

We reproduce the GPT-2 (124M) from scratch. This video covers the whole process: First we build the GPT-2 network, then we optimize its training to be really...

07/06/2024

🔥𝐐𝐰𝐞𝐧𝟐 is Alibaba's newest open-source large language model. It slightly surpasses Llama 3 70B on English benchmarks while being a better multilingual model.
Blog:
https://qwenlm.github.io/blog/qwen2/

05/06/2024

𝐓𝐡𝐞 𝐑𝐢𝐬𝐞 𝐨𝐟 𝐀𝐠𝐞𝐧𝐭𝐢𝐜 𝐑𝐞𝐭𝐫𝐢𝐞𝐯𝐚𝐥-𝐀𝐮𝐠𝐦𝐞𝐧𝐭𝐞𝐝 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐨𝐧 (𝐑𝐀𝐆) 𝐢𝐧 𝐀𝐫𝐭𝐢𝐟𝐢𝐜𝐢𝐚𝐥 𝐈𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞 𝐀𝐈
In the rapidly developing fields of data science and Artificial Intelligence (AI), the search for more effective systems keeps intensifying. Agentic Retrieval-Augmented Generation (RAG) is among the most notable recent developments. The approach is presented as a substantial improvement over current RAG systems in how information is used and managed.
https://www.marktechpost.com/2024/05/28/the-rise-of-agentic-retrieval-augmented-generation-rag-in-artificial-intelligence-ai/

02/06/2024

𝐆𝐫𝐚𝐩𝐡𝐑𝐀𝐆 (Graph-based Retrieval Augmented Generation) enhances the traditional Retrieval Augmented Generation (RAG) method by integrating knowledge graphs (KGs) or graph databases with large language models (LLMs). It leverages the structured nature of graph databases to organize data as nodes and relationships, enabling more efficient and accurate retrieval of relevant information to provide better context to LLMs for generating responses.
https://gradientflow.substack.com/p/graphrag-design-patterns-challenges

Enhancing RAG with Knowledge Graphs: Blueprints, Hurdles, and Guidelines. By Ben Lorica and Prashanth Rao.
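
The retrieval idea described above, organizing data as nodes and relationships and pulling the relevant ones into the LLM's context, can be sketched with a toy in-memory graph. This is an illustrative sketch only: the triples, the substring-based entity matching, and the function names (`build_index`, `retrieve`, `build_prompt`) are assumptions, and a real system would use a graph database, proper entity linking, and an actual LLM call.

```python
from collections import defaultdict

# Hypothetical toy knowledge graph: (subject, relation, object) triples.
TRIPLES = [
    ("GraphRAG", "extends", "RAG"),
    ("RAG", "provides context to", "LLM"),
    ("GraphRAG", "uses", "knowledge graph"),
    ("knowledge graph", "stores", "nodes and relationships"),
]

def build_index(triples):
    """Map each lowercased entity to the triples it takes part in."""
    index = defaultdict(list)
    for subj, rel, obj in triples:
        index[subj.lower()].append((subj, rel, obj))
        index[obj.lower()].append((subj, rel, obj))
    return index

def retrieve(question, index, hops=1):
    """Collect triples touching entities named in the question, then
    expand to neighbouring entities for `hops` further steps."""
    q = question.lower()
    # Naive entity linking (assumption): substring match against node names.
    frontier = {entity for entity in index if entity in q}
    seen, facts = set(), []
    for _ in range(hops + 1):
        next_frontier = set()
        for entity in sorted(frontier):
            if entity in seen:
                continue
            seen.add(entity)
            for triple in index[entity]:
                if triple not in facts:
                    facts.append(triple)
                next_frontier.update({triple[0].lower(), triple[2].lower()})
        frontier = next_frontier - seen
    return facts

def build_prompt(question, facts):
    """Turn the retrieved triples into context for an LLM prompt."""
    context = "\n".join(f"- {s} {r} {o}" for s, r, o in facts)
    return f"Context:\n{context}\n\nQuestion: {question}"
```

Calling `retrieve("What does GraphRAG use?", build_index(TRIPLES))` follows relationships one hop out from the matched entities, which is how facts that never mention the question's wording can still end up in the prompt.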

29/05/2024

𝐆𝐏𝐓 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡𝐞𝐫
- A GPT-based autonomous agent that performs comprehensive online research on any given topic.
- GPT Researcher supports all major LLM providers.
Repo:
https://github.com/assafelovic/gpt-researcher

27/05/2024

𝐌𝐢𝐬𝐭𝐫𝐚𝐥 𝐅𝐢𝐧𝐞𝐭𝐮𝐧𝐞
Mistral released an official repository to fine-tune its models.
GitHub Repo:
https://github.com/mistralai/mistral-finetune

26/05/2024

𝐃𝐢𝐟𝐟𝐮𝐬𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬
Notes on the theory behind models like Stable Diffusion and their applications.
https://andrewkchan.dev/posts/diffusion.html

I spent 2022 learning to draw and was blindsided by the rise of AI art models like Stable Diffusion. Suddenly, the computer was a better artist than I could ever hope to be. It's been two years, a...

26/05/2024

𝐄𝐥𝐢𝐚
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
Repo:
https://github.com/darrenburns/elia

24/05/2024

🔥𝐥𝐥𝐚𝐦𝐚𝟑 𝐢𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭𝐞𝐝 𝐟𝐫𝐨𝐦 𝐬𝐜𝐫𝐚𝐭𝐜𝐡
This great tutorial shows every step of reconstructing Llama 3 and running the trained weights.
Repo:
https://github.com/naklecha/llama3-from-scratch

19/05/2024

𝐊𝐨𝐥𝐦𝐨𝐠𝐨𝐫𝐨𝐯 𝐀𝐫𝐧𝐨𝐥𝐝 𝐍𝐞𝐭𝐰𝐨𝐫𝐤𝐬 (𝐊𝐀𝐍) 𝐏𝐚𝐩𝐞𝐫 𝐄𝐱𝐩𝐥𝐚𝐢𝐧𝐞𝐝 - An exciting new paradigm for Deep Learning?
https://youtu.be/7zpz_AlFW2w

This is a paper breakdown video of the paper: Kolmogorov Arnold Networks, which brilliantly provides an alternative to standard Multi Layer Perceptrons. The ...

16/05/2024

🔥𝐀𝐝𝐯𝐚𝐧𝐜𝐞𝐝 𝐍𝐋𝐏 𝐟𝐫𝐨𝐦 𝐂𝐚𝐫𝐧𝐞𝐠𝐢𝐞 𝐌𝐞𝐥𝐥𝐨𝐧 𝐔𝐧𝐢𝐯𝐞𝐫𝐬𝐢𝐭𝐲! (2024)
Great recent NLP topics like prompting, fine-tuning and instruction-tuning, retrieval and RAG, ensembling and mixture of experts (MoE), and more.

One of the best NLP courses on the web:
https://www.youtube.com/playlist?list=PL8PYTP1V4I8D0UkqW2fEhgLrnlDW9QK7z

16/05/2024

This year, more and more developers are talking about AI agents - autonomous or semi-autonomous systems capable of handling a wider range of tasks and making decisions on their own. Unlike co-pilots, agents have a higher degree of autonomy and can take proactive actions based on their goals and understanding of the environment. They can complete tasks without constant human intervention, learning and adapting based on their interactions and experiences.
https://gradientflow.substack.com/p/agentic-ai-challenges-and-opportunities

Navigating the Complex World of AI Agents. Last year, the buzz in the AI community revolved around the concept of AI co-pilots - systems designed to work alongside humans, assisting them in tasks and decision-making processes. These co-pilots, such as GitHub Copilot for ...

15/05/2024

🔥🔥🔥𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐀𝐠𝐞𝐧𝐭, a Google Labs experiment! It builds an AI-generated Colab notebook that handles data cleaning, data exploration, plotting, Q&A on data, and predictive modeling.
◆ Helps with complex tasks like planning and error correction.
◆ Helps with data science tasks like predictive modeling.
◆ Outputs an AI-generated Colab notebook based on your prompt.

Try it at https://labs.google.com/code/

11/05/2024

🔥𝐈𝐧𝐜𝐫𝐞𝐝𝐢𝐛𝐥𝐞 𝐬𝐞𝐫𝐢𝐞𝐬 𝐨𝐟 𝟗𝟖 𝐥𝐞𝐜𝐭𝐮𝐫𝐞𝐬 𝐟𝐫𝐨𝐦 𝐔𝐓 𝐀𝐮𝐬𝐭𝐢𝐧 𝐨𝐧 𝐍𝐋𝐏 𝐚𝐧𝐝 𝐋𝐋𝐌𝐬
It gives decent synopses of core NLP topics as well as recent ones like RLHF, instruction-tuning, few-shot prompting, chain-of-thought, and more.
Lectures:
https://www.youtube.com/playlist?list=PLofp2YXfp7TZZ5c7HEChs0_wfEfewLDs7

09/05/2024

🔥𝐆𝐫𝐚𝐧𝐢𝐭𝐞 𝐂𝐨𝐝𝐞 𝐌𝐨𝐝𝐞𝐥𝐬: 𝐀 𝐅𝐚𝐦𝐢𝐥𝐲 𝐨𝐟 𝐎𝐩𝐞𝐧 𝐅𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 𝐟𝐨𝐫 𝐂𝐨𝐝𝐞 𝐈𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞
- IBM has released four variations of the Granite code model.
- They range in size from 3B to 34B parameters.
- Trained on 3T to 4T tokens sourced from 𝟏𝟏𝟔 𝐩𝐫𝐨𝐠𝐫𝐚𝐦𝐦𝐢𝐧𝐠 𝐥𝐚𝐧𝐠𝐮𝐚𝐠𝐞𝐬.
- The models have outperformed other comparable models like Code Llama and Llama 3 in many tasks.
- Repo:
https://github.com/ibm-granite/granite-code-models
- Paper:
https://arxiv.org/abs/2405.04324

09/05/2024

🔥𝐊𝐨𝐥𝐦𝐨𝐠𝐨𝐫𝐨𝐯-𝐀𝐫𝐧𝐨𝐥𝐝 𝐍𝐞𝐭𝐰𝐨𝐫𝐤 𝐟𝐨𝐫 𝐑𝐞𝐢𝐧𝐟𝐨𝐫𝐜𝐞𝐦𝐞𝐧𝐭 𝐋𝐞𝐚𝐫𝐧𝐢𝐧𝐠, 𝐢𝐧𝐢𝐭𝐢𝐚𝐥 𝐞𝐱𝐩𝐞𝐫𝐢𝐦𝐞𝐧𝐭𝐬
This small project tests the novel Kolmogorov-Arnold Network (KAN) architecture in a reinforcement learning setting, on the CartPole problem.
Repo:
https://github.com/riiswa/kanrl

09/05/2024

🔥𝐱𝐋𝐒𝐓𝐌: 𝐄𝐱𝐭𝐞𝐧𝐝𝐞𝐝 𝐋𝐨𝐧𝐠 𝐒𝐡𝐨𝐫𝐭-𝐓𝐞𝐫𝐦 𝐌𝐞𝐦𝐨𝐫𝐲
Sepp Hochreiter, who invented the LSTM, just dropped a new LLM architecture!
- The xLSTM architecture is shown to be efficient at handling different aspects of long-context problems.
- A major component is a new parallelizable LSTM.
- One of the major weaknesses of prior LSTMs was their sequential nature: each time step depends on the previous one, so the steps cannot be computed all at once.
Check the paper for more interesting insights and results:
https://arxiv.org/abs/2405.04517
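
The sequential bottleneck mentioned above can be seen in a simplified scalar gated recurrence. This is an illustrative sketch, not the xLSTM or a full LSTM: the function name `gated_scan` and the fixed weights are assumptions chosen only to show that each step needs the previous hidden state.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gated_scan(inputs, w_f=0.5, w_i=0.5, w_h=0.1):
    """Simplified scalar gated recurrence (hypothetical weights).

    The gates at step t read the hidden state h from step t-1, so the
    loop cannot be replaced by one parallel operation over all steps.
    """
    h, c = 0.0, 0.0
    states = []
    for x in inputs:
        f = sigmoid(w_f * x + w_h * h)  # forget gate depends on previous h
        i = sigmoid(w_i * x + w_h * h)  # input gate depends on previous h
        c = f * c + i * math.tanh(x)    # cell state carries memory forward
        h = math.tanh(c)                # new hidden state
        states.append(h)
    return states
```

Changing only the first input changes every later state, which is the dependency chain that forces step-by-step computation in this kind of recurrence.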

08/05/2024

🔥🔥🔥𝐀𝐰𝐞𝐬𝐨𝐦𝐞 𝐊𝐀𝐍(𝐊𝐨𝐥𝐦𝐨𝐠𝐨𝐫𝐨𝐯-𝐀𝐫𝐧𝐨𝐥𝐝 𝐍𝐞𝐭𝐰𝐨𝐫𝐤)
A curated list of awesome libraries, tutorials, papers, and other resources related to Kolmogorov-Arnold Network (KAN). This repository aims to be a comprehensive and organized collection that will help researchers and developers in the world of KAN!
Repo: https://github.com/mintisan/awesome-kan

07/05/2024

🔥🔥🔥𝐊𝐀𝐍-𝐆𝐏𝐓
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Repo:
https://github.com/AdityaNG/kan-gpt

07/05/2024

𝐈𝐂𝐋𝐑 𝟐𝟎𝟐𝟒 𝐎𝐮𝐭𝐬𝐭𝐚𝐧𝐝𝐢𝐧𝐠 𝐏𝐚𝐩𝐞𝐫 𝐀𝐰𝐚𝐫𝐝𝐬 (International Conference on Learning Representations)
1. Generalization in diffusion models arises from geometry-adaptive harmonic representations
2. Learning Interactive Real-World Simulators
3. Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors
4. Protein Discovery with Discrete Walk-Jump Sampling
5. Vision Transformers Need Registers
Blog:
https://blog.iclr.cc/2024/05/06/iclr-2024-outstanding-paper-awards/

May 6, 2024. ICLR 2024 Outstanding Paper Awards. Yisong Yue. ICLR 2024 Awards Committee: Eunsol Choi, Katja Hofmann, Ming-Yu Liu, Nan Jiang, Stephan Günnemann, Suvrit Sra, Thomas Kipf, Volkan Cevher. (This post is written by the Awards Committee, lightly edited by the Program Chairs.) Selection Process ...

Website

http://www.sistemas.ai/

Videos

🔥Animated Line Chart with matplotlib! Tutorial: https://python-graph-gallery.com/web-animated-line-chart-with-annotation/ #SistemasAi #datascience

🔥𝗔𝗻𝗰𝗶𝗲𝗻𝘁 𝗖𝗵𝗶𝗻𝗲𝘀𝗲 𝘁𝗲𝗿𝗿𝗮𝗰𝗼𝘁𝘁𝗮 𝘄𝗮𝗿𝗿𝗶𝗼𝗿 𝗯𝗿𝗼𝘂𝗴𝗵𝘁 𝘁𝗼 𝗹𝗶𝗳𝗲 𝗯𝘆 𝗰𝗼𝗱𝗲𝗿 𝘂𝘀𝗶𝗻𝗴 𝗮𝗿𝘁𝗶𝗳𝗶𝗰𝗶𝗮𝗹 𝗶𝗻𝘁𝗲𝗹𝗹𝗶𝗴𝗲𝗻𝗰𝗲 𝗮𝗹𝗴𝗼𝗿𝗶𝘁𝗵𝗺𝘀 Article: https://www.scmp.com/abacus/tech/article/3099065/ancient-chinese-terracotta-warrior-brought-life-coder-using-artificial #SistemasAi #Ai #artificialintelligence #inteligenciaartificial #ComputerVision

𝗧𝘄𝗼 𝗧𝗵𝗼𝘂𝘀𝗮𝗻𝗱 𝗬𝗲𝗮𝗿𝘀 𝗼𝗳 𝗚𝗹𝗼𝗯𝗮𝗹 𝗧𝗲𝗺𝗽𝗲𝗿𝗮𝘁𝘂𝗿𝗲𝘀 𝗶𝗻 𝗧𝘄𝗲𝗻𝘁𝘆 𝗦𝗲𝗰𝗼𝗻𝗱𝘀 🔥 Visualization by bgregory98 via Reddit. #SistemasAi #Data

𝗡𝗲𝘄 𝗽𝗮𝗽𝗲𝗿: 𝗖𝗼𝗻𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻𝗮𝗹 𝗢𝗰𝗰𝘂𝗽𝗮𝗻𝗰𝘆 𝗡𝗲𝘁𝘄𝗼𝗿𝗸𝘀 A featured new paper at #ECCV2020 (August 23-28). Lately, "𝗶𝗺𝗽𝗹𝗶𝗰𝗶𝘁 𝗻𝗲𝘂𝗿𝗮𝗹 𝗿𝗲𝗽𝗿𝗲𝘀𝗲𝗻𝘁𝗮𝘁𝗶𝗼𝗻𝘀" have been widely adopted in 3D reconstruction systems based on machine learning models. Compared with other papers, 𝗖𝗼𝗻𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻𝗮𝗹 𝗢𝗰𝗰𝘂𝗽𝗮𝗻𝗰𝘆 𝗡𝗲𝘁𝘄𝗼𝗿𝗸𝘀 builds on an even more flexible "implicit neural representation" that combines "convolutional encoders" with "implicit occupancy decoders" for more detailed reconstruction of objects and large-scale 3D scenes. If you are interested in large-scale 3D reconstruction... 𝗖𝗼𝗻𝘃𝗼𝗹𝘂𝘁𝗶𝗼𝗻𝗮𝗹 𝗢𝗰𝗰𝘂𝗽𝗮𝗻𝗰𝘆 𝗡𝗲𝘁𝘄𝗼𝗿𝗸𝘀 is the best available so far. Code and data included! Abstract: https://arxiv.org/abs/2003.04618 Paper: https://arxiv.org/pdf/2003.04618 Project page: https://pengsongyou.github.io/conv_onet Code and data: https://github.com/autonomousvision/convolutional_occupancy_networks YouTube: https://youtu.be/EmauovgrDSM #SistemasAi #Ai #inteligenciaartificial #DeepLearning #artificialintelligence #MachineLearning #ComputerVision

Original images used to animate the first Mortal Kombat video game 😍 Today, Deep Learning can be used to produce video game characters that closely resemble reality. #SistemasAi #Animation #DeepLearning #ComputerVision
