Orchestrate swarms of agents from any framework, such as OpenAI and LangChain, to automate business operations. Join our community: https://discord.gg/DbjBMJTSWD
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
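For orientation, a minimal sketch of the idea this repo implements: split an image into patches, linearly embed them, prepend a class token, and feed the sequence to a standard transformer encoder. All hyperparameters below (image size, patch size, depth, dims) are illustrative assumptions, not the repo's settings.

```python
import torch
import torch.nn as nn

class TinyViT(nn.Module):
    def __init__(self, image_size=32, patch_size=8, dim=64, depth=2, heads=4, num_classes=10):
        super().__init__()
        num_patches = (image_size // patch_size) ** 2
        # Patch embedding as a strided convolution: one kernel application per patch.
        self.to_patches = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, dim))
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):                                    # x: (B, 3, H, W)
        p = self.to_patches(x).flatten(2).transpose(1, 2)    # (B, num_patches, dim)
        cls = self.cls_token.expand(x.size(0), -1, -1)
        h = torch.cat([cls, p], dim=1) + self.pos_embed
        h = self.encoder(h)
        return self.head(h[:, 0])                            # classify from the class token

logits = TinyViT()(torch.randn(2, 3, 32, 32))                # -> (2, 10)
```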
[CVPR 2024] "CFAT: Unleashing Triangular Windows for Image Super-resolution"
Source code for the GAtt method in "Revisiting Attention Weights as Interpretations of Message-Passing Neural Networks".
Experimental project building a custom LSTM and an LSTM with an attention layer for comparative analysis on FTS forecasting (June 2024)
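A rough sketch of the attention variant in the comparison above: an LSTM forecaster with an additive attention layer pooling over its hidden states. The feature count, hidden size, and one-step-ahead output are assumptions for illustration.

```python
import torch
import torch.nn as nn

class LSTMWithAttention(nn.Module):
    def __init__(self, n_features=1, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.score = nn.Linear(hidden, 1)   # additive attention scorer
        self.out = nn.Linear(hidden, 1)     # one-step-ahead forecast

    def forward(self, x):                                    # x: (B, T, n_features)
        h, _ = self.lstm(x)                                  # h: (B, T, hidden)
        weights = torch.softmax(self.score(h), dim=1)        # (B, T, 1), sums to 1 over time
        context = (weights * h).sum(dim=1)                   # attention-weighted summary
        return self.out(context)                             # (B, 1)

pred = LSTMWithAttention()(torch.randn(4, 30, 1))
```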
A minimal implementation of a denoising diffusion model in PyTorch.
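To make the entry above concrete, here is a sketch of the closed-form forward (noising) process that a denoising diffusion model is trained to invert, using the common linear beta schedule. The schedule endpoints and T are conventional assumptions, not taken from this repo.

```python
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)
alpha_bar = torch.cumprod(1.0 - betas, dim=0)    # cumulative product of alphas

def q_sample(x0, t, noise=None):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(abar_t) x0, (1 - abar_t) I) in closed form."""
    noise = torch.randn_like(x0) if noise is None else noise
    ab = alpha_bar[t].view(-1, 1, 1, 1)          # broadcast over image dims
    return ab.sqrt() * x0 + (1 - ab).sqrt() * noise

x0 = torch.randn(8, 3, 32, 32)
t = torch.randint(0, T, (8,))
xt = q_sample(x0, t)   # the denoising network is trained to predict `noise` from (xt, t)
```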
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" context length, and free sentence embeddings.
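A simplified, unstabilized sketch of the recurrence behind the claims above: RWKV's WKV operator is an exponentially decaying weighted average of past values, so inference needs only O(1) state per channel. Real RWKV adds numerical stabilization and a different decay parametrization; the shapes and parametrization here are assumptions for illustration.

```python
import torch

def naive_wkv(k, v, w, u):
    """k, v: (T, C); w: positive per-channel decay (C,); u: current-token bonus (C,)."""
    T, C = k.shape
    num = torch.zeros(C)   # running weighted sum of past values
    den = torch.zeros(C)   # running sum of past weights
    out = []
    for t in range(T):
        cur = torch.exp(u + k[t])                  # current token gets bonus u, no decay
        out.append((num + cur * v[t]) / (den + cur))
        num = torch.exp(-w) * num + torch.exp(k[t]) * v[t]   # decay state, absorb token t
        den = torch.exp(-w) * den + torch.exp(k[t])
    return torch.stack(out)

y = naive_wkv(torch.randn(16, 8), torch.randn(16, 8), torch.rand(8), torch.randn(8))
```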
An attention-based approach to converting Indian Sign Language to text using simulated hand-gesture data
Researching causal relationships in time series data using Temporal Convolutional Networks (TCNs) combined with attention mechanisms. This approach aims to identify complex temporal interactions. Additionally, we're incorporating uncertainty quantification to enhance the reliability of our causal predictions.
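As background for the entry above, a sketch of the dilated causal convolution that underlies a TCN: left-only padding keeps the output at time t from seeing the future, which is what makes temporal ordering (and hence causal analysis) meaningful. The attention and uncertainty components would sit on top of stacks of these blocks; sizes here are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalConv1d(nn.Module):
    def __init__(self, c_in, c_out, kernel_size=3, dilation=1):
        super().__init__()
        self.pad = (kernel_size - 1) * dilation   # pad only on the left
        self.conv = nn.Conv1d(c_in, c_out, kernel_size, dilation=dilation)

    def forward(self, x):                         # x: (B, C, T)
        x = F.pad(x, (self.pad, 0))               # no leakage from future timesteps
        return self.conv(x)

y = CausalConv1d(1, 16, dilation=2)(torch.randn(4, 1, 100))   # -> (4, 16, 100)
```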
An extension for the code and data of the paper "Human Choice Prediction in Language-based Non-Cooperative Games: Simulation-based Off-Policy Evaluation" (Shapira et al. 2023). This project was conducted by Yogev Namir and Avishag Nevo.
A novel implementation fusing ViT with Mamba into a fast, agile, high-performance multi-modal model. Powered by Zeta, the simplest AI framework ever.
This repository features a custom-built, decoder-only language model with a total of 37 million parameters 🔥. The model is trained to ask questions from a given context.
A simple but complete full-attention transformer with a set of promising experimental features from various papers
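The core operation that the repository above (and most entries on this page) builds on, as a self-contained function: scaled dot-product attention. The boolean-mask convention (True means keep) is an assumption of this sketch.

```python
import math
import torch

def attention(q, k, v, mask=None):
    """q, k, v: (B, heads, T, d). Returns (B, heads, T, d)."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))   # (B, heads, T, T)
    if mask is not None:
        scores = scores.masked_fill(~mask, float('-inf'))      # blocked positions get zero weight
    return torch.softmax(scores, dim=-1) @ v

q = k = v = torch.randn(2, 4, 16, 32)
out = attention(q, k, v)
```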
Xllama 🦙 is an extensible, advanced language model framework inspired by the original Llama model.
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
QuillGPT is a PyTorch implementation of the GPT decoder block based on the architecture from the "Attention Is All You Need" paper by Vaswani et al. The repository also contains two pre-trained models (Shakespearean GPT and Harpoon GPT), a Streamlit playground, a containerized FastAPI microservice, and training and inference scripts and notebooks.
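A minimal sketch of the kind of decoder block described above: masked self-attention followed by an MLP, each with a residual connection. The dimensions and the pre-norm arrangement are illustrative assumptions, not necessarily QuillGPT's exact layout.

```python
import torch
import torch.nn as nn

class DecoderBlock(nn.Module):
    def __init__(self, dim=128, heads=4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x):                          # x: (B, T, dim)
        T = x.size(1)
        # True above the diagonal = future positions are blocked from attention.
        causal = torch.triu(torch.ones(T, T, dtype=torch.bool), diagonal=1)
        h = self.norm1(x)
        a, _ = self.attn(h, h, h, attn_mask=causal)
        x = x + a                                  # residual around attention
        return x + self.mlp(self.norm2(x))         # residual around MLP

y = DecoderBlock()(torch.randn(2, 10, 128))
```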
Sequence-to-sequence framework with a focus on Neural Machine Translation, built on PyTorch
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
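For context on what "constant cost per token" means, here is a generic causal linear-attention recurrence, the broader family this paper belongs to. This is not Heinsen's method; the feature map phi and the recurrence below are a standard kernelized-attention illustration only, showing why a fixed-size running state can replace the T x T attention matrix.

```python
import torch

def causal_linear_attention(q, k, v, phi=lambda x: torch.nn.functional.elu(x) + 1):
    """q, k: (T, d); v: (T, dv). Cost per token is O(d * dv), independent of T."""
    d, dv = q.size(-1), v.size(-1)
    S = torch.zeros(d, dv)   # running sum of phi(k_t) v_t^T
    z = torch.zeros(d)       # running sum of phi(k_t), for normalization
    out = []
    for t in range(q.size(0)):
        fq, fk = phi(q[t]), phi(k[t])
        S = S + fk.unsqueeze(1) * v[t].unsqueeze(0)   # absorb token t into the state
        z = z + fk
        out.append((fq @ S) / (fq @ z + 1e-8))
    return torch.stack(out)

out = causal_linear_attention(torch.randn(16, 8), torch.randn(16, 8), torch.randn(16, 8))
```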
Visualizing the attention of vision-language models