Skip to content

Pull requests: karpathy/llm.c

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Micro optimization for softmax_forward_kernel5
#762 opened Sep 20, 2024 by insop Loading…
FP8 with Tensor Reorg
#760 opened Sep 19, 2024 by ademeure Draft
Update download_starter_pack.sh
#758 opened Sep 18, 2024 by dongrixinyu Loading…
RMSNorm - WIP
#757 opened Sep 16, 2024 by gordicaleksa Loading…
Add SwiGLU support - llama3 feature branch
#755 opened Sep 13, 2024 by gordicaleksa Loading…
add llama 3 support to llm.c
#754 opened Sep 13, 2024 by karpathy Draft
Adamw thread coarsening kernel
#753 opened Sep 3, 2024 by saladpalad Loading…
Fix sizing typo in train_gpt2_fp32.cu
#748 opened Aug 25, 2024 by gajanan-choudhary Loading…
log with LINE and FILE for better addressing.
#746 opened Aug 22, 2024 by NEWPLAN Loading…
check libnccl instead of nccl to be more reliable
#742 opened Aug 14, 2024 by dengl11 Loading…
[WIP] initial curand implementation for model init
#741 opened Aug 13, 2024 by ngc92 Loading…
multi-threaded model initialization
#737 opened Aug 12, 2024 by ngc92 Loading…
Add external KV to LLaMA 3
#734 opened Aug 10, 2024 by gordicaleksa Loading…
Add SwiGLU support
#718 opened Jul 29, 2024 by gordicaleksa Loading…
Add RoPE positional encoding
#714 opened Jul 28, 2024 by gordicaleksa Loading…
Add high perf mode
#708 opened Jul 23, 2024 by gordicaleksa Loading…
Add KV cache for inference
#707 opened Jul 22, 2024 by gordicaleksa Loading…
Major FP32 llm.c improvements/refactoring/etc.
#696 opened Jul 18, 2024 by ademeure Loading…
Update README.md with prerequisite of libomp
#691 opened Jul 17, 2024 by nzhang Loading…
ProTip! Updated in the last three days: updated:>2024-09-16.