sentdex
sentdex
  • 1 254
  • 114 944 536
Building an LLM fine-tuning Dataset
Going through the building of a QLoRA fine-tuning dataset for a language model.
NVIDIA GTC signup: nvda.ws/3XTqlB6
Fine-tuning code: github.com/Sentdex/LLM-Finetuning
5000-step Walls1337bot adapter: huggingface.co/Sentdex/Walls1337bot-Llama2-7B-003.005.5000
WSB Dataset: huggingface.co/datasets/Sentdex/WSB-003.005
"I have every reddit comment" original reddit post and torrent info: www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/
2007-2015 Reddit Archive.org: archive.org/download/2015_reddit_comments_corpus/reddit_data/
Reddit BigQuery 2007-2019 (this has other data besides reddit comments too!): reddit.com/r/bigquery/comments/3cej2b/17_billion_reddit_comments_loaded_on_bigquery/
Contents:
0:00 - Introduction to Dataset building for fine-tuning.
02:53 - The Reddit dataset options (Torrent, Archive.org, BigQuery)
06:07 - Exporting BigQuery Reddit (and some other data)
14:44 - Decompressing all of the gzip archives
25:13 - Re-combining the archives for target subreddits
28:29 - How to structure the data
40:40 - Building training samples and saving to database
48:49 - Creating customized training json files
54:11 - QLoRA training and results
Neural Networks from Scratch book: nnfs.io
Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin
Discord: discord.gg/sentdex
Reddit: www.reddit.com/r/sentdex/
Support the content: pythonprogramming.net/support-donate/
Twitter: sentdex
Instagram: sentdex
Facebook: pythonprogramming.net/
Twitch: www.twitch.tv/sentdex
Переглядів: 39 114

Відео

Visualizing Neural Network Internals
Переглядів 41 тис.4 місяці тому
Visualizing some of the internals of a neural network during training and inference. Starting and full code: github.com/Sentdex/neural-net-internals-visualized Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ T...
Getting Back on Grid
Переглядів 18 тис.4 місяці тому
Establishing an internet connection in an internet desert, then figuring out (well, starting to) networking. Combined with Starlink as my internet provider, I ended up going with a wifi bridge implementation with a couple of Ubiquiti nanostation AC locos to network between buildings at 100 meters of distance. The Ubiquiti units can also do point to point (ptp), but so far the wifi bridge setup ...
Open Source AI Inference API w/ Together
Переглядів 31 тис.5 місяців тому
Exploring the Together Inference API (www.together.ai/) Together API basics jupyter notebook examples: github.com/Sentdex/Together-API-Basics Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: twitter.co...
INFINITE Inference Power for AI
Переглядів 25 тис.6 місяців тому
Testing and enjoying the Comino Grando Server machine with 6x RTX 4090s from Comino (www.comino.com/) Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex Instagram: sentd...
Pandas Dataframes on your GPU w/ CuDF
Переглядів 41 тис.7 місяців тому
An overview and some quick examples of using CuDF's Pandas accelerator and how much faster it can be than vanilla Pandas for data analysis. Colab demo of Rapids: nvda.ws/3LWggQj AI and Data Science Virtual Summit: nvda.ws/3ZR3wjL Notebook in this video: gist.github.com/Sentdex/469c30385d06719519af13125db85edc Install CuDF: pip install cudf-cu11 extra-index-url=pypi.nvidia.com (or cu12) Neural N...
QLoRA is all you need (Fast and lightweight model fine-tuning)
Переглядів 65 тис.9 місяців тому
Learning and sharing my process with QLoRA (quantized low rank adapters) fine-tuning. In this case, I use a custom-made reddit dataset, but you can use anything you want. I referenced a LOT of stuff in this video, I will do my best to link everything, but let me know if I forget anything. Resources: WSB-GPT-7B Model: huggingface.co/Sentdex/WSB-GPT-7B WSB-GPT-13B Model: huggingface.co/Sentdex/WS...
Chat Interface for your Local Llama LLMs
Переглядів 20 тис.10 місяців тому
A tutorial of sorts covering how to create streaming chat interfaces using Gradio for the various chat/instruct large language models from HuggingFace. Sample code: huggingface.co/spaces/Sentdex/StableBeluga-7B-Chat/blob/main/app.py Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.co...
Gzip is all You Need! (This SHOULD NOT work)
Переглядів 150 тис.10 місяців тому
Github code: github.com/Sentdex/Simple-kNN-Gzip Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex Instagram: sentdex Facebook: pythonprogramming.net/ Twitc...
Better Attention is All You Need
Переглядів 62 тис.11 місяців тому
Addressing the current state of attention for artificial intelligence and why it's currently holding back maximum context lengths. Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming.net/support-donate/ Twitter: sentdex I...
The BEST Open Source LLM? (Falcon 40B)
Переглядів 97 тис.11 місяців тому
TII Call for Proposals with Falcon 40B: falconllm.tii.ae/proposal.php Falcon Github samples: github.com/Sentdex/Falcon-LLM TermGPT: ua-cam.com/video/O4EmRi0_CI4/v-deo.html GPT-4 Overview: ua-cam.com/video/lJNblY3Madg/v-deo.html Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/s...
OpenAI GPT-4 Function Calling: Unlimited Potential
Переглядів 227 тис.Рік тому
Function calling is a new capability for OpenAI's GPT-4 and GPT-3.5 via the API. Function-calling allows you to extract structured outputs from the GPT model. Github notebook: github.com/Sentdex/ChatGPT-API-Basics/blob/main/function_calling.ipynb Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit:...
Letting GPT-4 Control My Terminal (TermGPT)
Переглядів 74 тис.Рік тому
Giving LLMs like GPT-4 the ability to plan and execute terminal commands. TermGPT github: github.com/Sentdex/TermGPT/ OpenAI Chat API tutorial: github.com/Sentdex/ChatGPT-API-Basics Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pythonprogramming...
Building an Open Assistant API
Переглядів 30 тис.Рік тому
Working with one of the Open Assistant models, a 12B parameter Pythia model (huggingface.co/OpenAssistant/oasst-sft-4-pythia-12b-epoch-3.5) Github: github.com/Sentdex/OpenAssistant_API_Pythia_12B Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/channels/fzlCWGWYyIQ0aLC5w48gBQ.htmljoin Discord: discord.gg/sentdex Reddit: www.reddit.com/r/sentdex/ Support the content: pyt...
Sparks of AGI? - Analyzing GPT-4 and the latest GPT/LLM Models
Переглядів 58 тис.Рік тому
An in-depth look into the current state of the art of Generative Pre-trained Transformer (GPT) language models, with a specific focus on the advancements and examples provided by OpenAI in their GPT4 Technical Report (arxiv.org/abs/2303.08774) as well as the Microsoft "Sparks of AGI" Paper (arxiv.org/abs/2303.12712). Neural Networks from Scratch book: nnfs.io Channel membership: ua-cam.com/chan...
ChatGLM: The ChatGPT killer? Checking out ChatGLM6B
Переглядів 102 тис.Рік тому
ChatGLM: The ChatGPT killer? Checking out ChatGLM6B
GPT Journey - A text and image game with ChatGPT
Переглядів 35 тис.Рік тому
GPT Journey - A text and image game with ChatGPT
ChatGPT API in Python
Переглядів 178 тис.Рік тому
ChatGPT API in Python
Image Editing A.I.
Переглядів 21 тис.Рік тому
Image Editing A.I.
The AI wars: Google vs Bing (ChatGPT)
Переглядів 109 тис.Рік тому
The AI wars: Google vs Bing (ChatGPT)
ChatGPT Writes a Chatbot AI
Переглядів 188 тис.Рік тому
ChatGPT Writes a Chatbot AI
OpenAI's ChatGPT is a MASSIVE step forward in Generative AI
Переглядів 470 тис.Рік тому
OpenAI's ChatGPT is a MASSIVE step forward in Generative AI
Google A.I. Diffusion Image Editing w/ Prompt to Prompt
Переглядів 41 тис.Рік тому
Google A.I. Diffusion Image Editing w/ Prompt to Prompt
Google's DreamFusion AI: Text to 3D
Переглядів 200 тис.Рік тому
Google's DreamFusion AI: Text to 3D
Open AI’s Whisper is Amazing!
Переглядів 479 тис.Рік тому
Open AI’s Whisper is Amazing!
The Future of User Interfaces with A.I.
Переглядів 70 тис.Рік тому
The Future of User Interfaces with A.I.
Creating Stable Diffusion Interpolation Videos
Переглядів 54 тис.Рік тому
Creating Stable Diffusion Interpolation Videos
Exploring an AI’s Imagination (Stable Diffusion and MidJourney)
Переглядів 146 тис.Рік тому
Exploring an AI’s Imagination (Stable Diffusion and MidJourney)
$5 MILLION AI for FREE
Переглядів 426 тис.Рік тому
$5 MILLION AI for FREE
Does a Deep Learning Laptop Exist? - Tensorbook Review
Переглядів 50 тис.Рік тому
Does a Deep Learning Laptop Exist? - Tensorbook Review

КОМЕНТАРІ

  • @spysec6525
    @spysec6525 День тому

    After years 🤣

  • @tkumardalai9995
    @tkumardalai9995 День тому

    its a nice explanation and a good hack

  • @gabrielcampos295
    @gabrielcampos295 День тому

    i could not find a video that explained as well as this for the past 2 months, this is gold, congrats, you're amazing!

  • @prateekyadav9811
    @prateekyadav9811 2 дні тому

    Harrison, please inform us if you plan to continue the tutorial series for NNFS. It's a brilliant piece of work. Please do tell us either way. You been super quiet on this for long :P Mighty thanks for your content!

  • @prateekyadav9811
    @prateekyadav9811 2 дні тому

    Bhai please continue this series!! It's so bloody awesome.

  • @prateekyadav9811
    @prateekyadav9811 2 дні тому

    Why don't we clip the predicted values as 1e-7 to 1. I get why we don't add 1e-7 to 1 to give the max possible value cause then the negative log will be negative and negative loss doesn't mean anything. But why subtract 1e-7 from 1 instead? Why not we keep it as 1?

  • @prateekyadav9811
    @prateekyadav9811 2 дні тому

    So few comments compared to the views! Suggests that people aren't really following through. Sad. This is gold!

  • @sevdattufanogullari6581
    @sevdattufanogullari6581 3 дні тому

    Why is the bias added to the end of the formula?

  • @igorson7929
    @igorson7929 3 дні тому

    4:54, actually no output after Softmax activation function can be 0, because when you softmax you take euler number to power of your input, and you can never get 0 out of this unless you take e to power of negative infinity, but I assume that your input is never negative infinity. Then when you normalize the final output you also cannot get 0 out of this.

    • @prateekyadav9811
      @prateekyadav9811 3 дні тому

      I am guessing its possible due to lack of computational power for the lack of a better term. We might obtain a number to the order of negative 40 that might be rounded off to 0 after normalization. Just a guess. Your comment made me wonder.

  • @Moai_rocc
    @Moai_rocc 3 дні тому

    9th grade me trying to make a bot because I couldn't make a good economic simulation when I dictated when someone did something edit: I may have no idea what the weird E / W rotated 90 degrees is

  • @SIDAMKRISHNA-sf7jk
    @SIDAMKRISHNA-sf7jk 3 дні тому

    you got a new cup for every video and what do you drink

  • @yugandharsurya
    @yugandharsurya 4 дні тому

    Please upload other videos 🥹🥹

  • @Ed-ix2vk
    @Ed-ix2vk 4 дні тому

    is numpy 2 ok?

  • @prateekyadav9811
    @prateekyadav9811 4 дні тому

    Absolutely love this series! I have a question: how did we arrive at fitting functions using NN? I thought we use NN for classification. And when fitting to the sine function, what would input data be like? What would be its features? Just one value i.e. x?

  • @SIDAMKRISHNA-sf7jk
    @SIDAMKRISHNA-sf7jk 5 днів тому

    i never expect you to be a car enthusiastic POV : man with actual skills

  • @woojay
    @woojay 5 днів тому

    Thank you so much.

  • @mohammedsufyanrizvi2595
    @mohammedsufyanrizvi2595 5 днів тому

    Why is this not complete????????????

  • @laithovkh8303
    @laithovkh8303 5 днів тому

    if you are here to learn the coding for Neural network, I am advising you to leave now and dont waste your time like we did he just selling his stupid book this series are unfinished for years and there are now launching nor backpropagation coding it is all a scam, try others like 3blue one brown and good luck.

  • @laithovkh8303
    @laithovkh8303 5 днів тому

    i spent 3 hours watching this just to realize this dude does not know also how neural network works, just changed the subject and made different types of random videos.

  • @The_Quaalude
    @The_Quaalude 5 днів тому

    Thank you for explaining this concept with actual python code ‼️🔥🔥🔥

  • @BatteryProductions
    @BatteryProductions 6 днів тому

    you need client socket instead of just s.send so that s socket can keep listening for more connections.. remember the 5 queue.. well if 5 connections come, those connections get passed to 5 client socket objects so s can then go get other 5 connections...

  • @sagarpanwar546
    @sagarpanwar546 6 днів тому

    How can I order the Hard copy of the Book ?

  • @JusticeNDOU
    @JusticeNDOU 6 днів тому

    you cannot DOS someone if you do not await, because once you do not await you probably never even had a coroutine or you have a coroutine but its not sent to the event loop, the effect in eiher way is that you end up with an awaitable which is never executed

  • @alessandrocoppelli3056
    @alessandrocoppelli3056 7 днів тому

    hello,i'm trying to use PPO and A2C for my discrete-box environment. i have set negative rewards in order to teach the agent to avoid impossible operation in my environment. most of the training time is spent to learn to avoid those operations with negative rewards. Is there a method to directly "tell" the agent (inside the agent itself) to avoid those operations, instead of spend training time? thanks in advance

  • @Harshit-qc5ck
    @Harshit-qc5ck 7 днів тому

  • @Harshit-qc5ck
    @Harshit-qc5ck 7 днів тому

  • @Ukraine-0
    @Ukraine-0 7 днів тому

    WTF MAN WHY DID I STUDY VERTEXES AND JOINTS AND ALL THAT STUFF?!

  • @Codingskills-ne2lh
    @Codingskills-ne2lh 8 днів тому

    Hey man looks like the command "py -3.7" is not working for me. I doo have python 3.12 installed and when I check the list of environments it shows 3.12 and 3.8. How can I add 3.7 here. Should I go and add it in the env path and downloading python 3.7?

  • @michaelmoore7568
    @michaelmoore7568 9 днів тому

    Is Sentdex the best Python coder in the world?

  • @snipplerifsky
    @snipplerifsky 10 днів тому

    4 years later and still one of the best series on the subject

  • @helloeveryone6614
    @helloeveryone6614 11 днів тому

    Thankyou bro❤

  • @Anandgamerzzz-ru8xt
    @Anandgamerzzz-ru8xt 11 днів тому

    Get an error string indices must be integers, not 'str'

  • @rachaelkibicho7461
    @rachaelkibicho7461 11 днів тому

    Can anyone access his python shell scripts? I'm having trouble with the input at the attribute category_slug. Were we not supposed to fill that attribute at the python shell. I feel lost

  • @Harshit-qc5ck
    @Harshit-qc5ck 11 днів тому

  • @imad-ice
    @imad-ice 11 днів тому

    why is this resource being underrated, people nowadays import tensorflow, and PyTorch, -> say u build a neural network, but this man combined theory and lab in such a way that it is easy to understand

  • @Uncreeperble
    @Uncreeperble 11 днів тому

    "You won't get through this in a weekend" Challenge accepted.

  • @No7es-Txken
    @No7es-Txken 11 днів тому

    Loved the videos, really helped me understand Neural Networks deeper. I’m looking forward to buying the book but I also enjoy the videos that you do. Are you going to continue video support for this series or is the book my only way forward from here?

  • @norimashiron3635
    @norimashiron3635 12 днів тому

    for row in range(len(game)): diag1.append(game[row][row]) diag2.append(game[row][len(game)-1-row])

  • @souravdey1227
    @souravdey1227 12 днів тому

    Your videos always feel so grounded. Most other videos on such topics make me feel so insignificant. Also, I have a friend, Zeke, from Florida, and you two are soo uncannily similar!!!!

  • @namitshah9535
    @namitshah9535 13 днів тому

    That ain't a home lab it feels like a tiny datacenter

  • @proflead
    @proflead 13 днів тому

    Epic video! :)

  • @Eisenwolf.Ironwulf
    @Eisenwolf.Ironwulf 13 днів тому

    Still here ... still waiting ... with my book and my unwavering Will to wait

    • @DawitMengistuAbajifar
      @DawitMengistuAbajifar 8 днів тому

      I can't afford to buy the book? Any way you can sent it to me?

    • @Eisenwolf.Ironwulf
      @Eisenwolf.Ironwulf 7 днів тому

      @@DawitMengistuAbajifar Sorry, but i cant give my copie away, I still need it. But if you cant afford it, ask sentdex, mby he can give you a code or something. Thats far better than pirating it.

    • @DawitMengistuAbajifar
      @DawitMengistuAbajifar 7 днів тому

      @@Eisenwolf.Ironwulf alright, found it anyway.

  • @rajvaidya1691
    @rajvaidya1691 14 днів тому

    this is going over my head now

  • @GamerDevIND
    @GamerDevIND 14 днів тому

    Brain is glitching

  • @VyreBET
    @VyreBET 14 днів тому

    Can I Also try this on Pycharm IDE? Or do I Really need to Install SUblime?

  • @gamagambogam
    @gamagambogam 14 днів тому

    I just purchased your book! You are too kind

  • @prathyushmadhu2861
    @prathyushmadhu2861 14 днів тому

    Does anybody know about that copilot he used to speed up the decompressing process?

  • @randomdude5634
    @randomdude5634 15 днів тому

    hi

  • @gamagambogam
    @gamagambogam 15 днів тому

    bless you!

  • @rook451
    @rook451 15 днів тому

    Love your website. Thank you.