21 Tutorial Topics
Remove Filter ![]() | |
On September 25, 2024, Meta released [the Llama 3.2 series of multimodal models](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/). The models are lightweight yet extremely powerful for image-to-text and text-to-text tasks. In this article, you will learn how to use the Llama 3.2 Vision Instruct model for general image analysis, graph analysis, and facial sentiment prediction. … | |
On September 19, 2024, [Alibaba released the Qwen 2.5 series of models](https://qwenlm.github.io/blog/qwen2.5/). The Qwen 2.5-72B base and instruct models outperformed larger state-of-the-art models like Llama 3.1-405B on multiple benchmarks. It is safe to assume that Qwen 2.5-72B is a state-of-the-art open-source large language model. This article will show you how … | |
## Introduction ## In a previous article, I explained [how to fine-tune the vision transformer model for image classification in PyTorch](https://www.daniweb.com/programming/computer-science/tutorials/540749/fine-tuning-vision-transformer-for-image-classification-in-pytorch). In this article, I will explain how to fine-tune the pre-trained OpenAI Whisper model for audio classification in PyTorch. Audio classification is an important task that can be applied … | |
Large language models (LLMS) are trained to predict the next token (set of characters) following an input sequence of tokens. This makes LLMs suitable for unstructured textual responses. However, we often need to extract structured information from unstructured text. With the Python [LangChain](https://www.langchain.com/) module, you can extract structured information in … | |
In my previous articles, I presented a [comparison of OpenAI GPT-4o mini model with GPT-4o and GPT-3.5 turbo models for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o). The results showed that GPT-4o mini, while significantly cheaper than its counterparts, achieves comparable performance. On 8 August 2024, OpenAI enabled GPT-4o mini fine-tuning for developers across … | |
In my [previous article](https://www.daniweb.com/programming/computer-science/tutorials/542132/comparing-gpt-4o-vs-claude-3-5-sonnet-for-zero-shot-text-classification) I presented results comparing Anthropic [Claude 3.5 Sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) and [OpenAI GPT-4o](https://openai.com/index/hello-gpt-4o/) models for zero-shot text classification. The results showed that the Claude 3.5 Sonnet significantly outperformed GPT-4o. These results motivated me to develop a simple retrieval augmented generation system with [LangChain](https://www.langchain.com/) that enables the Claude 3.5 … | |
As a data scientist, I have extensively used the Hugging Face library for processing unstructured data such as images, text, and audio. My previous blogs have covered various transformer models for these types of data. Lately, however, I discovered that Hugging Face also provides transformer models for tabular data. One … | |
In this tutorial, you will see how to summarize YouTube video transcriptions using [Distil Whisper large V3](https://huggingface.co/distil-whisper/distil-large-v3) and [Mistral-7b-Instruct](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2). Both Distill Whisper Large V3 and Mistral-7B-Instruct models are open-source and free-to-use models. The Distil Whisper large V3 model is a faster and smaller variant of the [Whisper large V3 model](https://huggingface.co/openai/whisper-large-v3), … | |
In previous articles, I explained how to use natural language to interact with [PDF documents](https://www.daniweb.com/programming/computer-science/tutorials/541732/paris-olympics-ticket-information-chatbot-with-memory-using-langchain) and [SQL databases](https://www.daniweb.com/programming/computer-science/tutorials/541771/using-natural-language-to-query-sql-databases-with-python-langchain-module), using the Python [LangChain module](https://python.langchain.com/docs/get_started/introduction) and [OpenAI API](https://openai.com/blog/openai-api). In this article, you will learn how to use LangChain and OpenAI API to create a question-answering application that allows you to retrieve information … | |
In my previous article, I explained how I developed a simple chatbot using LangChain and Chat-GPT that can answer queries related to Paris Olympics ticket prices. However, one major drawback with that chatbot is that it can only generate a single response based on user queries. It can not answer … | |
In this article, we will compare two state-of-the-art large language models for zero-shot text classification: [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction) and [OpenAI GPT-4](https://openai.com/research/gpt-4). Zero-shot text classification is a task where a model is trained on a set of labeled examples but can then classify new examples from previously unseen classes. This is … | |
I recently tackled a challenging research task involving multimodal data for a classification problem using [TensorFlow Keras](https://www.tensorflow.org/guide/keras). One of the trickiest aspects was figuring out how to load multimodal data in batches from storage efficiently. While TensorFlow Keras offers helpful functions for batch-loading images from various sources, the documentation and … | |
In a [previous tutorial](https://www.daniweb.com/programming/computer-science/tutorials/541123/stock-price-prediction-using-1d-cnn-in-tensorflow-keras), I covered how to predict future stock prices using a deep learning model with 1D CNN layers. This method is effective for basic time series forecasting. Recently, I've enhanced this model by not just considering past closing prices but also factors like Open, High, Low, Volume, … | |
Facial emotion detection, as the name suggests, involves detecting emotions from faces in images or videos. Recently, I was working on a facial emotion detection task and came across the DeepFace library that implements various state-of-the-art facial emotion detection models. However, in my experience, the performance of the DeepFace library … | |
Stock price prediction is a challenging task that requires analyzing historical trends, market sentiments, economic indicators, and company performance. One of the popular methods for stock price prediction is using deep learning models, such as convolutional neural networks (CNNs). CNNs are a type of neural network that can extract features … | |
In this tutorial, you will learn to fine-tune a [Hugging Face Transformers model](https://huggingface.co/docs/transformers/index) for video classification in PyTorch. The Hugging Face documentation provides an example of performing video classification using the Hugging Face Trainer with one of Hugging Face's built-in datasets. However, the process of fine-tuning a video transformer on … | |
Understanding facial expressions is crucial for various tasks, from recognizing emotions to enhancing security measures. While extracting faces from pictures is easy, doing the same in videos is tricky. Imagine creating videos with only highlighted facial expressions, offering a unique perspective on human interactions. Various tools are available for face … | |
In a previous article, I showed you [how to analyze sentiments using Chat-GPT and data augmentation techniques](https://www.daniweb.com/programming/computer-science/tutorials/540502/sentiment-analysis-with-data-augmentation-using-chatgpt#post2293643). Following that, some readers reached out, asking for a breakdown of fine-tuning a Chat-GPT model. In this article, I will guide you through fine-tuning your Chat-GPT model using your own data. First, I'll … | |
## Introduction ## When working with RecyclerView, sometimes we are required to scroll the RecyclerView in code, especially after addition of a list item. In this tutorial, we will learn how to scroll to specific positions on a RecyclerView. ## Goals ## At the end of the tutorial, you would … | |
## Introduction ## Snackbars are great for displaying brief messages providing feedback to your users. In this tutorial, we will learn how to create Snackbars. ## Goals ## At the end of the tutorial, you would have learned: 1. How to create a Snackbar. ## Tools Required ## 1. Android … | |
![]() | ## Contents ## Overview 1. Alternatives to Deprecated mysql_* Functions 2. SQL Injection: What to do with $_POST and $_GET variables 3. How to Insert Tablenames and Fieldnames Properly 4. What to do With Dynamic Values? 5. How to Use Aggregate Functions (COUNT, SUM, MIN etc) 6. How to Paginate … |