195 Topics

Member Avatar for
Member Avatar for dimitrilc

## Introduction ## In a previous tutorial, we learned [how to create a notification on Android](https://www.daniweb.com/programming/mobile-development/tutorials/536756/android-native-how-to-create-a-notification#post2282507). In that tutorial, we did not have to request any permission because applications did not need permission to post notifications on Android 12 and below. It has been almost a year since the release …

Member Avatar for Erussuhsh
1
656
Member Avatar for usmanmalik57

Large language models are trained on a fixed corpus, and their knowledge is often limited by the documents they are trained on. Techniques like retrieval augmented generation, continuous pre-training, and fine-tuning enhance an LLM's default knowledge. However, these techniques can still not enable an LLM to answer queries that require …

0
80
Member Avatar for usmanmalik57

On April 14, 2025, OpenAI released [GPT-4.1](https://openai.com/index/gpt-4-1/) — a model touted as the new state-of-the-art, outperforming GPT-4o on all major benchmarks. As always, I like to evaluate new LLMs on simple tasks like text classification and summarization to see how they compare with current leading models. In this article, I …

2
84
Member Avatar for usmanmalik57

In a [previous article](https://www.daniweb.com/programming/computer-science/tutorials/543028/text-classification-and-summarization-with-deepseek-r1-distill-llama-70b), I presented a comparison of [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) with the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) for text classification and summarization. Both these models are distilled versions of the original DeepSeek R1 model. Recently, I wanted to try the original version of the DeepSeek R1 model using the DeepSeek API. However, I was …

1
159
Member Avatar for usmanmalik57

In the [last article](https://www.daniweb.com/programming/computer-science/tutorials/542973/benchmarking-deepseek-r1-for-text-classification-and-summarization#post2300447), I explained how you can use the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) model for text classification and summarization problems. In this article, we will use the [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) for the same tasks. Following results from the [DeepSeek-AI's official paper](https://arxiv.org/pdf/2501.12948) show that `DeepSeek-R1-Distill-Llama-70b` outperform the other distilled models on 4 out of …

Member Avatar for Pelorus_1
0
809
Member Avatar for Dani

The Markdown editor that we use here at DaniWeb is called [CodeMirror](https://codemirror.net/) and we've been using it for forever. The other day, I noticed some bugs in the editor toolbar that we use, which is based on the [CodeMirror API](https://codemirror.net/docs/ref/). (The one that allows the buttons for bold, italic, etc.) …

1
84
Member Avatar for usmanmalik57

In my previous article, I explained how to fine-tune [OpenAI GPT-4o model for natural language processing tasks](https://www.daniweb.com/programming/computer-science/tutorials/542333/how-to-fine-tune-the-openai-gpt-4o-model-the-wait-is-finally-over). In OpenAI DevDay, held on October 1, 2024, OpenAI announced that users can now fine-tune OpenAI vision and multimodal models such as GPT-4o and GPT-4o mini. The best part is that fine-tuning vision …

Member Avatar for RKE2
2
307
Member Avatar for usmanmalik57

DeepSeek-R1 is a groundbreaking family of reinforcement learning (RL)-driven AI models developed by the Chinese AI firm [DeepSeek](https://www.deepseek.com/). It is designed to rival industry leaders like OpenAI and Google in complex decision-making and optimization problems. In this article, we will benchmark the DeepSeek R1 model for text classification and summarization …

1
1K
Member Avatar for usmanmalik57

Open-source LLMs are gaining significant traction due to their ability to match the performance of advanced proprietary LLMs. These models are free to use and allow users to modify their source code or fine-tune them on their own systems, making them highly versatile for various applications. Alibaba's [Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and Meta's …

4
826
Member Avatar for usmanmalik57

On November 20, 2024, OpenAI updated its GPT-4o model, claiming it is more creative and accurate on several benchmarks. In this article, I compare the GPT-4o November update with the previous version (August update) for text summarization and classification tasks. By the end of this article, you will see whether …

2
209
Member Avatar for usmanmalik57

In my previous article, I presented a [comparison of GPT-4o and Claude 3.5 Sonnet for multi-label text classification](https://www.daniweb.com/programming/computer-science/tutorials/542629/openai-gpt-4o-vs-claude-3-5-sonnet-for-multi-label-text-classification). The accuracies achieved by both models were relatively low. Fine-tuning is one solution to overcome the low performance of large-language models. With fine-tuning, you can incorporate custom domain knowledge into an LLM's …

2
244
Member Avatar for usmanmalik57

In one of my previous articles, you saw a [comparison of GPT-4o vs. Claude 3.5 sonnet for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542132/comparing-gpt-4o-vs-claude-3-5-sonnet-for-zero-shot-text-classification). In that article; we performed multi-class text classification where input tweets belonged to one of the three categories. In this article, we will go a step further and perform zero-shot …

2
182
Member Avatar for usmanmalik57

On September 25, 2024, Meta released [the Llama 3.2 series of multimodal models](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/). The models are lightweight yet extremely powerful for image-to-text and text-to-text tasks. In this article, you will learn how to use the Llama 3.2 Vision Instruct model for general image analysis, graph analysis, and facial sentiment prediction. …

3
431
Member Avatar for usmanmalik57

This article explains how to create a retrieval augmented generation (RAG) chatbot in LangChain using open-source models from [Hugging Face serverless inference API](https://huggingface.co/docs/api-inference/en/index). You will see how to call large language models (LLMs) and embedding models from Hugging Face serverless inference API using LangChain. You will also see how to …

2
758
Member Avatar for usmanmalik57

Open-source LLMS, owing to their comparable performance with advanced proprietary LLMs, have been gaining immense popularity lately. Open-source LLMs are free to use, and you can easily modify their source code or fine-tune them on your systems. [Alibaba's Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and [Meta's Llama](https://ai.meta.com/blog/meta-llama-3-1/) series of models are two major players in …

Member Avatar for Brandon_38
2
3K
Member Avatar for usmanmalik57

In one of my previous articles, I explained [how to generate stunning images for free using diffusion models](https://www.daniweb.com/programming/computer-science/tutorials/541898/generate-stunning-ai-images-for-free-using-diffusion-models) and showed how to generate Stability AI's diffusion models for text-to-image generation. Since then, the AI domain has progressed considerably, particularly in image generation. Black Forest Labs has released [Flux.1 series of …

Member Avatar for autowrecking
3
309
Member Avatar for usmanmalik57

On September 19, 2024, [Alibaba released the Qwen 2.5 series of models](https://qwenlm.github.io/blog/qwen2.5/). The Qwen 2.5-72B base and instruct models outperformed larger state-of-the-art models like Llama 3.1-405B on multiple benchmarks. It is safe to assume that Qwen 2.5-72B is a state-of-the-art open-source large language model. This article will show you how …

3
2K
Member Avatar for Johannes C.

**Find out how the recent leak of Google’s internal documents affects SEO strategies: All key takeaways and the most important implications for optimizing your content in light of the leak.** ![google-leak-seo-takeaways.jpg](https://static.daniweb.com/attachments/4/33bc1ca16f29c159257318bde2d7ef39.jpg) It’s been one week since news about leaked API documents providing insights into Google’s search algorithm surfaced, and the …

Member Avatar for emiilyyjohnson2
1
267
Member Avatar for usmanmalik57

On August 20, 2024, [OpenAI enabled GPT-4o fine-tuning](https://openai.com/index/gpt-4o-fine-tuning/) in the OpenAI playground and the OpenAI API. The much-awaited feature is free for fine-tuning 1 million daily tokens until September 23, 2024. In this article, I will show you how to fine-tune the OpenAI GPT-4o model for text classification and summarization …

2
1K
Member Avatar for usmanmalik57

In a previous article, I compared [GPT-4o mini vs. GPT-4o and GPT-3.5 Turbo for zero-shot text summarization](https://www.daniweb.com/programming/computer-science/tutorials/542208/gpt-4o-mini-vs-gpt-4o-vs-gpt-3-5-turbo-for-text-summarization). The results showed that the GPT-4o mini achieves almost similar performance for zero-shot text classification at a much-reduced price compared to the other models. I will compare Meta Llama 3.1 70b with OpenAI …

2
1K
Member Avatar for usmanmalik57

In my previous articles, I presented a [comparison of OpenAI GPT-4o mini model with GPT-4o and GPT-3.5 turbo models for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o). The results showed that GPT-4o mini, while significantly cheaper than its counterparts, achieves comparable performance. On 8 August 2024, OpenAI enabled GPT-4o mini fine-tuning for developers across …

1
312
Member Avatar for usmanmalik57

In my previous [article on GPT-4o mini](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o), I compared the performance of GPT-4o mini against GPT-3.5 Turbo and GPT-4o for zero-shot text classification. We saw that GPT-4o mini, being 36% times cheaper, achieves only 2% less accuracy than GPT-4o. Furthermore, while being 1/3 of the price, the GPT-4o mini significantly …

1
297
Member Avatar for usmanmalik57

On July 18th, 2024, [OpenAI released GPT-4o mini](https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/), their most cost-efficient small model. GPT-4o mini is around 60% cheaper than GPT-3.5 Turbo and around 97% cheaper than GPT-4o. As per OpenAI, GPT-4o mini outperforms GPT-3.5 Turbo on almost all benchmarks while being cheaper. In this article, we will compare the …

3
216
Member Avatar for jkon

LMs are great , but if you don't know how to ask the right questions , break your thoughts in manageable pieces they can't do it for you. Even when you do , you must have a level of experience to distinguish a wrong or crappy response from a legitimate …

Member Avatar for Dani
3
49
Member Avatar for usmanmalik57

In my article on [Image Analysis Using OpenAI GPT-4o Model](https://www.daniweb.com/programming/computer-science/tutorials/542030/image-analysis-using-openai-gpt-4o-model), I explained how GPT-4o model allows you to analyze images and answer questions related images precisely. In this article, I will show you how to analyze images with the [Anthropic Claude 3.5 Sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) model, which has shown state-of-the-art performance for …

2
242
Member Avatar for usmanmalik57

Are you interested in finding out what a YouTube channel mostly discusses? Do you want to analyze YouTube videos of a specific channel? If yes, we are in the same boat. YouTube video titles are a great way to determine the channel's primary focus. Plotting a word cloud or a …

4
105
Member Avatar for usmanmalik57

In my [previous article](https://www.daniweb.com/programming/computer-science/tutorials/542132/comparing-gpt-4o-vs-claude-3-5-sonnet-for-zero-shot-text-classification) I presented results comparing Anthropic [Claude 3.5 Sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) and [OpenAI GPT-4o](https://openai.com/index/hello-gpt-4o/) models for zero-shot text classification. The results showed that the Claude 3.5 Sonnet significantly outperformed GPT-4o. These results motivated me to develop a simple retrieval augmented generation system with [LangChain](https://www.langchain.com/) that enables the Claude 3.5 …

3
1K
Member Avatar for usmanmalik57

On June 20, 2024, Anthropic released the [Claude 3.5 sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) large language model. Claude claims it to be the state-of-the-art model for many natural language processing tasks, surpassing the [OpenAI GPT-4o model](https://openai.com/index/hello-gpt-4o/). My first test for comparing two large language models is their zero-shot text classification ability. In this article, …

3
239
Member Avatar for usmanmalik57

# Comparison Between Fine-tuned and Default GPT-3 Turbo for Text Classification In one of my previous articles, I showed you how to perform [zero-shot text classification using OpenAI GPT-4o and Meta Llama 3 models](https://www.daniweb.com/programming/computer-science/tutorials/542001/openai-gpt-4o-vs-meta-llama-3-for-zero-shot-text-classifiation). I used the default models for predicting sentiments of airline tweets. The default models perform substantially …

2
662
Member Avatar for usmanmalik57

On April 18, 2024, Meta AI released [Llama 3](https://ai.meta.com/blog/meta-llama-3/), which they claimed to be the most capable openly available LLM to date. Concurrently, OpenAI announced [GPT-4o (omni)](https://community.openai.com/t/announcing-gpt-4o-in-the-api/744700) on May 13, 2024, which is touted as the state-of-the-art proprietary model for various NLP benchmarks. As a guy who loves to compare …

2
259
Member Avatar for usmanmalik57

## Introduction Text-to-speech (TTS) technology has revolutionized how we interact with devices, making accessing content through auditory means easier. TTS is vital in various applications such as virtual assistants, audiobooks, accessibility tools for the visually impaired, and language learning platforms. This tutorial will explore how to convert text-to-speech using Hugging …

2
297
Member Avatar for usmanmalik57

In previous articles, I explained how to use natural language to interact with [PDF documents](https://www.daniweb.com/programming/computer-science/tutorials/541732/paris-olympics-ticket-information-chatbot-with-memory-using-langchain) and [SQL databases](https://www.daniweb.com/programming/computer-science/tutorials/541771/using-natural-language-to-query-sql-databases-with-python-langchain-module), using the Python [LangChain module](https://python.langchain.com/docs/get_started/introduction) and [OpenAI API](https://openai.com/blog/openai-api). In this article, you will learn how to use LangChain and OpenAI API to create a question-answering application that allows you to retrieve information …

2
179
Member Avatar for usmanmalik57

I was searching for Paris Olympics ticket prices for tennis games recently. The official website directs you to a [PDF document](https://tickets.paris2024.org/obj/media/FR-Paris2024/ticket-prices.pdf) containing ticket prices and venues for all the games. However, I found the PDF document to be very hard to navigate. To make things easier, I developed a chatbot …

4
85
Member Avatar for usmanmalik57

On March 4, 2024, [Anthropic](https://www.anthropic.com/) launched the [Claude 3 family of large language models](https://www.anthropic.com/news/claude-3-family). Anthropic claimed that its Claude 3 Opus model outperforms GPT-4 on various benchmarks. Intrigued by Anthropic's claim, I performed a simple test to compare the performances of Claude 3 Opus, [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction), and [OpenAI's GPT-4](https://openai.com/research/gpt-4) …

2
163
Member Avatar for usmanmalik57

In the rapidly evolving field of Natural Language Processing (NLP), open-source large language models (LLMs) are becoming increasingly popular as they are free to use. Among these, the [Mistral](https://docs.mistral.ai/models/) family of models stands out as a state-of-the-art model that is freely accessible to the public. Comparable in performance to the …

3
131
Member Avatar for usmanmalik57

In a previous article, I explained [how to fine-tune Google's Gemma model for text classification](https://www.daniweb.com/programming/computer-science/tutorials/541544/fine-tuning-google-gemma-model-for-text-classification-in-python). In this article, I will explain how you can improve performance of a pretrained large language model (LLM) using retrieval augmented generation (RAG) technique. So, let's begin without ado. ## What is Retrieval Augmented Generation …

2
1K
Member Avatar for Johannes C.

**ChatGPT has had lazy days before, but this week’s performance marks an unprecedented low. Here’s why many ChatGPT Pro users are canceling their subscriptions – and even more might follow.** ![lobotomized-chatgpt.jpg](https://static.daniweb.com/attachments/4/7421f2d25c68fdee20e906d1ae1bfec9.jpg) Yes, complaints about ChatGPT being lazy have been around for as long as the LLM itself. I have written …

2
210
Member Avatar for usmanmalik57

Integrating language models like ChatGPT into third-party applications has become increasingly popular due to their ability to comprehend and generate human-like text. However, it's crucial to acknowledge the limitations of ChatGPT, such as its knowledge cut-off date in September 2021 and its inability to access external sources like Wikipedia or …

Member Avatar for catherine_11
3
1K
Member Avatar for usmanmalik57

In this article, we will compare two state-of-the-art large language models for zero-shot text classification: [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction) and [OpenAI GPT-4](https://openai.com/research/gpt-4). Zero-shot text classification is a task where a model is trained on a set of labeled examples but can then classify new examples from previously unseen classes. This is …

1
180
Member Avatar for usmanmalik57

## Introduction ## This tutorial explains how to perform multiple-label text classification using the [Hugging Face](https://huggingface.co/) transformers library. Hugging Face library implements advanced transformer architectures, proven to be state-of-the-art for various natural language processing tasks, including text classification. Hugging Face library provides trainable transformer models in three flavors: 1. Via …

Member Avatar for Aravind_11
1
1K
Member Avatar for usmanmalik57

In a [previous tutorial](https://www.daniweb.com/programming/computer-science/tutorials/541123/stock-price-prediction-using-1d-cnn-in-tensorflow-keras), I covered how to predict future stock prices using a deep learning model with 1D CNN layers. This method is effective for basic time series forecasting. Recently, I've enhanced this model by not just considering past closing prices but also factors like Open, High, Low, Volume, …

0
171
Member Avatar for Johannes C.

**Find out how to use Dall-E 3 to create personalized artwork and breathtaking visuals.** AI image generators are getting better by the day and can be used to create a spectrum of possibilities, from stunning illustrations to hyper-realistic photos. This guide is a toolkit for using Dall-E 3 with ChatGPT, …

4
792
Member Avatar for usmanmalik57

As a researcher, I have often found myself buried under a mountain of research articles, each promising insights and breakthroughs crucial for my work. The sheer volume of information is overwhelming, and the time it takes to extract the relevant data can be daunting. However, extracting meaningful information from research …

Member Avatar for AndreRet
3
280
Member Avatar for Johannes C.

**Yesterday, OpenAI's inaugural [DevDay](https://devday.openai.com/) conference in San Francisco unveiled a series of groundbreaking announcements, leaving the tech community humming with both excitement and a degree of uncertainty. The reveal of GPT-4 Turbo, a new wave of customizable AI through user-friendly APIs, and the promise to protect businesses from copyright infringement …

3
334
Member Avatar for dimitrilc

## Introduction ## In Android Studio, we can run multiple tests in parallel across multiple devices using the dropdown menu. ![Screen_Shot_2022-09-26_at_3.02.02_PM.png](https://static.daniweb.com/attachments/4/a7966192c6b1c265a65010a9d1fa236d.png) Running instrumented tests this way is very convenient during development, but there is a problem with this method when your tests are run on a remote build server: * …

Member Avatar for John_387
2
791
Member Avatar for usmanmalik57

In a previous article, I showed you [how to analyze sentiments using Chat-GPT and data augmentation techniques](https://www.daniweb.com/programming/computer-science/tutorials/540502/sentiment-analysis-with-data-augmentation-using-chatgpt#post2293643). Following that, some readers reached out, asking for a breakdown of fine-tuning a Chat-GPT model. In this article, I will guide you through fine-tuning your Chat-GPT model using your own data. First, I'll …

2
510
Member Avatar for usmanmalik57

In one of my research projects, I needed to extract text from video files and create a CSV file that included sentiments expressed in the text. Manual extraction was time-consuming and costly. So, I explored Automatic Speech Recognition (ASR) systems and discovered OpenAI [Whisper](https://openai.com/research/whisper), known for its high accuracy in …

Member Avatar for Reverend Jim
1
865
Member Avatar for Dani

A couple of years ago, I began using a Cloudflare worker to track my Googlebot traffic in Google Analytics. The initial code I used, and thought process to get there, is [here](https://www.daniweb.com/digital-media/digital-marketing/threads/522837/bot-traffic-in-google-analytics), but here's what my code looks like today: const analyticsId = 'UA-98289-3' addEventListener('fetch', event => { event.passThroughOnException() event.respondWith(handleRequest(event)) …

Member Avatar for Naor
0
135
Member Avatar for usmanmalik57

## Introduction ## In this tutorial, you will see how to convert the text in CSV file columns to other languages using the [DeepL API](https://www.deepl.com/translator) in the Python programing language. DeepL is one of the most popular and accurate text translation platforms. DeepL, as the name suggests, incorporates advanced deep …

1
979
Member Avatar for usmanmalik57

## Introduction ## I was working on a problem where I had to scrape tweets related to the T20 Cricket World Cup 2022, which is currently taking place in Australia. I wanted tweets containing location names (cities) and the keyword “T20”. In the response, I want the user names of …

0
142

The End.