On April 14, 2025, OpenAI released [GPT-4.1](https://openai.com/index/gpt-4-1/), a model touted as the new state of the art, outperforming GPT-4o on all major benchmarks. As always, I like to evaluate new LLMs on simple tasks like text classification and summarization to see how they compare with current leading models. In this article, I …
This tutorial demonstrates how to build an AI agent that queries SQLite databases using natural language. You will see how to leverage the [LangGraph framework](https://www.langchain.com/langgraph) and the [OpenAI GPT-4o](https://openai.com/index/gpt-4/) model to retrieve natural language answers from an SQLite database, given a natural language query. So, let's begin without further ado. ## …
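To give a flavor of what the tutorial builds, here is a minimal sketch of such an agent. It assumes the `langgraph`, `langchain-community`, and `langchain-openai` packages; the `chinook.db` path and the example question are placeholders.

```python
# Hypothetical sketch of a natural-language-to-SQL agent. The database path
# ("chinook.db") and the question are placeholders.
from langchain_community.utilities import SQLDatabase
from langchain_community.agent_toolkits import SQLDatabaseToolkit
from langchain_openai import ChatOpenAI
from langgraph.prebuilt import create_react_agent

db = SQLDatabase.from_uri("sqlite:///chinook.db")  # placeholder SQLite file
llm = ChatOpenAI(model="gpt-4o")

# The toolkit exposes tools for listing tables, inspecting schemas,
# and executing SQL queries against the database.
tools = SQLDatabaseToolkit(db=db, llm=llm).get_tools()
agent = create_react_agent(llm, tools)

result = agent.invoke(
    {"messages": [("user", "How many customers are from Germany?")]}
)
print(result["messages"][-1].content)
```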
In a [previous article](https://www.daniweb.com/programming/computer-science/tutorials/543028/text-classification-and-summarization-with-deepseek-r1-distill-llama-70b), I presented a comparison of [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) with the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) for text classification and summarization. Both of these models are distilled versions of the original DeepSeek R1 model. Recently, I wanted to try the original version of the DeepSeek R1 model using the DeepSeek API. However, I was …
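For reference, querying the hosted DeepSeek models looks roughly like the snippet below. This is a hedged sketch: DeepSeek exposes an OpenAI-compatible API, and the base URL and `deepseek-reasoner` model name follow its documentation at the time of writing, so verify both before relying on them.

```python
# A minimal sketch, assuming DeepSeek's OpenAI-compatible REST API.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",  # placeholder
    base_url="https://api.deepseek.com",
)

response = client.chat.completions.create(
    model="deepseek-reasoner",  # the hosted DeepSeek-R1 reasoning model
    messages=[{
        "role": "user",
        "content": "Classify the sentiment of this tweet as positive, "
                   "negative, or neutral: 'My flight was delayed again.'",
    }],
)
print(response.choices[0].message.content)
```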
In the [last article](https://www.daniweb.com/programming/computer-science/tutorials/542973/benchmarking-deepseek-r1-for-text-classification-and-summarization#post2300447), I explained how you can use the [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) model for text classification and summarization problems. In this article, we will use the [DeepSeek-R1-Distill-Llama-70b](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) for the same tasks. The following results from [DeepSeek-AI's official paper](https://arxiv.org/pdf/2501.12948) show that `DeepSeek-R1-Distill-Llama-70b` outperforms the other distilled models on 4 out of …
In my previous article, I explained how to fine-tune the [OpenAI GPT-4o model for natural language processing tasks](https://www.daniweb.com/programming/computer-science/tutorials/542333/how-to-fine-tune-the-openai-gpt-4o-model-the-wait-is-finally-over). At OpenAI DevDay, held on October 1, 2024, OpenAI announced that users can now fine-tune OpenAI vision and multimodal models such as GPT-4o and GPT-4o mini. The best part is that fine-tuning vision …
DeepSeek-R1 is a groundbreaking family of reinforcement learning (RL)-driven AI models developed by the Chinese AI firm [DeepSeek](https://www.deepseek.com/). It is designed to rival industry leaders like OpenAI and Google in complex decision-making and optimization problems. In this article, we will benchmark the DeepSeek R1 model for text classification and summarization …
Open-source LLMs are gaining significant traction due to their ability to match the performance of advanced proprietary LLMs. These models are free to use and allow users to modify their source code or fine-tune them on their own systems, making them highly versatile for various applications. Alibaba's [Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and Meta's …
On November 20, 2024, OpenAI updated its GPT-4o model, claiming it is more creative and accurate on several benchmarks. In this article, I compare the GPT-4o November update with the previous version (August update) for text summarization and classification tasks. By the end of this article, you will see whether …
In my previous article, I presented a [comparison of GPT-4o and Claude 3.5 Sonnet for multi-label text classification](https://www.daniweb.com/programming/computer-science/tutorials/542629/openai-gpt-4o-vs-claude-3-5-sonnet-for-multi-label-text-classification). The accuracies achieved by both models were relatively low. Fine-tuning is one solution to overcome the low performance of large language models. With fine-tuning, you can incorporate custom domain knowledge into an LLM's …
In one of my previous articles, you saw a [comparison of GPT-4o vs. Claude 3.5 Sonnet for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542132/comparing-gpt-4o-vs-claude-3-5-sonnet-for-zero-shot-text-classification). In that article, we performed multi-class text classification where input tweets belonged to one of three categories. In this article, we will go a step further and perform zero-shot …
Open-source LLMs, owing to their comparable performance with advanced proprietary LLMs, have been gaining immense popularity lately. Open-source LLMs are free to use, and you can easily modify their source code or fine-tune them on your own systems. [Alibaba's Qwen](https://www.alibabacloud.com/en/solutions/generative-ai/qwen?_p_lc=1) and [Meta's Llama](https://ai.meta.com/blog/meta-llama-3-1/) series of models are two major players in …
On September 19, 2024, [Alibaba released the Qwen 2.5 series of models](https://qwenlm.github.io/blog/qwen2.5/). The Qwen 2.5-72B base and instruct models outperformed larger state-of-the-art models like Llama 3.1-405B on multiple benchmarks. It is safe to assume that Qwen 2.5-72B is a state-of-the-art open-source large language model. This article will show you how …
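As a taste of working with Qwen 2.5, the sketch below loads the instruct model with Hugging Face transformers. The prompt is invented, and the 72B checkpoint requires multi-GPU hardware; the smaller `Qwen/Qwen2.5-7B-Instruct` checkpoint works with the same code.

```python
# Illustrative sketch of running Qwen 2.5 locally with transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-72B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{"role": "user", "content": "Summarize in one sentence: "
             "open-source LLMs now rival proprietary models."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```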
Large language models (LLMs) are trained to predict the next token (a set of characters) following an input sequence of tokens. This makes LLMs well suited to generating unstructured textual responses. However, we often need to extract structured information from unstructured text. With the Python [LangChain](https://www.langchain.com/) module, you can extract structured information in …
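The snippet below is a short sketch of what such extraction looks like with LangChain's `with_structured_output` helper; the `Person` schema and the input sentence are invented for illustration.

```python
# A short sketch of structured extraction with LangChain.
from langchain_openai import ChatOpenAI
from pydantic import BaseModel, Field

class Person(BaseModel):
    """Structured facts about a person mentioned in free text."""
    name: str = Field(description="The person's full name")
    age: int = Field(description="The person's age in years")

# with_structured_output makes the model return a validated Person
# object instead of raw text.
llm = ChatOpenAI(model="gpt-4o").with_structured_output(Person)

person = llm.invoke("Alice Johnson, 34, was appointed CTO last week.")
print(person.name, person.age)  # -> Alice Johnson 34
```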
On August 20, 2024, [OpenAI enabled GPT-4o fine-tuning](https://openai.com/index/gpt-4o-fine-tuning/) in the OpenAI playground and the OpenAI API. The much-awaited feature is free for up to 1 million training tokens per day until September 23, 2024. In this article, I will show you how to fine-tune the OpenAI GPT-4o model for text classification and summarization …
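In outline, kicking off a fine-tuning job looks like the hedged sketch below; `train.jsonl` is a placeholder training file, and the exact fine-tunable snapshot name should be checked against OpenAI's documentation.

```python
# Hedged sketch of launching a GPT-4o fine-tuning job with the OpenAI SDK.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# 1. Upload the training data (JSONL, one chat-formatted example per line).
training_file = client.files.create(
    file=open("train.jsonl", "rb"), purpose="fine-tune"
)

# 2. Start the fine-tuning job on a GPT-4o snapshot.
job = client.fine_tuning.jobs.create(
    training_file=training_file.id,
    model="gpt-4o-2024-08-06",
)
print(job.id, job.status)
```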
In a previous article, I compared [GPT-4o mini vs. GPT-4o and GPT-3.5 Turbo for zero-shot text summarization](https://www.daniweb.com/programming/computer-science/tutorials/542208/gpt-4o-mini-vs-gpt-4o-vs-gpt-3-5-turbo-for-text-summarization). The results showed that GPT-4o mini achieves almost the same performance for zero-shot text summarization at a much-reduced price compared to the other models. I will compare Meta Llama 3.1 70b with OpenAI …
In a previous article, I presented a [comparison of the OpenAI GPT-4o mini model with the GPT-4o and GPT-3.5 Turbo models for zero-shot text classification](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o). The results showed that GPT-4o mini, while significantly cheaper than its counterparts, achieves comparable performance. On August 8, 2024, OpenAI enabled GPT-4o mini fine-tuning for developers across …
In my previous [article on GPT-4o mini](https://www.daniweb.com/programming/computer-science/tutorials/542182/gpt-4o-mini-a-cheaper-and-faster-alternative-to-gpt-4o), I compared the performance of GPT-4o mini against GPT-3.5 Turbo and GPT-4o for zero-shot text classification. We saw that GPT-4o mini, being roughly 36 times cheaper, achieves only 2% less accuracy than GPT-4o. Furthermore, while being 1/3 of the price, the GPT-4o mini significantly …
On July 18, 2024, [OpenAI released GPT-4o mini](https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/), their most cost-efficient small model. GPT-4o mini is around 60% cheaper than GPT-3.5 Turbo and around 97% cheaper than GPT-4o. As per OpenAI, GPT-4o mini outperforms GPT-3.5 Turbo on almost all benchmarks while being cheaper. In this article, we will compare the …
On June 20, 2024, Anthropic released the [Claude 3.5 Sonnet](https://www.anthropic.com/news/claude-3-5-sonnet) large language model. Anthropic claims it is the state-of-the-art model for many natural language processing tasks, surpassing the [OpenAI GPT-4o model](https://openai.com/index/hello-gpt-4o/). My first test for comparing two large language models is their zero-shot text classification ability. In this article, …
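A zero-shot classification call against Claude 3.5 Sonnet looks roughly like this sketch; the tweet is made up, and the model identifier matches Anthropic's release announcement.

```python
# Minimal sketch of zero-shot sentiment classification with the Anthropic SDK.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

message = client.messages.create(
    model="claude-3-5-sonnet-20240620",
    max_tokens=10,
    messages=[{
        "role": "user",
        "content": "Classify the sentiment of this tweet as positive, "
                   "negative, or neutral. Reply with one word only.\n\n"
                   "Tweet: 'Great crew, terrible legroom.'",
    }],
)
print(message.content[0].text)
```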
# Comparison Between Fine-tuned and Default GPT-3.5 Turbo for Text Classification

In one of my previous articles, I showed you how to perform [zero-shot text classification using OpenAI GPT-4o and Meta Llama 3 models](https://www.daniweb.com/programming/computer-science/tutorials/542001/openai-gpt-4o-vs-meta-llama-3-for-zero-shot-text-classifiation). I used the default models for predicting sentiments of airline tweets. The default models perform substantially …
On April 18, 2024, Meta AI released [Llama 3](https://ai.meta.com/blog/meta-llama-3/), which they claimed to be the most capable openly available LLM to date. Concurrently, OpenAI announced [GPT-4o (omni)](https://community.openai.com/t/announcing-gpt-4o-in-the-api/744700) on May 13, 2024, which is touted as the state-of-the-art proprietary model on various NLP benchmarks. As a guy who loves to compare …
On March 4, 2024, [Anthropic](https://www.anthropic.com/) launched the [Claude 3 family of large language models](https://www.anthropic.com/news/claude-3-family). Anthropic claimed that its Claude 3 Opus model outperforms GPT-4 on various benchmarks. Intrigued by Anthropic's claim, I performed a simple test to compare the performances of Claude 3 Opus, [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction), and [OpenAI's GPT-4](https://openai.com/research/gpt-4) …
In a previous article, I explained [how to fine-tune Google's Gemma model for text classification](https://www.daniweb.com/programming/computer-science/tutorials/541544/fine-tuning-google-gemma-model-for-text-classification-in-python). In this article, I will explain how you can improve the performance of a pretrained large language model (LLM) using the retrieval augmented generation (RAG) technique. So, let's begin without further ado. ## What is Retrieval Augmented Generation …
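As a preview, the sketch below shows RAG in miniature, assuming LangChain with a FAISS vector store and OpenAI embeddings; the two documents and the question are placeholders.

```python
# A minimal RAG sketch: index, retrieve, then generate with context.
from langchain_community.vectorstores import FAISS
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

docs = [
    "Gemma is a family of open-source LLMs released by Google in 2024.",
    "RAG augments an LLM prompt with passages retrieved from a knowledge base.",
]

# 1. Index the documents in a vector store.
store = FAISS.from_texts(docs, OpenAIEmbeddings())

# 2. Retrieve the passages most relevant to the question.
question = "Who released Gemma?"
context = "\n".join(d.page_content for d in store.similarity_search(question, k=2))

# 3. Generate an answer grounded in the retrieved context.
llm = ChatOpenAI(model="gpt-4o")
answer = llm.invoke(f"Answer using only this context:\n{context}\n\nQuestion: {question}")
print(answer.content)
```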
**Discover the world of AI scams and find out how you can shield yourself against the cunning deceptions of deepfakes.** In an incident that underscores the alarming capabilities of artificial intelligence in the realm of fraud, a company in Hong Kong was [defrauded of $25 million](https://www.businessinsider.com/deepfake-coworkers-video-call-company-loses-millions-employee-ai-2024-2) earlier this year. …
On February 21, 2024, Google released [Gemma](https://ai.google.dev/gemma), a family of state-of-the-art open-source large language models (LLMs). As per initial results, its 7b (seven-billion-parameter) version performs better than Meta's [Llama 2](https://llama.meta.com/), the previous state-of-the-art open-source LLM. As always, my first test with any new open-source LLM …
Integrating language models like ChatGPT into third-party applications has become increasingly popular due to their ability to comprehend and generate human-like text. However, it's crucial to acknowledge the limitations of ChatGPT, such as its knowledge cut-off date of September 2021 and its inability to access external sources like Wikipedia or …
**Tracing AI-generated content in online news articles with corpus linguistics** *A query in the 'News on the Web' Corpus reveals that the use of the word 'tapestry' in online articles more than doubled last year – from 3,085 instances in 2022 to 7,891 instances in 2023* “Today, we …
In this article, we will compare two state-of-the-art large language models for zero-shot text classification: [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction) and [OpenAI GPT-4](https://openai.com/research/gpt-4). Zero-shot text classification is a task where a model is trained on a set of labeled examples but can then classify new examples from previously unseen classes. This is …
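On the Gemini side, a zero-shot classification call looks roughly like the sketch below; the `gemini-pro` model name and the review text are assumptions to verify against Google's current API.

```python
# Hedged sketch of zero-shot classification with the google-generativeai SDK.
import google.generativeai as genai

genai.configure(api_key="YOUR_GOOGLE_API_KEY")  # placeholder
model = genai.GenerativeModel("gemini-pro")

prompt = (
    "Classify the sentiment of the following review as positive, negative, "
    "or neutral. Reply with a single word.\n\n"
    "Review: 'The battery dies within two hours.'"
)
response = model.generate_content(prompt)
print(response.text)
```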
## Introduction

This tutorial explains how to perform multi-label text classification using the [Hugging Face](https://huggingface.co/) transformers library. The Hugging Face library implements advanced transformer architectures, proven to be state-of-the-art for various natural language processing tasks, including text classification. The Hugging Face library provides trainable transformer models in three flavors: 1. Via …
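The core setup for the multi-label case is sketched below; the `bert-base-uncased` checkpoint and the six-label space are arbitrary examples, not the tutorial's exact configuration.

```python
# Sketch of configuring a transformer for multi-label classification.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# problem_type switches the training loss to BCEWithLogitsLoss, so each
# label is scored independently and a text can belong to several classes.
model = AutoModelForSequenceClassification.from_pretrained(
    model_id,
    num_labels=6,
    problem_type="multi_label_classification",
)

inputs = tokenizer("A funny, heart-warming courtroom drama.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
print((torch.sigmoid(logits) > 0.5).int())  # one 0/1 flag per label
```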
Sentiment analysis, a subfield of Natural Language Processing (NLP), aims to discern and classify the underlying sentiment or emotion expressed in textual data. Whether it is understanding customers' opinions about a product, analyzing social media posts, or gauging public sentiment towards a political event, sentiment analysis plays a vital role …
Facial emotion detection, as the name suggests, involves detecting emotions from faces in images or videos. Recently, I was working on a facial emotion detection task and came across the DeepFace library that implements various state-of-the-art facial emotion detection models. However, in my experience, the performance of the DeepFace library …
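For orientation, emotion detection with DeepFace boils down to a single call, as in this sketch; `face.jpg` is a placeholder image path.

```python
# Quick sketch of emotion detection with the DeepFace library.
from deepface import DeepFace

# Recent DeepFace versions return one result dict per detected face.
results = DeepFace.analyze(img_path="face.jpg", actions=["emotion"])

for face in results:
    print(face["dominant_emotion"])  # e.g. "happy"
    print(face["emotion"])           # per-emotion confidence scores
```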
In this tutorial, you will learn to fine-tune a [Hugging Face Transformers model](https://huggingface.co/docs/transformers/index) for video classification in PyTorch. The Hugging Face documentation provides an example of performing video classification using the Hugging Face Trainer with one of Hugging Face's built-in datasets. However, the process of fine-tuning a video transformer on …
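The starting point for such fine-tuning is sketched below, modeled on the VideoMAE example in the Hugging Face documentation; the checkpoint and the three-class label space are stand-ins for a real dataset.

```python
# Hedged sketch of setting up a video transformer for fine-tuning.
import numpy as np
from transformers import VideoMAEForVideoClassification, VideoMAEImageProcessor

checkpoint = "MCG-NJU/videomae-base"
processor = VideoMAEImageProcessor.from_pretrained(checkpoint)
model = VideoMAEForVideoClassification.from_pretrained(
    checkpoint,
    num_labels=3,  # e.g. walking / running / jumping
)

# A video is a list of frames; 16 random frames stand in for real data.
video = list(np.random.randn(16, 3, 224, 224))
inputs = processor(video, return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits.shape)  # torch.Size([1, 3])
```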
In a previous article, I showed you [how to analyze sentiments using ChatGPT and data augmentation techniques](https://www.daniweb.com/programming/computer-science/tutorials/540502/sentiment-analysis-with-data-augmentation-using-chatgpt#post2293643). Following that, some readers reached out, asking for a breakdown of fine-tuning a ChatGPT model. In this article, I will guide you through fine-tuning your ChatGPT model using your own data. First, I'll …
In my recent journey of developing various AI solutions powered by large language models (LLMs), a significant question has emerged: should we harness the capabilities of Retrieval Augmented Generation (RAG), or should we opt for the path of custom fine-tuning? This decision can profoundly impact the performance and adaptability of our …
Data annotation for text classification is time-consuming and expensive. In the case of smaller training datasets, pre-trained ChatGPT models might achieve higher classification accuracy on test sets than training classifiers from scratch or fine-tuning existing models. Additionally, ChatGPT can aid in annotating data for fine-tuning text classification models. In this …