15,167 Topics
| |
Anyone is building Uniswap V4 Hooks? Would love to learn more about what you are working on, looking for developers' pain points to contribute through Uniswap V4 Hook Developer's assistant chatbot! | |
## Introduction Text-to-speech (TTS) technology has revolutionized how we interact with devices, making accessing content through auditory means easier. TTS is vital in various applications such as virtual assistants, audiobooks, accessibility tools for the visually impaired, and language learning platforms. This tutorial will explore how to convert text-to-speech using Hugging … | |
I had some fun this weekend having a go at pdfkit with mixed results. Can anyone nominate a candidate for best PDF writing? | |
In this tutorial, you will see how to generate stunning AI-generated images from text inputs using state-of-the-art diffusion models from [Hugging Face](https://huggingface.co/). You'll learn about base diffusion models and how combining them with a refiner creates even more detailed, refined results. Diffusion models are powerful because they iteratively refine an … | |
Im trying to code a lem-in project. https://github.com/01-edu/public/tree/master/subjects/lem-in Im coding in go lang. But its okay in another language tho. I can convert it to go. Me and my friends cant even solve the algorithm neither. ChatGPT is stuck and in the web i found a 800 rows of code. … | |
In a previous article, I explained [how to extract tabular data from PDF image documents using Multimodal Google Gemini Pro](https://www.daniweb.com/programming/computer-science/tutorials/541449/pdf-image-table-extractor-web-app-with-google-gemini-pro-and-streamlit#post2296083). However, there are a couple of disadvantages with Google Gemini Pro. First, Google Gemini Pro is not free, and second, it needs complex prompt engineering to retrieve table, columns, and … | |
In this tutorial, you will see how to summarize YouTube video transcriptions using [Distil Whisper large V3](https://huggingface.co/distil-whisper/distil-large-v3) and [Mistral-7b-Instruct](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2). Both Distill Whisper Large V3 and Mistral-7B-Instruct models are open-source and free-to-use models. The Distil Whisper large V3 model is a faster and smaller variant of the [Whisper large V3 model](https://huggingface.co/openai/whisper-large-v3), … | |
The advent of large language models (LLM) has replaced complex scripts with natural language for automating various tasks. You can now use LLM to interact with your databases using natural language, which makes life easier for people who do not have sufficient SQL knowledge. In this article, you will learn … | |
In previous articles, I explained how to use natural language to interact with [PDF documents](https://www.daniweb.com/programming/computer-science/tutorials/541732/paris-olympics-ticket-information-chatbot-with-memory-using-langchain) and [SQL databases](https://www.daniweb.com/programming/computer-science/tutorials/541771/using-natural-language-to-query-sql-databases-with-python-langchain-module), using the Python [LangChain module](https://python.langchain.com/docs/get_started/introduction) and [OpenAI API](https://openai.com/blog/openai-api). In this article, you will learn how to use LangChain and OpenAI API to create a question-answering application that allows you to retrieve information … | |
In my previous article, I explained how I developed a simple chatbot using LangChain and Chat-GPT that can answer queries related to Paris Olympics ticket prices. However, one major drawback with that chatbot is that it can only generate a single response based on user queries. It can not answer … | |
Hi I downloaded Tkinter Designer, but there´s no exe there. I´ve used Thonny as a "compiler" for running scripts, and I used it to "make" a a simple program, but I don´t know how to save as an exe, and I don´t know how I would put these files into … | |
I was searching for Paris Olympics ticket prices for tennis games recently. The official website directs you to a [PDF document](https://tickets.paris2024.org/obj/media/FR-Paris2024/ticket-prices.pdf) containing ticket prices and venues for all the games. However, I found the PDF document to be very hard to navigate. To make things easier, I developed a chatbot … | |
Hi I want to make use of GitHub-programs, and so I downloaded a bunch of Python-tools. When I run this command in "Python 3.12", which looks like CMD to me, I get a syntax error. "To create a virtual environment, Python supplies a built in venv module which provides the … | |
Hi Title: A module that comes with Python, doesn´t register. This is the error message: %Run 'open cv - tot.py' Traceback (most recent call last): File "C:\Users\Audun Nilsen\open cv - tot.py", line 1, in <module> import cv2 # OpenCV for image processing ModuleNotFoundError: No module named 'cv2' I double-checked to … | |
On March 4, 2024, [Anthropic](https://www.anthropic.com/) launched the [Claude 3 family of large language models](https://www.anthropic.com/news/claude-3-family). Anthropic claimed that its Claude 3 Opus model outperforms GPT-4 on various benchmarks. Intrigued by Anthropic's claim, I performed a simple test to compare the performances of Claude 3 Opus, [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction), and [OpenAI's GPT-4](https://openai.com/research/gpt-4) … | |
In the rapidly evolving field of Natural Language Processing (NLP), open-source large language models (LLMs) are becoming increasingly popular as they are free to use. Among these, the [Mistral](https://docs.mistral.ai/models/) family of models stands out as a state-of-the-art model that is freely accessible to the public. Comparable in performance to the … | |
Using PySide's QAbstractTableModel allows you to easily customize a widget like QTableView and make it more generic in its application. Here we use it to present solvent data in tabular form and sort each column content by simply clicking on the header. You can feed it any data as long … | |
AI-powered code reviews are bringing about a revolution in the way developers collaborate, enabling more efficient, accurate, and thorough evaluations of code prior to its integration into the central project repository. This transformation is made possible by leveraging machine learning and artificial intelligence to automate and enhance different facets of … | |
In a previous article, I explained [how to fine-tune Google's Gemma model for text classification](https://www.daniweb.com/programming/computer-science/tutorials/541544/fine-tuning-google-gemma-model-for-text-classification-in-python). In this article, I will explain how you can improve performance of a pretrained large language model (LLM) using retrieval augmented generation (RAG) technique. So, let's begin without ado. ## What is Retrieval Augmented Generation … | |
On February 21, 2024, Google released [Gemma](https://ai.google.dev/gemma), a family of state-of-the-art open-source large language models (LLMs). As per initial results, its 7b (seven billion parameter) version is known to perform better than Meta's [Llama 2](https://llama.meta.com/), the previous state-of-the-art open-source LLM. As always, my first test with any new open-source LLM … | |
I am trying to extract three values from the td tags in an html downloaded file. <tr align="right"><td>236</td><td>Roy</td><td>Allyson</td> <tr align="right"><td>237</td><td>Marvin</td><td>Pamela</td> <tr align="right"><td>238</td><td>Micah</td><td>Kristine</td> <tr align="right"><td>239</td><td>Collin</td><td>Raquel</td> I am using the pattern match = re.findall(r'<td.?>([\d+])([.?])*<\/td>', file) The file is created with a read() statement. The output should look like (236, "Roy", "Allyson") (237, … | |
I am trying to extract three values from the td tags in an html downloaded file. <tr align="right"><td>236</td><td>Roy</td><td>Allyson</td> <tr align="right"><td>237</td><td>Marvin</td><td>Pamela</td> <tr align="right"><td>238</td><td>Micah</td><td>Kristine</td> <tr align="right"><td>239</td><td>Collin</td><td>Raquel</td> I am using the pattern match = re.findall(r'<td.*?>([\d+])([.*?])*<\/td>', file) The file is created with a read() statement. The output should look like (236, "Roy", "Allyson") (237, … | |
I am working on an exercise from Google's Python class dealing with popular baby names. I have the program running properly when using only one filename, but when I try to use the wildcard to get all files with baby####.html files I get differing errors every time I run the … | |
In my previous article, I explained [how to convert PDF image to CSV using Multimodal Google Gemini Pro](https://www.daniweb.com/programming/computer-science/tutorials/541365/converting-pdf-image-to-csv-using-multimodal-google-gemini-pro). To do so, I wrote a Python script that passes text command to [Google Gemino Pro](https://blog.google/technology/ai/google-gemini-ai/) for extracting tables from PDF images and storing them in a CSV file. In this article, … | |
I am building a crawler+parser in Python. It has to be run for, like 20 hours. How can I modify the code such that the code execution pauses (before next urllib2.urlopen) when the internet is disconnected, and AUTOMATICALLY resumes with the same variable values, when the internet connection is back … | |
Integrating language models like ChatGPT into third-party applications has become increasingly popular due to their ability to comprehend and generate human-like text. However, it's crucial to acknowledge the limitations of ChatGPT, such as its knowledge cut-off date in September 2021 and its inability to access external sources like Wikipedia or … | |
I'll admit that I have an opinion about this. Is the point of professional coding to write creative software or to write software that adheres closely to standards? | |
In this article, you will learn to use [Google Gemini Pro](https://blog.google/technology/ai/google-gemini-ai/), a state-of-the-art multimodal generative model, to extract information from PDF and convert it to CSV files. You will use a simple text prompt to tell Google Gemini Pro about the information you want to extract. This is a valuable … | |
In this article, you will learn how to track faces within a video using the Python DeepFace library. Additionally, you'll discover how to include portions of the video background in face tracking by implementing custom methods that utilize the DeepFace library's `extract_faces()` method for face extraction. I explained how to … | |
In this article, we will compare two state-of-the-art large language models for zero-shot text classification: [Google Gemini Pro](https://deepmind.google/technologies/gemini/#introduction) and [OpenAI GPT-4](https://openai.com/research/gpt-4). Zero-shot text classification is a task where a model is trained on a set of labeled examples but can then classify new examples from previously unseen classes. This is … |
The End.