lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Comparative Analysis of Language Models: hallucinations in ChatGPT: Prompt Study
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM).
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM).
2023 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesisAlternative title
Jämförande analys av språkmodeller: hallucinationer i ChatGPT : Prompt Studie (Swedish)
Abstract [en]

This thesis looks at the percentage of hallucinations in two large language models (LLM), ChatGPT 3.5 and ChatGPT 4 output for a set of prompts. This work was motivated by two factors: the release of ChatGPT 4 and its parent company OpenAI, claiming it to be much more potent than its predecessor ChatGPT 3.5, which raised questions regarding the capabilities of the LLM. Furthermore, the other factor is that ChatGPT 3.5 showcased hallucinations (creating material that is factually wrong, deceptive, or untrue.) in response to different prompts, as shown by other studies. The intended audience was members of the computer science community, such as researchers, software developers, and policymakers. The aim was to highlight large language models' potential capabilities and provide insights into their dependability.

This study used a quasi-experimental study design and a systematic literature review.Our hypothesis predicted that the percentage of hallucinations (creating factually wrong, deceptive, or untrue material) would be more prevalent in ChatGPT 3.5 compared to ChatGPT 4. We based our prediction on the fact that OpenAI trained ChatGPT 4 on more material than ChatGPT 3.5. We experimented on both LLMS, and our findings supported The hypothesis. Furthermore, we looked into the literature and found studies that also agree that ChatGPT 4 is better than ChatGPT 3.5.

The research concluded with suggestions for future work, like using extensive datasets and comparing the performance of different models, not only ChatGPT 3.5 and ChatGPT 4.

Place, publisher, year, edition, pages
2023. , p. 109
Keywords [en]
Large Language Models, Hallucinations, ChatGPT
Keywords [sv]
Stora Språkmodeller, Hallucinationer, ChatGPT
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:lnu:diva-121267OAI: oai:DiVA.org:lnu-121267DiVA, id: diva2:1764165
Subject / course
Computer Science
Educational program
Software Technology Programme, 180 credits
Supervisors
Examiners
Available from: 2023-06-13 Created: 2023-06-08 Last updated: 2023-06-13Bibliographically approved

Open Access in DiVA

Degree project(985 kB)2445 downloads
File information
File name FULLTEXT01.pdfFile size 985 kBChecksum SHA-512
3fb7a7ea63a4fce0e7747a41db3aab05429617231b487948fbcb2fcdd9130baec8216793fbc195f7520d64aabd57062c0f7a3fb192dca728a4035f154cf44b82
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Hanna, EliasLevic, Alija
By organisation
Department of computer science and media technology (CM)
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 2448 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 10074 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf