2023-08-23

Python Summarization Algorithms Quick Guide

MediaWiki markup is a lightweight markup language that I use to format text on this website. This journal entry explores using the same format to create LibreOffice ODT/PDF documents.

I asked Bing LLM to write a guide about Python summarization algorithms with the following prompt:

   You are a senior natural language processing application developer and you are tasked to write a short guide about python summarization algorithms using mediawiki markups.
   Write extensively about NPL algorithms such as: Luhn, Edmundson, LSA, Text-Rank, Lex-Rank, Sum-Basic, KL. 
   Provide also some use cases where a particular algorithm outperforms the rest.
   Please provide me with the mediawiki raw text. Do not write conclusions.

I copied Bing Copilot's answer to a text file named python_summarization_algorithms_QG.wiki and edited it extensively. Then I opened Windiction to generate the corresponding LibreOffice (ODT) document, as shown in the figure below.

LibreOffice can then export the document in PDF format. Python Summarization Algorithms: A Quick Guide can be downloaded here.
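
The PDF export can also be scripted rather than done through the menus. The following is only a minimal sketch: it assumes the soffice executable is on the PATH, and the function names and the file name guide.odt are hypothetical, chosen for illustration. It relies on LibreOffice's --headless, --convert-to and --outdir command-line options.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdf_export_command(odt_path, out_dir=".", soffice="soffice"):
    """Return the argument list for a headless ODT -> PDF conversion."""
    return [
        soffice,
        "--headless",            # run without opening the GUI
        "--convert-to", "pdf",   # target format
        "--outdir", str(out_dir),
        str(odt_path),
    ]


def export_pdf(odt_path, out_dir=".", soffice="soffice"):
    """Run the conversion and return the expected path of the PDF.

    Assumption: soffice is reachable on the PATH; otherwise an error is raised.
    """
    if shutil.which(soffice) is None:
        raise FileNotFoundError(f"{soffice!r} not found on PATH")
    subprocess.run(build_pdf_export_command(odt_path, out_dir, soffice), check=True)
    return Path(out_dir) / (Path(odt_path).stem + ".pdf")
```

Calling export_pdf("guide.odt") should leave guide.pdf in the current directory, provided LibreOffice is installed.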

On Windows, the opposite is also possible: a LibreOffice document can be exported as a MediaWiki-formatted text file with:

   File → Export → Save as type: Mediawiki.

I usually prefer the .wiki suffix for MediaWiki-formatted text files, whereas LibreOffice uses .txt by default. To save the file with a custom extension, uncheck Automatic file name extension before saving.
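
If the file has already been exported with the default .txt extension, it can simply be renamed afterwards. A small sketch using Python's pathlib follows; the function name rename_to_wiki is hypothetical, and the function refuses to overwrite an existing .wiki file.

```python
from pathlib import Path


def rename_to_wiki(exported_path):
    """Rename an exported file so that it ends in .wiki and return the new path."""
    source = Path(exported_path)
    target = source.with_suffix(".wiki")  # e.g. guide.txt -> guide.wiki
    if target.exists():
        # Avoid silently overwriting an existing file.
        raise FileExistsError(f"{target} already exists")
    source.rename(target)
    return target
```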