Rescuing 216 Pages from the GeoCities Era: How I Built an HTML-to-Blogger Tool
This is a submission for the 2026 WeCoded Challenge: Echoes of Experience
I wanted to move an old site made of static HTML pages to Google Blogger.
Introduction
The outdoor club's website (we called it a "homepage" back then) was archived when GeoCities shut down. The last update was in April 2014. If it had become completely inaccessible, I could have accepted it, but it is just too sad to see it frozen as an archive.
Okay, I'll do this! I told the old site's administrator (a friend), "It's okay, let me do it." For the time being, I planned to host the new site on Vercel's Hobby plan. Then I checked the old site's HTML. What? 216 pages. Doing this by hand is impossible!
The old site was beautifully made by my friend while he was learning HTML. I tried cleaning it with AI, but my instructions were bad and it just didn't work. With the help of another friend, I aimed to build a stylish new site with React and other tools in VS Code. While I was doing all that, Windows became unstable and eventually crashed. The data was still there, so I reinstalled Windows and DevStudio. But I had lost the product ID. It's completely over.
So I installed Ubuntu, installed VS Code, and decided to develop in Python (which I had never used before). While the installation was in progress, I also changed the new site's operational policy:
- Members can post
- You can also upload videos.
With these two additional requirements, the platform for the new site also changed to Google Blogger ♫
Getting Started with Python
I started with 'Hello python' and then created a CLI for cleaning tags out of HTML files. Next, I tried posting to Blogger through its API, and I began to get the hang of it.
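A tag-cleaning CLI of this kind might look roughly like the sketch below. It is not the tool's actual code: the names `StyleStripper`, `clean_html`, and `main`, the `DROP_TAGS` and `DROP_ATTRS` lists, the `--encoding` option, and the `.clean.html` output name are all assumptions made for illustration. It uses only the standard library's `html.parser` and `argparse`.

```python
import argparse
import html
from html.parser import HTMLParser
from pathlib import Path

# Assumed lists: adjust to whatever presentational markup the old pages use.
DROP_TAGS = {"font"}
DROP_ATTRS = {"color", "bgcolor", "face", "size", "style"}


class StyleStripper(HTMLParser):
    """Re-emits HTML while skipping presentational tags and attributes."""

    def __init__(self):
        # convert_charrefs=False keeps entities such as &amp; as written.
        super().__init__(convert_charrefs=False)
        self.out = []

    def _emit_tag(self, tag, attrs, closing=""):
        parts = [tag]
        for name, value in attrs:
            if name in DROP_ATTRS:
                continue
            # Attribute values arrive unescaped, so escape them again.
            parts.append(name if value is None
                         else f'{name}="{html.escape(value, quote=True)}"')
        self.out.append("<" + " ".join(parts) + closing + ">")

    def handle_starttag(self, tag, attrs):
        if tag not in DROP_TAGS:
            self._emit_tag(tag, attrs)

    def handle_startendtag(self, tag, attrs):
        if tag not in DROP_TAGS:
            self._emit_tag(tag, attrs, closing=" /")

    def handle_endtag(self, tag):
        if tag not in DROP_TAGS:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def clean_html(source):
    """Return the HTML with font tags and color/style attributes removed."""
    parser = StyleStripper()
    parser.feed(source)
    parser.close()
    return "".join(parser.out)


def main(argv=None):
    ap = argparse.ArgumentParser(
        description="Strip font and color markup from HTML files.")
    ap.add_argument("files", nargs="+", type=Path)
    ap.add_argument("--encoding", default="utf-8")
    args = ap.parse_args(argv)
    for path in args.files:
        cleaned = clean_html(path.read_text(encoding=args.encoding))
        target = path.with_suffix(".clean.html")  # page.html -> page.clean.html
        target.write_text(cleaned, encoding=args.encoding)
        print(f"{path} -> {target}")
    return 0
```

To run it as a script, one would call `main()` under an `if __name__ == "__main__":` guard. Pages from the GeoCities era may not be UTF-8, which is why the sketch takes an encoding option instead of hard-coding one.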
Software Requirements Definition
Ruthless decision
- HTML history management
- Cleaning HTML
I want to manage the design entirely in Blogger, so I'm removing font and color settings.
- Keyword extraction and registration
Blogger allows you to set keywords for each post, so I want to use words that match the text in the HTML as keywords (e.g., kayaking, Bouldering).
- Location extraction and registration
This is also a feature that can be added to each Blogger post (Akihabara, Mt. Fuji, etc.).
- Remove Exif data from images and add watermarks.
- Upload articles and images
- GUI
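The keyword requirement above can be pictured with a small sketch. It assumes a hand-maintained list of candidate keywords and plain substring matching against the page's text; how the actual tool matches words is not described in this post, and `visible_text` and `match_keywords` are hypothetical names.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the text between tags and ignores the tags themselves."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def visible_text(html_source):
    """Return the text content of an HTML string, joined with spaces."""
    parser = _TextExtractor()
    parser.feed(html_source)
    parser.close()
    return " ".join(parser.chunks)


def match_keywords(html_source, keywords):
    """Return the candidate keywords that occur in the page text.

    Matching is a case-insensitive substring test, so it is deliberately
    naive: "ski" would also match "skiing".
    """
    text = visible_text(html_source).casefold()
    return [kw for kw in keywords if kw.casefold() in text]


page = "<h1>Kayaking on the lake</h1><p>Then some bouldering.</p>"
labels = match_keywords(page, ["kayaking", "Bouldering", "skiing"])
# labels == ["kayaking", "Bouldering"]
```

The same function could be pointed at a list of place names (Akihabara, Mt. Fuji, and so on) to pick location candidates, although turning a place name into coordinates needs a separate geocoding step.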
Implementation
Just as you all expected, I got thoroughly stuck.
- I struggled to get used to Ubuntu's IME → I installed Fcitx5 and managed to work around the problem.
- I discovered that the Blogger API has no image-related features → I established and standardized a somewhat easier procedure for images that combines manual steps with automatic linking.
- I couldn't get used to Python's nesting → I ended up checking the nesting with my finger.
- A library version issue occurred → I misunderstood Python virtual environments (venv) → fumbled around trying to build an environment in VirtualBox → realized that venv is something you run in the terminal, and my mind almost went blank.
- I accidentally sent about 200 Blogger API requests without using a timer → I received an account lock warning email → I sent an apology email to Google, and my account was restored the next day.
- I installed Wine and Excel (MS Office) to aggregate regional data → got stuck in cell input mode and switched to LibreOffice → discovered Nominatim geocoding, which solved the problem.
- I tried running Windows in a virtual machine as a fallback → too slow, and not enough HDD space. Ultimately the license was too expensive, so I gave up.
- I asked the AI to do various things, but it didn't understand the language specifications and approved things haphazardly → I ended up having to rewrite almost everything.
- Writing the code for a Tkinter GUI → the base was written by AI, and then I just stared at the source code.
- Tracing Python arguments → Why isn't copy running? → Wait, even when you put something on a queue, only a reference is stored?! And so on, through intense debugging.
- I couldn't get things to work, so I ended up pressing the keys too hard → my family told me it was too loud, and I almost cried.
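The queue surprise in the list above is easy to reproduce. The sketch below shows one reading of it: `queue.Queue.put` stores a reference to the object and does not copy it, so a change made after `put` is still visible to whoever calls `get`. The `status` dictionary and its keys are made up for the demonstration.

```python
import copy
import queue

q = queue.Queue()

# Case 1: put() stores a reference, not a copy.
status = {"file": "page001.html", "done": False}
q.put(status)
status["done"] = True          # mutate after putting
received = q.get()
same_object = received is status       # True: it is the very same dict
seen_done = received["done"]           # True: the later change is visible

# Case 2: put an explicit copy to hand over a snapshot.
status = {"file": "page002.html", "done": False}
q.put(copy.deepcopy(status))
status["done"] = True
snapshot = q.get()
snapshot_done = snapshot["done"]       # False: the snapshot was not affected
```

Whether a snapshot is needed depends on the design. When a worker thread keeps reusing one dictionary to report progress to a GUI, copying before `put` avoids the consumer seeing half-updated or later state.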
Reaffirming established theories
When it comes to technology, you invest either time, money, or wisdom. For me, someone who lacks wisdom, it turned out to be time. Still, I think I gained a lot. You can't know the taste of pudding until you try it. Ubuntu is wonderful. I had dabbled in it a little for work, but I didn't know about its community back then. Ultimately, what matters is human connection.
I would appreciate your help with debugging.
I've tried to write it so that it works on Windows and Mac as long as a Python environment is available. It requires Google API authentication credentials, which is a bit of a hassle, but I'd appreciate it if you could help debug it. Even just a review would be encouraging. Please feel free to point out any issues with the Python code!
Introduction:
Video:
code:
HTML to Blogger Ver0.98
A desktop application that automatically processes local HTML files and images and posts them to Blogger. It performs HTML cleaning, adds watermarks to images, assigns keywords and location information, and uploads to Blogger.
Qiita Article
Explanation article for this tool (Qiita)
Introduction Video
Main Features
- HTML Cleaning: Removes unnecessary tags and normalizes formatting for posts.
- Image Processing: Removes EXIF data and adds watermarks.
- Metadata Assignment: Automatically adds keywords (search tags) and location information (georss tags) by analyzing the article content.
- Blogger Upload: Uploads image links and articles as drafts using the Blogger API.
- GUI Operation: User-friendly GUI with progress visualization and error recovery features.
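Since the post mentions about 200 API requests going out without a timer, here is a minimal sketch of pacing a batch of uploads. It is an illustration and not the tool's code: `run_throttled` is a hypothetical helper, the interval is an arbitrary assumption and not a documented Blogger quota, and the function that actually posts a draft is left to the caller.

```python
import time


def run_throttled(items, action, min_interval=1.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Call action(item) for each item, keeping at least min_interval
    seconds between the start of consecutive calls.

    sleep and clock are injectable so the pacing can be tested
    without actually waiting.
    """
    results = []
    last_start = None
    for item in items:
        if last_start is not None:
            wait = min_interval - (clock() - last_start)
            if wait > 0:
                sleep(wait)
        last_start = clock()
        results.append(action(item))
    return results
```

A call such as `run_throttled(html_files, post_draft, min_interval=2.0)` would then space out the requests, where `post_draft` stands for whatever function sends one article to Blogger as a draft. A fixed delay is the simplest option; retrying with a growing delay after an error response would be a natural next step.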
Processing Flow
Processing is executed in the following order:
- import_file.py: Imports files from the source folder to the working folder.
- serial_file.py: Converts file names into sequential numbering format.
- clean_html.py: Cleans up HTML…
DEV Community
https://dev.to/amekusa03/rescuing-216-pages-from-the-geocities-era-how-i-built-an-html-to-blogger-tool-2ajk
