URL Metadata API Endpoint
December 2021
This is a simple one: I wanted an endpoint that would simply
display a JSON
object of some top-level
metadata (eg. title, description, open graph data, etc.)
associated with a URL.
Below are some URLs for to to see how it works:
Note: The host m.imnosy.com
is simply a 301
redirect to the endpoint https://aws-api.416.io/utils/v1/url/metadata
What data will it retrieve?
Here's a breakdown of the open graph / metadata the endpoint can
retrieve:
title
,
canonical
,
description
,
article:published_time
,
article:modified_time
,
og:description
,
og:image
,
og:image:alt
,
og:image:width
,
og:image:height
,
og:image:secure_url
,
og:image:type
,
og:locale
,
og:site_name
,
og:title
,
og:type
,
og:updated_time
,
og:url
,
og:video
,
og:video:duration
,
og:video:release_date
,
generator
,
twitter:card
,
twitter:creator
,
twitter:domain
,
twitter:description
,
twitter:image
,
twitter:image:alt
,
twitter:site
,
twitter:title
,
fb:app_id
,
fb:pages
Why did I build this?
A Chrome Extension I'm working on
(Bookee)
makes use of the Open Graph images a URL might refer to. It does
so by showing a thumbnail associated with a URL. Ideally, it
shows the Open Graph image associated with that URL (otherwise a
screenshot of that URL).
However in order to do so, I need to first load the metadata
associated with that URL.
This is being used by my other experiment:
Open Graph Image Redirect.
How was it built?
I again used AWS to facilitate the backend of this tool. Three
different services are used:
Cheerio is used for
server-side HTML parsing.
Feedback
If you'd like to use the above endpoints in a production
environment, or have any feedback, you can reach me at:
onassar@gmail.com