Next.js Metadata Scraper

A Simple tool to parse website metadata on server-side using puppeteer.

Requirements

To perform metadata scraping in a serverless environment (e.g., Vercel), two key libraries are used:

puppeteer-core: The core API of Puppeteer, distributed without a bundled Chromium binary. This allows us to manage and deploy our own Chromium binary, which is essential in constrained environments like serverless functions.
@sparticuz/chromium-min: A lightweight Chromium binary optimized for serverless environments. It is a smaller fork of chrome-aws-lambda, designed specifically to comply with size limitations (e.g., the 50 MB unzipped limit imposed by platforms like Vercel).

Why `@sparticuz/chromium-min`?

Some serverless platforms, such as Vercel, restrict the maximum size of files within the deployment package. @sparticuz/chromium-min provides a trimmed-down Chromium binary that can be hosted externally (e.g., via a CDN) to keep deployment packages within acceptable limits.

In this setup, the Chromium binary is hosted on GitHub. While its not ideal, it serves as a free and functional CDN alternative for development and low-traffic use cases.

Note: Ensure that the version of puppeteer-core used matches the version of Chromium provided by @sparticuz/chromium-min to avoid compatibility issues.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.husky		.husky
.vscode		.vscode
public		public
src		src
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
.vercelignore		.vercelignore
bun.lock		bun.lock
components.json		components.json
eslint.config.js		eslint.config.js
license		license
next.config.ts		next.config.ts
package.json		package.json
postcss.config.js		postcss.config.js
readme.md		readme.md
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Next.js Metadata Scraper

Requirements

Why `@sparticuz/chromium-min`?

About

Uh oh!

Uh oh!

Languages

License

oviirup/next-metadata-scraper

Folders and files

Latest commit

History

Repository files navigation

Next.js Metadata Scraper

Requirements

Why @sparticuz/chromium-min?

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages

Why `@sparticuz/chromium-min`?