How to save a specified element using singlefile.getPageData? #36

tangxiaoqi-tangxiao · 2025-02-04T10:30:21Z

chrome.runtime.onMessage.addListener(async message => {
	if (message.action === "save-page") {
		const pageData = await singlefile.getPageData({
			removeHiddenElements: true,
			removeUnusedStyles: true,
			removeUnusedFonts: true,
			removeImports: true,
			blockScripts: true,
			blockAudios: true,
			blockVideos: true,
			compressHTML: true,
			removeAlternativeFonts: true,
			removeAlternativeMedias: true,
			removeAlternativeImages: true,
			groupDuplicateImages: true
		});
		console.log(pageData);
		
		const linkElement = document.createElement("a");
		linkElement.download = `${pageData.title}.html`;
		linkElement.href = URL.createObjectURL(new Blob([pageData.content], { type: "text/html" }));
		linkElement.click();
	}
});

gildas-lormeau (Maintainer) · 2025-02-04T14:58:12Z

You could parse pageData.content with the DOMParser API, remove nodes from the resulting document, and serialize it into HTML just before calling new Blob([pageData.content]). See below.

// ...

const doc = new DOMParser().parseFromString(pageData.content, "text/html");
doc.querySelectorAll("div.ads").forEach(element => element.remove()); // remove some elements
pageData.content = getDoctypeString(doc) + doc.documentElement.outerHTML;

linkElement.href = URL.createObjectURL(new Blob([pageData.content], { type: "text/html" }));
// ...


function getDoctypeString(doc) {
  const docType = doc.doctype;
  let docTypeString = "";
  if (docType) {
    docTypeString = "<!DOCTYPE " + docType.nodeName;
    if (docType.publicId) {
      docTypeString += " PUBLIC \"" + docType.publicId + "\"";
      if (docType.systemId)
        docTypeString += " \"" + docType.systemId + "\"";
    } else if (docType.systemId)
      docTypeString += " SYSTEM \"" + docType.systemId + "\"";
    if (docType.internalSubset)
      docTypeString += " [" + docType.internalSubset + "]";
    docTypeString += "> ";
  }
  return docTypeString;
}
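
The thread title asks about saving one specified element, while the snippet above removes a few unwanted ones. A minimal sketch of the opposite variant, using the same DOMParser approach: replace the children of the body with the single element to keep. `keepOnlyElement` and the `#article` selector are hypothetical names used for illustration; they are not part of SingleFile's API.

```javascript
// Hypothetical helper (not part of SingleFile): keeps only the first element
// matching `selector` inside <body>. Returns false and leaves the document
// untouched when nothing usable matches.
function keepOnlyElement(doc, selector) {
  const target = doc.querySelector(selector);
  // <body> and <html> cannot be moved into <body>, so treat them as no match.
  if (!target || target === doc.body || target === doc.documentElement) {
    return false;
  }
  // replaceChildren() removes every child of <body>, then inserts the target.
  doc.body.replaceChildren(target);
  return true;
}

// Usage with the snippet above (browser context; "#article" is an assumed selector):
// const doc = new DOMParser().parseFromString(pageData.content, "text/html");
// if (keepOnlyElement(doc, "#article")) {
//   pageData.content = getDoctypeString(doc) + doc.documentElement.outerHTML;
// }
```

Since the head of the document is left in place, any styles stored there should stay in the saved file, although rules that depend on the removed ancestors may no longer match the kept element.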

1 reply

tangxiaoqi-tangxiao Feb 4, 2025
Author

Thank you, I understand.
