Description
i dont know if your active or even care anymore, but i think this suggestion would ve a very nice update to httrack i have searched on the net and see countless people having issues logging into webpages. It basically is using edges data all cookies, and saved forms are easily attained by this!! meaning i could log into a webpage on my browser and click remember me or stay signed in. Fire up httrack and bam go to the page and im logged in already
Adding a option in the menu to select the browsers directory for data
##here is a npm script section for puppeteer or playwright. i think this would be a very very useful addition.##
heres a link explaining what it does and how it works
playwright/docs API usedataDIR
async function startBrowser() {
let browser;
try {
console.log("starting Browser");
browser = await puppeteer.launch({
headless: true,
'ignoreHTTPSErrors': true,
**userDataDir: "C:\\Users\\Janss\\AppData\\Local\\Microsoft\\Edge SxS\\User Data\\Default",
executablePath: "C:\\Users\\Janss\\AppData\\Local\\Microsoft\\Edge SxS\\Application\\msedge.exe"**
});
} catch (err) {
console.log("Could not create a browser instance => : ", err);
}
return browser;
i personally have given up i can do almost anything with httrack except scrape behind a login page i am trying to copy a vehicle manual for my truck from chilton and have tried so many times spent hours on this..... i just keep getting this eror
please return to your library's access page and re-authorize a new session. -
- ive tried proxy forms fill out option
- the cookies.txt in the project folder, and also added one into the the main directory
- even a link anyone can click and will be automatically logged into the page
- added a referrer url
- followed robots.txt / didnt follow it.
- excluded all links containing logout, or quit
- tried to keep it in the directory of /lh/Repair/Index