[SOLVED] 403 from requests to Stack Overflow?

I’m pulling a Stack Overflow RSS feed fine locally, but I get a 403 with the same code running in my Glitch project. Any ideas?

Example URL: https://stackoverflow.com/feeds/tag?sort=newest&tagnames=ipfs

Hey @autonome, I’m able to curl that URL from the console of a Glitch project without any difficulty; can you share some more details, or your project’s name?

Could the project be on a banned AWS IP/host? (Yes, Glitch runs on AWS.) It’s more likely a problem with that specific project, though, as I’m able to run curl "https://stackoverflow.com/feeds/tag?sort=newest&tagnames=ipfs" in the console (like @cori) and receive the correct data. It could even just be how you’re sending the request.

Thanks all! Yeah it feels like a banned IP.

Here’s a code example:
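(Simplified; roughly what the request looks like, sketched with Node’s built-in https module.)

```js
// Simplified reproduction: fetch the feed with Node's built-in https module.
const https = require('https');

const url = 'https://stackoverflow.com/feeds/tag?sort=newest&tagnames=ipfs';

https.get(url, (res) => {
  console.log('status:', res.statusCode); // 200 locally, 403 on Glitch

  let body = '';
  res.on('data', (chunk) => { body += chunk; });
  res.on('end', () => console.log(body.slice(0, 300)));
}).on('error', (err) => console.error(err));
```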

Ok, this is interesting…

For the same URL:

  1. In my browser, I get a download of the XML file for the feed
  2. Locally, my node.js code gets the XML of the feed
  3. On Glitch, the same node.js code gets a 403 FORBIDDEN
  4. In the Glitch console, curl gets a 404 HTML page

SOLVED!

All I needed to get the correct response was to set a User-Agent header.

It can be anything: I put “fibblebonkers” and it worked fine. With no user agent, you get a 403.
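For reference, the fix is just one extra header (a sketch, again with Node’s built-in https module; the User-Agent value is arbitrary):

```js
const https = require('https');

const options = {
  hostname: 'stackoverflow.com',
  path: '/feeds/tag?sort=newest&tagnames=ipfs',
  // Any non-empty value works here; omitting the header entirely gets a 403.
  headers: { 'User-Agent': 'fibblebonkers' },
};

https.get(options, (res) => {
  console.log('status:', res.statusCode); // 200 once the header is set
  res.pipe(process.stdout);               // the XML feed body
}).on('error', (err) => console.error(err));
```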


I do hope you keep “fibblebonkers” as your User Agent. :joy::joy::joy:

Yeah, a user-agent of significant length is needed. I answered a similar question over on Meta.SE here. It’s worth mentioning that SE has extra tips on what they expect crawlers to do, which Jeff Atwood wrote up here; basically (see the sketch after the list):

  • Use GZIP requests.
  • Identify yourself.
  • Use the right formats.
  • Be considerate.
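Putting those tips together, a polite request could look something like this (a sketch; the User-Agent string and contact URL are placeholders, not anything SE prescribes):

```js
const https = require('https');
const zlib = require('zlib');

const options = {
  hostname: 'stackoverflow.com',
  path: '/feeds/tag?sort=newest&tagnames=ipfs',
  headers: {
    // Identify yourself: a descriptive User-Agent with a way to contact you.
    'User-Agent': 'my-feed-reader/1.0 (+https://example.com/contact)',
    // Use GZIP requests: ask for a compressed response to save bandwidth.
    'Accept-Encoding': 'gzip',
  },
};

https.get(options, (res) => {
  // Decompress only if the server actually responded with gzip.
  const stream = res.headers['content-encoding'] === 'gzip'
    ? res.pipe(zlib.createGunzip())
    : res;
  stream.pipe(process.stdout); // the XML feed body
}).on('error', (err) => console.error(err));
```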