Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Randomize user agent #5

Open
richardvenneman opened this issue Sep 5, 2019 · 2 comments
Open

Randomize user agent #5

richardvenneman opened this issue Sep 5, 2019 · 2 comments
Assignees

Comments

@richardvenneman
Copy link
Owner

I'm getting the idea that Instagram might not be serving content based on our headless user agent. Investigate if it's possible to randomise the user agent to increase success rate.

@richardvenneman richardvenneman self-assigned this Sep 5, 2019
@mattnwa
Copy link

mattnwa commented Sep 24, 2019

Hey Richard, Great work! I'm working through this precise problem on our daily collection. It seems like my lambda service was working fine using a mobile user agent until this last weekend. Now 6 of 10 calls are redirecting to the home login page.

Maybe Instagram is catching the server IP? I'm not familiar enough to know that proxies would help in this situation but figure maybe?

@richardvenneman
Copy link
Owner Author

richardvenneman commented Sep 26, 2019

Hi @mattnwa thanks for the report!

I suspected the same thing. Instagram could be blacklisting or rate-limiting the IP addresses.

I think a proxy-kind of setup could definitely be a solution to this issue. I've also briefly looked at utilising the Tor network, however I not really familiar with both myself either :)

Will have to investigate the possible options. This gem looks pretty interesting: https://github.com/joshweir/scraypa.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants