-
Notifications
You must be signed in to change notification settings - Fork 0
Description
1. How can shadowbanning be detected in social media platforms like Instagram?
Develop algorithms to identify patterns in content visibility restrictions, focusing on anomalies in reach, engagement, or ranking compared to similar content or historical trends.
- Feasibility:
- Public APIs like Instagram Graph API provide limited data on reach and engagement metrics but may be useful.
- Python libraries for anomaly detection (e.g.,
scikit-learn,statsmodels) can help identify unusual patterns. - Limitations: APIs often don’t reveal ranking data directly, so indirect methods (e.g., scraping, if allowed) might be needed.
2. What types of content are most affected by shadowbanning or visibility restrictions?
- Details: Analyze whether certain topics, hashtags, or keywords are disproportionately restricted using text analysis and sentiment analysis.
- Feasibility:
- Collect posts and hashtags via Instagram’s API or third-party tools like
Tweepy(for X) orPRAW(for Reddit). - Use Python NLP libraries like
spaCyorNLTKfor content analysis. - Challenges: Public APIs may not disclose visibility restriction statuses, requiring correlation with engagement trends.
- Collect posts and hashtags via Instagram’s API or third-party tools like
3. Are visibility restrictions applied consistently across different user demographics?
- Details: Investigate whether user demographics (e.g., account age, follower count, location) influence the likelihood of being shadowbanned.
- Feasibility:
- Scrape profile metadata and engagement data (subject to platform rules).
- Use Python libraries like
pandasandmatplotlibfor data analysis. - Challenges: Demographic data is not always available through APIs, potentially requiring indirect inference (e.g., location from bio or posting times).
4. How do visibility restrictions evolve over time on Instagram?
- Details: Track changes in content reach, engagement, or ranking policies to identify trends or patterns over time.
- Feasibility:
- Use archival tools like the Platform Governance Archive or the Wayback Machine.
- Leverage Python web scraping libraries (
BeautifulSoup,Selenium) for historical data collection. - Challenges: Historical data gaps for engagement metrics.
5. Are visibility restrictions correlated with specific platform policies (e.g., sensitive content control)?
- Details: Assess the impact of policy changes on content visibility by comparing before-and-after metrics for affected content.
- Feasibility:
- Use APIs or data scraping to gather pre- and post-policy engagement data.
- Statistical tests (e.g., t-tests, regression) in Python (
scipy,statsmodels) can evaluate correlations. - Challenges: Policy dates and scope need to be precisely known.
6. Can machine learning predict shadowbanning based on content and metadata?
- Details: Build predictive models using features like hashtags, sentiment, metadata, and engagement metrics.
- Feasibility:
- Train models using Python libraries like
scikit-learnorTensorFlow. - Data sources: APIs, scraped data, or labeled datasets (if created).
- Challenges: Labeling shadowbanned content is non-trivial without platform-provided data.
- Train models using Python libraries like
7. Do shadowbanning practices disproportionately affect specific political or cultural content?
- Details: Evaluate whether political or cultural content faces higher rates of visibility restrictions compared to other content.
- Feasibility:
- Use text classification (e.g.,
Transformers,Hugging Face) to categorize content. - Analyze engagement or reach discrepancies using APIs or scraping.
- Challenges: Requires careful operationalization of “political” or “cultural” content.
- Use text classification (e.g.,
8. How transparent are platforms like Instagram in notifying users about visibility restrictions?
- Details: Analyze whether restricted accounts are notified by comparing the presence of notification data or discrepancies in engagement without explanations.
- Feasibility:
- Monitor account activity for any notifications using automated tools (e.g., Selenium).
- User feedback surveys could complement computational analysis.
- Challenges: Platform limitations on API access to such notifications.
9. What role do algorithms (e.g., sentiment analysis or keyword matching) play in shadowbanning?
- Details: Reverse-engineer platform algorithms using adversarial testing or controlled experiments (e.g., posting test content).
- Feasibility:
- Post test cases systematically and measure engagement.
- Use machine learning explainability tools (e.g., SHAP, LIME) to infer algorithmic decisions.
- Challenges: Ethical and legal considerations in adversarial testing.
10. How does the sensitive content opt-in feature affect content discovery?
- Details: Investigate engagement differences for content marked as "sensitive" based on user opt-in preferences.
- Feasibility:
- Scrape or analyze engagement trends for content flagged under sensitive categories.
- Use Python libraries like
numpyandseabornfor exploratory analysis. - Challenges: Access to sensitive-content-specific data may be restricted.