Experiments with Primitive operations : SHORT REPORT / NOTES

Jun 1, 20240 likes5 views

This includes: - Multiply with different modes (map) 1. Performance of sequential execution based vs OpenMP based vector multiply. 2. Comparing various launch configs for CUDA based vector multiply. - Sum with different storage types (reduce) 1. Performance of vector element sum using float vs bfloat16 as the storage type. - Sum with different modes (reduce) 1. Performance of sequential execution based vs OpenMP based vector element sum. 2. Performance of memcpy vs in-place based CUDA based vector element sum. 3. Comparing various launch configs for CUDA based vector element sum (memcpy). 4. Comparing various launch configs for CUDA based vector element sum (in-place). - Sum with in-place strategies of CUDA mode (reduce) 1. Comparing various launch configs for CUDA based vector element sum (in-place).

Multiply with different modes (map)
Sequential OpenMP CUDA
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
float bfloat16
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
Sequential OpenMP CUDA (memcpy, in-place)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
sum-loop sum-reduce
one-loop atomic-add
block-loop template, next-pow2 launch one-reduce, next-pow2 launch
block-loop template, prev. pow2 launch one-reduce, prev-pow2 launch
grid-loop
1. Comparing various launch configs for CUDA based vector element sum (in-place).

Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is Multiply with different modes (map) 1. Performance of sequential execution based vs OpenMP based vector multiply. 2. Comparing various launch configs for CUDA based vector multiply. Sum with different storage types (reduce) 1. Performance of vector element sum using float vs bfloat16 as the storage type. Sum with different modes (reduce) 1. Performance of sequential execution based vs OpenMP based vector element sum. 2. Performance of memcpy vs in-place based CUDA based vector element sum. 3. Comparing various launch configs for CUDA based vector element sum (memcpy). 4. Comparing various launch configs for CUDA based vector element sum (in-place). Sum with in-place strategies of CUDA mode (reduce) 1. Comparing various launch configs for CUDA based vector element sum (in-place).

PageRank Experiments : SHORT REPORT / NOTESSubhajit Sahu

This includes: - Adjusting data types for rank vector - Adjusting Pagerank parameters - Adjusting Sequential approach - Adjusting OpenMP approach - Comparing sequential approach - Adjusting Monolithic (Sequential) optimizations (from STICD) - Adjusting Levelwise (STICD) approach - Comparing Levelwise (STICD) approach - Adjusting ranks for dynamic graphs - Adjusting Levelwise (STICD) dynamic approach - Comparing dynamic approach with static - Adjusting Monolithic CUDA approach - Adjusting Monolithic CUDA optimizations (from STICD) - Adjusting Levelwise (STICD) CUDA approach - Comparing Levelwise (STICD) CUDA approach - Comparing dynamic CUDA approach with static - Comparing dynamic optimized CUDA approach with static

Jvm Performance TunningTerry Cho

Jvm Performance Tunningguest1f2740

The document discusses various topics related to tuning the Java Virtual Machine (JVM) for performance, including: 1. Hotspot compiler options like method inlining that can improve performance. 2. Threading models on Solaris like M:N and 1:1 and how tuning thread-related JVM options can significantly impact throughput. 3. Memory and garbage collection tuning like selecting the right GC algorithm, tuning heap sizes, and analyzing GC logs to identify bottlenecks and optimize full GC frequency and duration.

Java Keeps Throttling Up!José Paumard

Le slide deck de l'Université que nous avons donnée avec Rémi Forax à Devoxx France 2019. Comme promis, Java sort sa version majeure tous les 6 mois. Le train passe et amène son lot de nouveautés. Parmi elles, certaines sont sorties : une nouvelle syntaxe pour les clauses switch et l'instruction de byte code CONSTANT_DYNAMIC. D'autres sont en chantier, plus ou moins avancé : une nouvelle façon d'écrire des méthodes de façon condensée, un instanceof 'intelligent', des constantes évaluées au moment où elles sont utilisées. Les projets progressent. Loom, et son nouveau modèle de programmation concurrente que l'ont peut tester avec Jetty. Amber, qui introduit les data types et des nouvelles syntaxes. Valhalla, dont les value types donnent leurs premiers résultats. S'il est difficile de prévoir une date de sortie pour ces nouveautés, on sait en revanche qu'une fois prêtes elles sortiront en moins de 6 mois. De tout ceci nous parlerons donc au futur et en public, avec des démonstrations de code, des slides, du code, de la joie et de la bonne humeur !

2017 10 17_quantum_program_v2Francisco J. Gálvez Ramírez

About TrueTime, Spanner, Clock synchronization, CAP theorem, Two-phase lockin...Subhajit Sahu

TrueTime is a service that enables the use of globally synchronized clocks, with bounded error. It returns a time interval that is guaranteed to contain the clock’s actual time for some time during the call’s execution. If two intervals do not overlap, then we know calls were definitely ordered in real time. In general, synchronized clocks can be used to avoid communication in a distributed system. The underlying source of time is a combination of GPS receivers and atomic clocks. As there are “time masters” in every datacenter (redundantly), it is likely that both sides of a partition would continue to enjoy accurate time. Individual nodes however need network connectivity to the masters, and without it their clocks will drift. Thus, during a partition their intervals slowly grow wider over time, based on bounds on the rate of local clock drift. Operations depending on TrueTime, such as Paxos leader election or transaction commits, thus have to wait a little longer, but the operation still completes (assuming the 2PC and quorum communication are working).

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu

Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.

Adjusting Bitset for graph : SHORT REPORT / NOTESSubhajit Sahu

Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is commonly used for efficient graph computations. Unfortunately, using CSR for dynamic graphs is impractical since addition/deletion of a single edge can require on average (N+M)/2 memory accesses, in order to update source-offsets and destination-indices. A common approach is therefore to store edge-lists/destination-indices as an array of arrays, where each edge-list is an array belonging to a vertex. While this is good enough for small graphs, it quickly becomes a bottleneck for large graphs. What causes this bottleneck depends on whether the edge-lists are sorted or unsorted. If they are sorted, checking for an edge requires about log(E) memory accesses, but adding an edge on average requires E/2 accesses, where E is the number of edges of a given vertex. Note that both addition and deletion of edges in a dynamic graph require checking for an existing edge, before adding or deleting it. If edge lists are unsorted, checking for an edge requires around E/2 memory accesses, but adding an edge requires only 1 memory access.

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Subhajit Sahu

Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.

Algorithmic optimizations for Dynamic Monolithic PageRank (from STICD) : SHOR...Subhajit Sahu

Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu

For massive graphs that fit in RAM, but not in GPU memory, it is possible to take advantage of a shared memory system with multiple CPUs, each with multiple cores, to accelerate pagerank computation. If the NUMA architecture of the system is properly taken into account with good vertex partitioning, the speedup can be significant. To take steps in this direction, experiments are conducted to implement pagerank in OpenMP using two different approaches, uniform and hybrid. The uniform approach runs all primitives required for pagerank in OpenMP mode (with multiple threads). On the other hand, the hybrid approach runs certain primitives in sequential mode (i.e., sumAt, multiply).

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

Below are the important points I note from the 2020 paper by Martin Grohe: - 1-WL distinguishes almost all graphs, in a probabilistic sense - Classical WL is two dimensional Weisfeiler-Leman - DeepWL is an unlimited version of WL graph that runs in polynomial time. - Knowledge graphs are essentially graphs with vertex/edge attributes ABSTRACT: Vector representations of graphs and relational structures, whether handcrafted feature vectors or learned representations, enable us to apply standard data analysis and machine learning techniques to the structures. A wide range of methods for generating such embeddings have been studied in the machine learning and knowledge representation literature. However, vector embeddings have received relatively little attention from a theoretical point of view. Starting with a survey of embedding techniques that have been used in practice, in this paper we propose two theoretical approaches that we see as central for understanding the foundations of vector embeddings. We draw connections between the various approaches and suggest directions for future research.

DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTESSubhajit Sahu

https://gist.github.com/wolfram77/54c4a14d9ea547183c6c7b3518bf9cd1 There exist a number of dynamic graph generators. Barbasi-Albert model iteratively attach new vertices to pre-exsiting vertices in the graph using preferential attachment (edges to high degree vertices are more likely - rich get richer - Pareto principle). However, graph size increases monotonically, and density of graph keeps increasing (sparsity decreasing). Gorke's model uses a defined clustering to uniformly add vertices and edges. Purohit's model uses motifs (eg. triangles) to mimick properties of existing dynamic graphs, such as growth rate, structure, and degree distribution. Kronecker graph generators are used to increase size of a given graph, with power-law distribution. To generate dynamic graphs, we must choose a metric to compare two graphs. Common metrics include diameter, clustering coefficient (modularity?), triangle counting (triangle density?), and degree distribution. In this paper, the authors propose Dygraph, a dynamic graph generator that uses degree distribution as the only metric. The authors observe that many real-world graphs differ from the power-law distribution at the tail end. To address this issue, they propose binning, where the vertices beyond a certain degree (minDeg = min(deg) s.t. |V(deg)| < H, where H~10 is the number of vertices with a given degree below which are binned) are grouped into bins of degree-width binWidth, max-degree localMax, and number of degrees in bin with at least one vertex binSize (to keep track of sparsity). This helps the authors to generate graphs with a more realistic degree distribution. The process of generating a dynamic graph is as follows. First the difference between the desired and the current degree distribution is calculated. The authors then create an edge-addition set where each vertex is present as many times as the number of additional incident edges it must recieve. Edges are then created by connecting two vertices randomly from this set, and removing both from the set once connected. Currently, authors reject self-loops and duplicate edges. Removal of edges is done in a similar fashion. Authors observe that adding edges with power-law properties dominates the execution time, and consider parallelizing DyGraph as part of future work.

Shared memory Parallelism (NOTES)Subhajit Sahu

My notes on shared memory parallelism. Shared memory is memory that may be simultaneously accessed by multiple programs with an intent to provide communication among them or avoid redundant copies. Shared memory is an efficient means of passing data between programs. Using memory for communication inside a single program, e.g. among its multiple threads, is also referred to as shared memory [REF].

A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu

**Community detection methods** can be *global* or *local*. **Global community detection methods** divide the entire graph into groups. Existing global algorithms include: - Random walk methods - Spectral partitioning - Label propagation - Greedy agglomerative and divisive algorithms - Clique percolation https://gist.github.com/wolfram77/b4316609265b5b9f88027bbc491f80b6 There is a growing body of work in *detecting overlapping communities*. **Seed set expansion** is a **local community detection method** where a relevant *seed vertices* of interest are picked and *expanded to form communities* surrounding them. The quality of each community is measured using a *fitness function*. **Modularity** is a *fitness function* which compares the number of intra-community edges to the expected number in a random-null model. **Conductance** is another popular fitness score that measures the community cut or inter-community edges. Many *overlapping community detection* methods **use a modified ratio** of intra-community edges to all edges with atleast one endpoint in the community. Andersen et al. use a **Spectral PageRank-Nibble method** which minimizes conductance and is formed by adding vertices in order of decreasing PageRank values. Andersen and Lang develop a **random walk approach** in which some vertices in the seed set may not be placed in the final community. Clauset gives a **greedy method** that *starts from a single vertex* and then iteratively adds neighboring vertices *maximizing the local modularity score*. Riedy et al. **expand multiple vertices** via maximizing modularity. Several algorithms for **detecting global, overlapping communities** use a *greedy*, *agglomerative approach* and run *multiple separate seed set expansions*. Lancichinetti et al. run **greedy seed set expansions**, each with a *single seed vertex*. Overlapping communities are produced by a sequentially running expansions from a node not yet in a community. Lee et al. use **maximal cliques as seed sets**. Havemann et al. **greedily expand cliques**. The authors of this paper discuss a dynamic approach for **community detection using seed set expansion**. Simply marking the neighbours of changed vertices is a **naive approach**, and has *severe shortcomings*. This is because *communities can split apart*. The simple updating method *may fail even when it outputs a valid community* in the graph.

Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu

A **community** (in a network) is a subset of nodes which are _strongly connected among themselves_, but _weakly connected to others_. Neither the number of output communities nor their size distribution is known a priori. Community detection methods can be divisive or agglomerative. **Divisive methods** use _betweeness centrality_ to **identify and remove bridges** between communities. **Agglomerative methods** greedily **merge two communities** that provide maximum gain in _modularity_. Newman and Girvan have introduced the **modularity metric**. The problem of community detection is then reduced to the problem of modularity maximization which is **NP-complete**. **Louvain method** is a variant of the _agglomerative strategy_, in that is a _multi-level heuristic_. https://gist.github.com/wolfram77/917a1a4a429e89a0f2a1911cea56314d In this paper, the authors discuss **four heuristics** for Community detection using the _Louvain algorithm_ implemented upon recently developed **Grappolo**, which is a parallel variant of the Louvain algorithm. They are: - Vertex following and Minimum label - Data caching - Graph coloring - Threshold scaling With the **Vertex following** heuristic, the _input is preprocessed_ and all single-degree vertices are merged with their corresponding neighbours. This helps reduce the number of vertices considered in each iteration, and also help initial seeds of communities to be formed. With the **Minimum label heuristic**, when a vertex is making the decision to move to a community and multiple communities provided the same modularity gain, the community with the smallest id is chosen. This helps _minimize or prevent community swaps_. With the **Data caching** heuristic, community information is stored in a vector instead of a map, and is reused in each iteration, but with some additional cost. With the **Vertex ordering via Graph coloring** heuristic, _distance-k coloring_ of graphs is performed in order to group vertices into colors. Then, each set of vertices (by color) is processed _concurrently_, and synchronization is performed after that. This enables us to mimic the behaviour of the serial algorithm. Finally, with the **Threshold scaling** heuristic, _successively smaller values of modularity threshold_ are used as the algorithm progresses. This allows the algorithm to converge faster, and it has been observed a good modularity score as well. From the results, it appears that _graph coloring_ and _threshold scaling_ heuristics do not always provide a speedup and this depends upon the nature of the graph. It would be interesting to compare the heuristics against baseline approaches. Future work can include _distributed memory implementations_, and _community detection on streaming graphs_.

Application Areas of Community Detection: A Review : NOTESSubhajit Sahu

This is a short review of Community detection methods (on graphs), and their applications. A **community** is a subset of a network whose members are *highly connected*, but *loosely connected* to others outside their community. Different community detection methods *can return differing communities* these algorithms are **heuristic-based**. **Dynamic community detection** involves tracking the *evolution of community structure* over time. https://gist.github.com/wolfram77/09e64d6ba3ef080db5558feb2d32fdc0 Communities can be of the following **types**: - Disjoint - Overlapping - Hierarchical - Local. The following **static** community detection **methods** exist: - Spectral-based - Statistical inference - Optimization - Dynamics-based The following **dynamic** community detection **methods** exist: - Independent community detection and matching - Dependent community detection (evolutionary) - Simultaneous community detection on all snapshots - Dynamic community detection on temporal networks **Applications** of community detection include: - Criminal identification - Fraud detection - Criminal activities detection - Bot detection - Dynamics of epidemic spreading (dynamic) - Cancer/tumor detection - Tissue/organ detection - Evolution of influence (dynamic) - Astroturfing - Customer segmentation - Recommendation systems - Social network analysis (both) - Network summarization - Privary, group segmentation - Link prediction (both) - Community evolution prediction (dynamic, hot field) <br> <br> ## References - [Application Areas of Community Detection: A Review : PAPER](https://ieeexplore.ieee.org/document/8625349)

Community Detection on the GPU : NOTESSubhajit Sahu

This paper discusses a GPU implementation of the Louvain community detection algorithm. Louvain algorithm obtains hierachical communities as a dendrogram through modularity optimization. Given an undirected weighted graph, all vertices are first considered to be their own communities. In the first phase, each vertex greedily decides to move to the community of one of its neighbours which gives greatest increase in modularity. If moving to no neighbour's community leads to an increase in modularity, the vertex chooses to stay with its own community. This is done sequentially for all the vertices. If the total change in modularity is more than a certain threshold, this phase is repeated. Once this local moving phase is complete, all vertices have formed their first hierarchy of communities. The next phase is called the aggregation phase, where all the vertices belonging to a community are collapsed into a single super-vertex, such that edges between communities are represented as edges between respective super-vertices (edge weights are combined), and edges within each community are represented as self-loops in respective super-vertices (again, edge weights are combined). Together, the local moving and the aggregation phases constitute a stage. This super-vertex graph is then used as input fof the next stage. This process continues until the increase in modularity is below a certain threshold. As a result from each stage, we have a hierarchy of community memberships for each vertex as a dendrogram. Approaches to perform the Louvain algorithm can be divided into coarse-grained and fine-grained. Coarse-grained approaches process a set of vertices in parallel, while fine-grained approaches process all vertices in parallel. A coarse-grained hybrid-GPU algorithm using multi GPUs has be implemented by Cheong et al. which grabbed my attention. In addition, their algorithm does not use hashing for the local moving phase, but instead sorts each neighbour list based on the community id of each vertex. https://gist.github.com/wolfram77/7e72c9b8c18c18ab908ae76262099329

Survey for extra-child-process package : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating PageRank : POSTERSubhajit Sahu

This paper presents two algorithms for efficiently computing PageRank on dynamically updating graphs in a batched manner: DynamicLevelwisePR and DynamicMonolithicPR. DynamicLevelwisePR processes vertices level-by-level based on strongly connected components and avoids recomputing converged vertices on the CPU. DynamicMonolithicPR uses a full power iteration approach on the GPU that partitions vertices by in-degree and skips unaffected vertices. Evaluation on real-world graphs shows the batched algorithms provide speedups of up to 4000x over single-edge updates and outperform other state-of-the-art dynamic PageRank algorithms.

Abstract for IPDPS 2022 PhD Forum on Dynamic Batch Parallel Algorithms for Up...Subhajit Sahu

Fast Incremental Community Detection on Dynamic Graphs : NOTESSubhajit Sahu

In this paper, the authors describe two approaches for dynamic community detection using the CNM algorithm. CNM is a hierarchical, agglomerative algorithm that greedily maximizes modularity. They define two approaches: BasicDyn and FastDyn. BasicDyn backtracks merges of communities until each marked (changed) vertex is its own singleton community. FastDyn undoes a merge only if the quality of merge, as measured by the induced change in modularity, has significantly decreased compared to when the merge initially took place. FastDyn also allows more than two vertices to contract together if in the previous time step these vertices eventually ended up contracted in the same community. In the static case, merging several vertices together in one contraction phase could lead to deteriorating results. FastDyn is able to do this, however, because it uses information from the merges of the previous time step. Intuitively, merges that previously occurred are more likely to be acceptable later. https://gist.github.com/wolfram77/1856b108334cc822cdddfdfa7334792a

Can you ﬁx farming by going back 8000 years : NOTESSubhajit Sahu

HITS algorithm : NOTESSubhajit Sahu

1. Webpages tend to behave as authorities or hubs. 2. An authority represents an research thesis, and a hub represents an encyclopedia. 3. Each page has an authority and a hub score. 4. The graph is based on query, included pointed to and from pages. 5. Authority score is the sum of scores of all hubs pointing to it. 6. Hub score is the sum of scores of all authorities is pointing to. 7. Score are normalized with L2-norm in each iteration (root of sum of squares). 8. Needs to be performed at query time. 9. Two scores are returned, instead of just one. https://gist.github.com/wolfram77/3d9ef6c5a5b63f53caabce4812c7ea81

Basic Computer Architecture and the Case for GPUs : NOTESSubhajit Sahu

Computer architectures are facing issues: Memory latencies are far higher. Benefits from instruction level parallelism (ILP) is reducing. With increasing clock rates, power consumption is increasing. Increasing complexity with multi-stage pipelines, intermediate buffers, multi-level caches, out-of-order execution, branch prediction, ... GPUs are parallel computer architectures that are good at some tasks, not so good at others. Running routines with high arithmetic intensity with overlapped memory access is the preferred approach. They may be unsuitable for irregular algorithms, where it is difficult to get high efficiency due to the high latency of accesses. They are less versatile compared to CPUs, using SIMD parallelism, and are dense compute-wise (per currency). NVIDIA's CUDA programming model enables GPUs to be used for general-purpose computing, and hence the term GPGPU. GPU Architectural, Programming, and Performance Models presentation at PPoPP, 2010, Bangalore, India. By Prof. Kishore Kothapalli with Prof. P. J. Narayanan and Suryakant Patidar. https://gist.github.com/wolfram77/43a6660121eef45b78c10d4e652dad6c

Dynamic Batch Parallel Algorithms for Updating Pagerank : SLIDESSubhajit Sahu

For the IPDPS ParSocial event a presentation submission is required by 15th May. The event is on 3rd June. https://gist.github.com/wolfram77/51b15ca09eb28f6909673a2deb1a314d DYNAMIC BATCH PARALLEL ALGORITHMS FOR UPDATING PAGERANK Subhajit Sahut, Kishore Kothapallit and Dip Sankar Banerjeet tInternational Institute of Information Technology Hyderabad, India. tIndian Institute of Technology Jodhpur, India. subhajit.sahu@research. ,[email protected], [email protected] This work is partially supported by a grant from the Department of Science and Technology (DST), India, under the National Supercomputing Mission (NSM) R&D in Exascale initiative vide Ref. No: DST/NSM/R&D Exascale/2021/16. FACEBOOK 15 TAKING A PAGE OUT OF GOOGLE’S PLAYBOOK 10 STOP FAKE NEWS FROM GOING VIRAL PUBLISHED APR 2015 BY SALVADOR RODRIGUEZ Click-Gap: When is Facebook is driving disproportionate amounts of traffic to websites. Effort to rid fakes news from Facebook’s services. Is a website relying on Facebook to drive significant traffic, but not well ranked by the rest of the web? Also News Citation Graph. PAGERANK APPLICATIONS Ranking of websites. Measuring scientific impact of researchers. Finding the best teams and athletes. Ranking companies by talent concentration. Predicting road/foot traffic in urban spaces. Analysing protein networks. Finding the most authoritative news sources Identifying parts of brain that change jointly. Toxic waste management. PAGERANK APPLICATIONS Debugging complex software systems (Moni torRank) Finding the most original writers (BookRank) Finding topical authorities (TwitterRank) WHAT IS PAGERANK l—-d Plu = Cus + —— UCIiNny Pru u->v = (1-—d) x “us ( ) outdegy, PageRank is a lLink-analysis algorithm. By Larry Page and Sergey Brin in 1996. For ordering information on the web. Represented with a random-surfer model. Rank of a page is defined recursively. Calculate iteratively with power-iteration.

Are Satellites Covered in Gold Foil : NOTESSubhajit Sahu

Satellites are usually covered in aluminized polyimide. The yellowish gold color of polyimide with silver aluminium side facing in gives the satellite the appearance of being wrapped in gold. The material is called Multi-layer Insulation (MLI). It helps in radiative insulation of the onboard instruments of satellite. Gold is actually used in electrical contacts to prevent corrosion due to Ultra-violet light or X-rays. https://gist.github.com/wolfram77/8ae2de1a29caf1a2f84babed79943389

apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)apidays

From Rates and Bookings to AI Intelligence: The Evolution of Travel APIs Eric White, CTO at Eviivo apidays New York 2025 API Management for Surfing the Next Innovation Waves: GenAI and Open Banking Convene 360 Madison, New York May 14 & 15, 2025 ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

How Data Annotation Services Drive Innovation in Autonomous Vehicles.docxsofiawilliams5966

More Related Content

More from Subhajit Sahu (20)

Adjusting Bitset for graph : SHORT REPORT / NOTESSubhajit Sahu

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Subhajit Sahu

Algorithmic optimizations for Dynamic Monolithic PageRank (from STICD) : SHOR...Subhajit Sahu

Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTESSubhajit Sahu

Shared memory Parallelism (NOTES)Subhajit Sahu

A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu

Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu

Application Areas of Community Detection: A Review : NOTESSubhajit Sahu

Community Detection on the GPU : NOTESSubhajit Sahu

Survey for extra-child-process package : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating PageRank : POSTERSubhajit Sahu

Abstract for IPDPS 2022 PhD Forum on Dynamic Batch Parallel Algorithms for Up...Subhajit Sahu

Fast Incremental Community Detection on Dynamic Graphs : NOTESSubhajit Sahu

Can you ﬁx farming by going back 8000 years : NOTESSubhajit Sahu

HITS algorithm : NOTESSubhajit Sahu

Basic Computer Architecture and the Case for GPUs : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating Pagerank : SLIDESSubhajit Sahu

Are Satellites Covered in Gold Foil : NOTESSubhajit Sahu

Adjusting Bitset for graph : SHORT REPORT / NOTESSubhajit Sahu

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Subhajit Sahu

Algorithmic optimizations for Dynamic Monolithic PageRank (from STICD) : SHOR...Subhajit Sahu

Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTESSubhajit Sahu

Shared memory Parallelism (NOTES)Subhajit Sahu

A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu

Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu

Application Areas of Community Detection: A Review : NOTESSubhajit Sahu

Community Detection on the GPU : NOTESSubhajit Sahu

Survey for extra-child-process package : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating PageRank : POSTERSubhajit Sahu

Abstract for IPDPS 2022 PhD Forum on Dynamic Batch Parallel Algorithms for Up...Subhajit Sahu

Fast Incremental Community Detection on Dynamic Graphs : NOTESSubhajit Sahu

Can you ﬁx farming by going back 8000 years : NOTESSubhajit Sahu

HITS algorithm : NOTESSubhajit Sahu

Basic Computer Architecture and the Case for GPUs : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating Pagerank : SLIDESSubhajit Sahu

Are Satellites Covered in Gold Foil : NOTESSubhajit Sahu

Recently uploaded (20)

apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)apidays

How Data Annotation Services Drive Innovation in Autonomous Vehicles.docxsofiawilliams5966

BADS-MBA-Unit 1 that what data science and Interpretationsrishtisingh1813

Introduction to information about Data Structure.pptxtarrebulehora

Block chauin techncology by engineer saniya samreenShoyeb16

Brain, Bytes & Bias: ML Interview Questions You Can’t Miss!yashikanigam1

Preparing for a machine learning role? Get ready to tackle real-world problem-solving questions! From regression vs. classification to the ETL process, expect a deep dive into algorithms and data pipelines. Most live courses for professionals and best online professional certificates now include mock interviews and case studies to gear you up. Mastering these ML interview questions not only helps in cracking top tech interviews but also builds your confidence. At Tutort Academy, we train you with real-time scenarios and curated interview prep for success.

Embracing AI in Project Management: Final Insights & Future VisionKavehMomeni1

🚀 Unlock the Future of Project Management: Embracing AI – Final Session! This presentation is the culminating session (Session 13) of the "AI Applications in Project Management Workshop," hosted by OnAcademy and instructed by Kaveh Momeni, PMP®, COB & AI Lead at Chaharsotoon. Dive deep into "Embracing AI: Empowering Project Managers for an AI-Driven Future." We consolidate critical learnings from the entire workshop and provide a forward-looking perspective on how AI is revolutionizing project management. Inside, you'll discover: A Comprehensive Course Recap: Key takeaways from across the workshop, covering everything from knowledge management and predictive analytics to AI agents. Cutting-Edge AI Trends: The latest developments in AI impacting PM, including market growth, task automation, and the rise of autonomous project assistants. AI vs. Human Capabilities: Understanding the unique strengths of AI and the irreplaceable value of human intuition, strategic thinking, and leadership in PM. Optimizing Human-AI Collaboration: Practical models and frameworks for seamlessly integrating AI tools into PM workflows, emphasizing prompt engineering and growth mindsets. Cultivating AI-Ready Mindsets: Strategies to foster organizational cultures that embrace AI as an opportunity for innovation and competitive advantage. Essential Skills for Future-Proof PMs: Identifying the core competencies, including AI literacy, data-driven decision-making, and ethical AI governance, crucial for thriving in an AI-augmented world. Implementation Roadmap & Best Practices: A strategic guide for integrating AI into your projects and organizations, from pilot projects to establishing Centers of Excellence. Ethical & Practical Considerations: Navigating data quality, bias, transparency, regulatory compliance (like the EU AI Act), and human-centric values in AI-driven PM. A Vision for AI-Enabled PM: Envisioning AI as a strategic partner, leading to enhanced outcomes, sustainable competitive advantage, and the rise of the "AI-Augmented PM." Actionable Next Steps: Concrete steps you can take today to advance your AI journey in project management. Presented by Kaveh Momeni, a seasoned Project Manager with 15+ years of experience and extensive AI/ML certifications from leading institutions. This session is designed to empower project managers, team leaders, and decision-makers to confidently navigate and leverage AI for transformative project success. Perfect for anyone looking to understand the strategic implications of AI in project delivery and how to prepare for an AI-driven future.

apidays New York 2025 - To tune or not to tune by Anamitra Dutta Majumdar (In...apidays

To tune or not to tune : Benefits and security pitfalls of fine-tuning Anamitra Dutta Majumdar, Principal Engineer at Intuit apidays New York 2025 API Management for Surfing the Next Innovation Waves: GenAI and Open Banking Convene 360 Madison, New York May 14 & 15, 2025 ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

2. Conditional_Probabilkbkjbj,vj,v,ity.pptSalmitaSalman

Faces of the Future The Impact of a Data Science Course in Kerala.pdfjzyphoenix

Splunk_ITSI_Interview_Prep_Deck.pptx interviewwillmorekanan

artificial intelligence (1).pptx hgggfcgfchDevAnshGupta609215

apidays New York 2025 - API Platform Survival Guide by James Higginbotham (La...apidays

API Platform Survival Guide James Higginbotham, API Strategist at LaunchAny apidays New York 2025 API Management for Surfing the Next Innovation Waves: GenAI and Open Banking Convene 360 Madison, New York May 14 & 15, 2025 ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

GST PPT-2 pdf version.pdfhhhhvgehrhhhrhgrhrhrhbrhrhrhhhrhrhrhhrhrhrhrhhrhrhrhrajat367791

Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...Karim Baïna

Artificial Intelligence (AI) is reshaping societies and raising complex ethical, legal, and geopolitical questions. This talk explores the foundations and limits of Trustworthy AI through the lens of global frameworks such as the EU’s HLEG guidelines, UNESCO’s human rights-based approach, OECD recommendations, and NIST’s taxonomy of AI security risks. We analyze key principles like fairness, transparency, privacy, robustness, and accountability — not only as ideals, but in terms of their practical implementation and tensions. Special attention is given to real-world contexts such as Morocco’s deployment of 4,000 intelligent cameras and the country’s positioning in AI readiness indexes. These examples raise critical issues about surveillance, accountability, and ethical governance in the Global South. Rather than relying on standardized terms or ethical "checklists", this presentation advocates for a grounded, interdisciplinary, and context-aware approach to responsible AI — one that balances innovation with human rights, and technological ambition with social responsibility. This rich Trustworthy and Responsible AI frameworks context is a serious opportunity for Human and Social Sciences Researchers : either operate as gatekeepers, reinforcing existing ethical constraints, or become revolutionaries, pioneering new paradigms that redefine how AI interacts with society, knowledge production, and policymaking ?

Chapter 2 protozoa and their phylum to gethamzagobena8

Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvhsaifalroby72

Blue Dark Professional Geometric Business Project Presentation .pdfmohammadhaidarayoobi

Role_Based_Permissions_Kick-off_Deck_202203.pptxSystemsBenya

GROUP 7 CASE STUDY Real Life Incident.pptxmardoglenn21

apidays New York 2025 - The Evolution of Travel APIs by Eric White (Eviivo)apidays

How Data Annotation Services Drive Innovation in Autonomous Vehicles.docxsofiawilliams5966

BADS-MBA-Unit 1 that what data science and Interpretationsrishtisingh1813

Introduction to information about Data Structure.pptxtarrebulehora

Block chauin techncology by engineer saniya samreenShoyeb16

Brain, Bytes & Bias: ML Interview Questions You Can’t Miss!yashikanigam1

Embracing AI in Project Management: Final Insights & Future VisionKavehMomeni1

apidays New York 2025 - To tune or not to tune by Anamitra Dutta Majumdar (In...apidays

2. Conditional_Probabilkbkjbj,vj,v,ity.pptSalmitaSalman

Faces of the Future The Impact of a Data Science Course in Kerala.pdfjzyphoenix

Splunk_ITSI_Interview_Prep_Deck.pptx interviewwillmorekanan

artificial intelligence (1).pptx hgggfcgfchDevAnshGupta609215

apidays New York 2025 - API Platform Survival Guide by James Higginbotham (La...apidays

GST PPT-2 pdf version.pdfhhhhvgehrhhhrhgrhrhrhbrhrhrhhhrhrhrhhrhrhrhrhhrhrhrhrajat367791

Ethical Frameworks for Trustworthy AI – Opportunities for Researchers in Huma...Karim Baïna

Chapter 2 protozoa and their phylum to gethamzagobena8

Lec 12.pdfghhjjhhjkkkkkkkkkkkjfcvhiiugcvvhsaifalroby72

Blue Dark Professional Geometric Business Project Presentation .pdfmohammadhaidarayoobi

Role_Based_Permissions_Kick-off_Deck_202203.pptxSystemsBenya

GROUP 7 CASE STUDY Real Life Incident.pptxmardoglenn21

Experiments with Primitive operations : SHORT REPORT / NOTES

1. Multiply with different modes (map) Sequential OpenMP CUDA 1. Performance of sequential execution based vs OpenMP based vector multiply. 2. Comparing various launch configs for CUDA based vector multiply. Sum with different storage types (reduce) float bfloat16 1. Performance of vector element sum using float vs bfloat16 as the storage type. Sum with different modes (reduce) Sequential OpenMP CUDA (memcpy, in-place) 1. Performance of sequential execution based vs OpenMP based vector element sum. 2. Performance of memcpy vs in-place based CUDA based vector element sum. 3. Comparing various launch configs for CUDA based vector element sum (memcpy). 4. Comparing various launch configs for CUDA based vector element sum (in-place). Sum with in-place strategies of CUDA mode (reduce) sum-loop sum-reduce one-loop atomic-add block-loop template, next-pow2 launch one-reduce, next-pow2 launch block-loop template, prev. pow2 launch one-reduce, prev-pow2 launch grid-loop 1. Comparing various launch configs for CUDA based vector element sum (in-place).

Experiments with Primitive operations : SHORT REPORT / NOTES

Recommended

More Related Content

More from Subhajit Sahu (20)

Recently uploaded (20)

Experiments with Primitive operations : SHORT REPORT / NOTES